Reinforcement Learning Policy Approximation By Behavior Trees