Reinforcement Learning for Control of Inherently Unstable Robots