Reinforcement learning with Acrobot from OpenAI Gym, with visualization
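
As a rough illustration of the setup named above, the following is a minimal sketch of the agent-environment loop that a learning algorithm would plug into, collecting rendered frames for visualization. It is not the original implementation. It assumes the Gymnasium-style API (`reset()` returns `(obs, info)`, `step()` returns five values), which older OpenAI Gym releases do not follow; `run_episode`, `random_policy`, and `demo` are hypothetical names introduced here, and the random policy is only a placeholder for a learned one.

```python
import random


def run_episode(env, policy, max_steps=500):
    """Roll out one episode and return (total_reward, frames).

    Assumes a Gymnasium-style environment:
      reset() -> (obs, info)
      step(a) -> (obs, reward, terminated, truncated, info)
    """
    obs, _info = env.reset()
    total_reward = 0.0
    frames = []
    for _ in range(max_steps):
        action = policy(obs)
        obs, reward, terminated, truncated, _info = env.step(action)
        total_reward += reward
        # render() returns an image only if the env was created with
        # render_mode="rgb_array"; otherwise it may return None.
        frame = env.render()
        if frame is not None:
            frames.append(frame)
        if terminated or truncated:
            break
    return total_reward, frames


def random_policy(_obs, n_actions=3):
    # Acrobot-v1 has three discrete actions (torque -1, 0, +1 on the joint).
    # Placeholder only: a trained agent would choose actions from obs.
    return random.randrange(n_actions)


def demo():
    # Assumption: the gymnasium package is installed.
    import gymnasium as gym

    env = gym.make("Acrobot-v1", render_mode="rgb_array")
    total_reward, frames = run_episode(env, random_policy)
    env.close()
    print("return:", total_reward, "frames collected:", len(frames))
```

Calling `demo()` runs a single random episode. Acrobot-v1 gives a reward of -1 per step until the tip swings above the target height, so a learning agent improves by making the return less negative; the collected frames can be saved as a video or animated to inspect the behavior.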