What is Q* | Reinforcement learning 101 & Hypothesis