MDP & RL: Value Function and Bellman Equation - Reinforcement Learning in Finance