Harri Valpola: System 2 AI and Planning in Model-Based Reinforcement Learning