Information Theoretic Model Predictive Q-Learning