Increasing Training Stability with Double DQNs