A friendly introduction to deep reinforcement learning, Q-networks and policy gradients