Self-Attention with Relative Position Representations – Paper explained