Attention and Transformer | Self-Supervised Learning | CS.601.475 Machine Learning @ JHU -- Part 2