Vision Transformer(ViT) Architecture Explained | Deep Learning Clips