Sora Explained - Part2 - The Encoders of Vision Model