3D MindMap about large language models, transformer networks, pretraining, rotary embeddings...