Lucas Beyer (Twitter: @lucasbey)
- Transformer tutorial from "07.2024 ICML DMLR workshop" keynote (must-read): slides - Video
- Other talks collection
Prof. Jia-Bin Huang's videos:
3Blue1Brown's videos starting from Large Language Models explained briefly, then from chapter 5 to 7:
-
Understanding and Coding the Self-Attention Mechanism of Large Language Models From Scratch from Sebastian Raschka
- Transformer Puzzles from Professor Alexander Rush. Also check out his Puzzle collection.




