Similar Tracks
Stanford CS336: Language Modeling from Scratch | Spring 2025 | Architectures, Hyperparameters
Stanford Online
Mixture of Experts: How LLMs Are Getting Smarter Without Getting Slower (LLaMA 4, DeepSeek)
Julia Turc
Stanford CS336 Language Modeling from Scratch | Spring 2025 | Overview and Tokenization
Stanford Online