Marimo LLMs From Scratch

Implement a ChatGPT-like LLM in PyTorch from scratch, step by step with marimo.

About • Getting Started • Chapters • Why Marimo • License

📖 About The Project

This repository provides interactive marimo notebook implementations of the best-selling book "Build a Large Language Model From Scratch" by Sebastian Raschka.

We focus on providing a seamless experience for readers of the Traditional Chinese Edition (讓 AI 好好說話！從頭打造 LLM 實戰秘笈), featuring:

Dual Language Support: Code comments and explanations in both English and Traditional Chinese.
Interactive Learning: Visualizations of attention mechanisms and Transformers built with marimo's UI elements.

Original Source: LLMs-from-scratch by Sebastian Raschka

🚀 Getting Started

Prerequisites

Python 3.12+
pip

Installation

Clone the repository:

git clone https://github.com/thliang01/marimo-LLMs-from-scratch.git
cd marimo-LLMs-from-scratch

Install dependencies:
```
pip install -r requirements.txt
```

🐱 Twinkle's Tip: Running Marimo

"Wait! Don't use python filename.py!"

Marimo files look like standard Python scripts, but to see the magic (graphs, sliders, and interactivity), you need to run them with the marimo editor.

To open a notebook in edit mode:

marimo edit ch3/marimo_ch03.py           # English version
marimo edit ch3/marimo_ch03_zh_tw.py     # Traditional Chinese version

To run a notebook as an app:

marimo run ch3/marimo_ch03.py

📂 Repository Structure

marimo-LLMs-from-scratch/
├── ch3/                           # Chapter 3: Coding Attention Mechanisms
│   ├── ch03.ipynb                # Original Jupyter notebook (English)
│   ├── marimo_ch03.py            # Marimo notebook (English)
│   ├── marimo_ch03_zh_tw.py      # Marimo notebook (Traditional Chinese)
│   └── README.md                 # Chapter 3 documentation
├── requirements.txt              # Python dependencies
└── README.md                     # This file

📚 Chapters

Chapter 3: Coding Attention Mechanisms

Covers the implementation of attention mechanisms, the core engine of LLMs:

Simple self-attention without trainable weights
Scaled dot-product attention with Q, K, V matrices
Causal attention with masking
Multi-head attention
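
To give a feel for the first and third topics above, here is a minimal sketch of self-attention without trainable weights, with an optional causal mask. It is not the notebook's code: it uses only the Python standard library instead of PyTorch so it runs anywhere, the function names `softmax` and `simple_self_attention` are made up for this example, and the input vectors are toy values. The Q, K, V projections, the scaling by the square root of the key dimension, and multi-head splitting are left out on purpose.

```python
import math


def softmax(row):
    """Numerically stable softmax over a list of floats."""
    m = max(row)
    exps = [math.exp(x - m) for x in row]
    total = sum(exps)
    return [e / total for e in exps]


def simple_self_attention(inputs, causal=False):
    """Self-attention without trainable weights (illustrative helper, not from the repo).

    inputs: list of token embeddings, each a list of floats of equal length.
    Returns (attention_weights, context_vectors).
    """
    dim = len(inputs[0])
    # Attention scores: dot product between every pair of token embeddings.
    scores = [[sum(a * b for a, b in zip(q, k)) for k in inputs] for q in inputs]
    if causal:
        # Mask future positions (j > i) with -inf so softmax assigns them zero weight.
        scores = [
            [s if j <= i else -math.inf for j, s in enumerate(row)]
            for i, row in enumerate(scores)
        ]
    # Normalize each row of scores into weights that sum to 1.
    weights = [softmax(row) for row in scores]
    # Each context vector is the weighted sum of all input vectors.
    context = [
        [sum(w * x[d] for w, x in zip(row, inputs)) for d in range(dim)]
        for row in weights
    ]
    return weights, context


# Toy embeddings for three tokens (assumed values, for illustration only).
tokens = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
causal_weights, causal_context = simple_self_attention(tokens, causal=True)
for row in causal_weights:
    print([round(w, 3) for w in row])
```

With `causal=True`, every weight above the diagonal is zero, so each token attends only to itself and to earlier tokens. The notebooks in `ch3/` build the same idea with PyTorch tensors and add trainable weight matrices.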

📁 See ch3/README.md for detailed documentation

💡 Why Marimo?

Marimo is a next-generation Python notebook that offers several advantages:

✅ Reactive: Cells automatically update when dependencies change
✅ Reproducible: No hidden state, deterministic execution order
✅ Git-friendly: Notebooks are stored as .py files
✅ Interactive: Rich UI elements and real-time feedback
✅ Executable: Notebooks also run as plain Python scripts (without the interactive UI)
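
The "Reactive" point is easiest to see in a toy model. The sketch below is not marimo's implementation and does not use the marimo API. `ToyNotebook` and its methods are hypothetical names invented for this example, and it assumes cells are registered in dependency order. It only illustrates the idea that changing one value re-runs every cell that depends on it, directly or indirectly.

```python
class ToyNotebook:
    """Toy model of reactive cells (hypothetical, not marimo's actual design)."""

    def __init__(self):
        self.values = {}  # variable name -> current value
        self.cells = []   # (output name, function, input names), in registration order

    def set(self, name, value):
        """Assign a variable and re-run every cell that depends on it."""
        self.values[name] = value
        self._propagate({name})

    def cell(self, output, inputs, fn):
        """Register a cell computing `output` from `inputs`, and run it once."""
        self.cells.append((output, fn, tuple(inputs)))
        self.values[output] = fn(*(self.values[i] for i in inputs))

    def _propagate(self, changed):
        # Assumes cells were registered in dependency order, so one pass suffices.
        for output, fn, inputs in self.cells:
            if changed.intersection(inputs):
                self.values[output] = fn(*(self.values[i] for i in inputs))
                changed.add(output)  # downstream cells must update too


nb = ToyNotebook()
nb.set("d_out", 8)
nb.set("num_heads", 2)
nb.cell("head_dim", ["d_out", "num_heads"], lambda d, h: d // h)
nb.cell("summary", ["head_dim"], lambda hd: f"each head has {hd} dims")

print(nb.values["summary"])  # each head has 4 dims
nb.set("num_heads", 4)       # both dependent cells re-run automatically
print(nb.values["summary"])  # each head has 2 dims
```

In a real marimo notebook you do not register dependencies by hand. marimo infers them from the variables each cell defines and references.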

🤝 Contributing

Contributions are welcome! Please feel free to submit issues or pull requests.

📄 License

This project is licensed under the Apache License 2.0. It is based on the original work from LLMs-from-scratch by Sebastian Raschka.

📖 References

Original Repository: https://github.com/rasbt/LLMs-from-scratch
Book: Build a Large Language Model From Scratch by Sebastian Raschka
Marimo: https://marimo.io

🙏 Acknowledgments

Special thanks to Sebastian Raschka for creating the original LLMs-from-scratch materials and book.
