🚀 5-dollar-llm: Setup & Speedrun Guide

Welcome to the 5-dollar-llm repository! This project is dedicated to pushing the limits of training efficiency for 151M parameter models on consumer hardware.

Whether you are a human researcher or an AI agent, this guide will help you set up the environment and start competing on the leaderboard.

🛠️ 1. Environment Setup

We recommend using Python 3.10+.

Clone the Repository

git clone https://github.com/Open-Superintelligence-Lab/5-dollar-llm
cd 5-dollar-llm

Install Dependencies

pip install -r requirements.txt

Option A: Quick Start (40M Tokens) - Recommended for Speedruns

Perfect for hitting the 4.5 and 3.5 loss milestones. Downloads in seconds.

python3 -c "
from datasets import load_dataset
import os
print('Downloading 40M Token Subset...')
ds = load_dataset('vukrosic/blueberry-1B-pretrain', split='train[:20000]')
os.makedirs('processed_data/speedrun_40M', exist_ok=True)
ds.save_to_disk('processed_data/speedrun_40M')
print('✅ Speedrun Data Ready!')
"

Option B: Full Dataset (1B Tokens) - Recommended for the Marathon

Required for the 1B token world record challenge.

python3 -c "
from datasets import load_dataset
import os
print('Downloading 1B Pretraining Data...')
ds = load_dataset('vukrosic/blueberry-1B-pretrain')
os.makedirs('processed_data/pretrain_1B', exist_ok=True)
ds.save_to_disk('processed_data/pretrain_1B')
print('✅ Full Data Ready!')
"

🏎️ 2. The Speedrun Leaderboard

Our community competes to reach specific training loss milestones in the shortest time possible on a single NVIDIA RTX 4090.

⚡ Speedrun 1: The 4.5 Loss Challenge

Goal: Reach a training loss of ≤ 4.5 as quickly as possible.
Purpose: Ideal for quick architecture tests, testing new optimizers, or rapid hyperparameter searches.
Rules: Must follow the Official Speedrun Rules.
- Expected Time: ~2-3 minutes.

Command:

python train_llm.py \
    --dataset_path processed_data/speedrun_40M \
    --target_train_loss 4.5 \
    --experiment_name arch_test_v1

⚡ The 3.5 Loss Speedrun

Goal: Reach a training loss of ≤ 3.5.
Purpose: Used for deeper research. Smaller gains in the 4.5 speedrun are verified here to ensure they don't collapse or plateau early.
Expected Time: ~9 minutes.

Command:

python train_llm.py \
    --dataset_path processed_data/speedrun_40M \
    --target_train_loss 3.5 \
    --experiment_name deep_research_v1

📊 3. Iterating & Research

Configs: Modify configs/llm_config.py to change configs (keep the parameter size around 151M), learning rates, or optimization schedules.
Model: Edit models/llm.py to experiment with new attention mechanisms or layer types.
Logs: Check the logs/ directory for detailed training metrics.
Baseline Measurement: Before submitting any changes, you must measure the baseline on your setup and compare it with your improvements.
GPU Memory: If the model doesn't fit on your GPU, you can reduce the model size (e.g., batch_size or n_layer) for faster local iteration.
Leaderboard: See docs/LEADERBOARD.md for current world records and submission instructions.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🚀 5-dollar-llm: Setup & Speedrun Guide

🛠️ 1. Environment Setup

Clone the Repository

Install Dependencies

Option A: Quick Start (40M Tokens) - Recommended for Speedruns

Option B: Full Dataset (1B Tokens) - Recommended for the Marathon

🏎️ 2. The Speedrun Leaderboard

⚡ Speedrun 1: The 4.5 Loss Challenge

⚡ The 3.5 Loss Speedrun

📊 3. Iterating & Research

FilesExpand file tree

SETUP_INSTRUCTIONS.md

Latest commit

History

SETUP_INSTRUCTIONS.md

File metadata and controls

🚀 5-dollar-llm: Setup & Speedrun Guide

🛠️ 1. Environment Setup

Clone the Repository

Install Dependencies

Option A: Quick Start (40M Tokens) - Recommended for Speedruns

Option B: Full Dataset (1B Tokens) - Recommended for the Marathon

🏎️ 2. The Speedrun Leaderboard

⚡ Speedrun 1: The 4.5 Loss Challenge

⚡ The 3.5 Loss Speedrun

📊 3. Iterating & Research