Skip to content

NoSaaS-me/5-dollar-llm

 
 

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

591 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

5-Dollar LLM (Blueberry 151M)

Watch the video

🎥 Watch our introduction video to learn more about the project!

Check out our speedrun leaderboard!

🗺️ Open Superintelligence Lab Roadmap

Our goals:

  1. GPT-1 Level by Dec 20 2025 ✓ Watch
  2. GPT-2 Level by Jan 20 2026
  3. GPT-3 Level by Feb 20 2026
  4. Top 150 in LMArena (GPT-4o-mini level) by April 2026
  5. Top 50 by Dec 2026
  6. Top 10 by April 2027
  7. We could aim for Top 1 by 2028, TBD

We will partner for compute while keeping all research/engineering/code fully open source.


🧪 Research Tasks

Current research projects and experiments:


It is best to read the Quick Start directly in the tasks linked above.

🏎️ Quick Start

git clone https://github.com/Open-Superintelligence-Lab/5-dollar-llm
cd 5-dollar-llm
pip install -r requirements.txt
python data/download_hf_data.py   # Downloads 40M token subset
python train_llm.py --target_train_loss 4.5

👉 Full Setup Guide | Leaderboard | Contributing Guide


🤝 Partners & Support

We will partner with compute providers while keeping all research/engineering/code fully open source.

Potential partners include: Hugging Face, NVIDIA, Microsoft, Google, Amazon, Meta, IBM, Oracle, Alibaba, Tencent, Huawei, Baidu, CoreWeave, Lambda Labs, Hyperbolic, Stability AI, OpenAI, Anthropic, xAI, Cohere, Mistral AI, Graphcore, Tenstorrent, Intel, AMD, Dell Technologies, ai2, a16z, Sequoia Capital, and more.

If you or someone you know has extensive research experience and wants to contribute to this open source initiative, contact me.

About

Train LLM from scratch for $5 USD - Research.

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages

  • Python 100.0%