Skip to content
Change the repository type filter

All

    Repositories list

    • Merge of megatron-train, autoexperiment and oellm_pretrain.
      Python
      26101Updated Feb 16, 2026Feb 16, 2026
    • Repo for post-training LLMs
      Python
      0201Updated Feb 13, 2026Feb 13, 2026
    • oellm-cli

      Public
      Python
      1840Updated Feb 12, 2026Feb 12, 2026
    • Curated Repository of LLM (Pre-)Training Data
      Shell
      26340Updated Feb 12, 2026Feb 12, 2026
    • OpenJury

      Public
      Evaluating LLM with swappable judges: local, remote, openrouter on multiple benchmarks.
      Python
      3701Updated Feb 11, 2026Feb 11, 2026
    • Python
      0000Updated Feb 6, 2026Feb 6, 2026
    • Setup environment variables and slurm configuration automatically on EuroHPC clusters
      Shell
      0100Updated Jan 29, 2026Jan 29, 2026
    • Ongoing research training transformer models at scale
      Python
      3.6k000Updated Jan 28, 2026Jan 28, 2026
    • Ongoing research training transformer models at scale
      Python
      3.6k000Updated Jan 27, 2026Jan 27, 2026
    • notebooks

      Public
      Jupyter Notebook
      0100Updated Jan 19, 2026Jan 19, 2026
    • Shell
      0000Updated Jan 7, 2026Jan 7, 2026
    • MegaTron open-sci fork
      Python
      3.6k000Updated Oct 14, 2025Oct 14, 2025
    • Python
      0000Updated Oct 2, 2025Oct 2, 2025
    • Evaluate a list of models and tasks
      Python
      2010Updated Aug 18, 2025Aug 18, 2025
    • Python
      0000Updated Jul 29, 2025Jul 29, 2025
    • MultiSynt

      Public
      MultiSynt: an open multilingual synthetic dataset for LLM pre-training.
      0010Updated Jun 2, 2025Jun 2, 2025
    • Taskboard

      Public
      01960Updated Apr 14, 2025Apr 14, 2025
    • Report slurm compute usage on Discord automatically every week.
      Shell
      0010Updated Mar 17, 2025Mar 17, 2025