Skip to content
Change the repository type filter

All

    Repositories list

    • ZClip

      Public
      Official implementation of the paper: "ZClip: Adaptive Spike Mitigation for LLM Pre-Training".
      Python
      Apache License 2.0
      1014533Updated Nov 20, 2025Nov 20, 2025
    • perftest

      Public
      Shell
      0000Updated Nov 3, 2025Nov 3, 2025
    • Official implementation of the "Variance control via weight rescaling in LLM pretraining" paper.
      Python
      Apache License 2.0
      0500Updated Jun 29, 2025Jun 29, 2025
    • Official implementation of the paper: "A Refined Analysis of Massive Activations in LLMs".
      Python
      MIT License
      31100Updated May 21, 2025May 21, 2025
    • raydp

      Public
      RayDP provides simple APIs for running Spark on Ray and integrating Spark with AI libraries.
      Scala
      Apache License 2.0
      79000Updated Apr 22, 2025Apr 22, 2025
    • Ongoing research training transformer models at scale
      Python
      Other
      3.6k100Updated Mar 21, 2025Mar 21, 2025
    • vLLM’s reference system for K8S-native cluster-wide deployment with community-driven performance optimization
      Python
      Apache License 2.0
      370000Updated Mar 14, 2025Mar 14, 2025
    • lingua

      Public
      Meta Lingua: a lean, efficient, and easy-to-hack codebase to research LLMs.
      Python
      BSD 3-Clause "New" or "Revised" License
      268000Updated Feb 6, 2025Feb 6, 2025
    • Fast and memory-efficient exact attention
      Python
      BSD 3-Clause "New" or "Revised" License
      2.4k000Updated Jan 16, 2025Jan 16, 2025
    • Python
      0000Updated Jan 15, 2025Jan 15, 2025
    • blt

      Public
      Code for BLT research paper
      Python
      BSD 3-Clause "New" or "Revised" License
      189000Updated Jan 10, 2025Jan 10, 2025
    • Python
      0000Updated Nov 1, 2024Nov 1, 2024
    • netifaces

      Public
      C
      MIT License
      89000Updated Oct 7, 2024Oct 7, 2024
    • The AdEMAMix Optimizer: Better, Faster, Older.
      Python
      MIT License
      10000Updated Oct 7, 2024Oct 7, 2024
    • wheels

      Public
      0000Updated Oct 4, 2024Oct 4, 2024
    • beam

      Public
      Apache Beam is a unified programming model for Batch and Streaming data processing.
      Java
      Apache License 2.0
      4.5k000Updated Oct 4, 2024Oct 4, 2024
    • lighteval

      Public
      LightEval is a lightweight LLM evaluation suite that Hugging Face has been using internally with the recently released LLM data processing library datatrove and…
      Python
      MIT License
      429000Updated Aug 5, 2024Aug 5, 2024
    • Bazel Python Rules
      Starlark
      Apache License 2.0
      672000Updated Jul 31, 2024Jul 31, 2024
    • C++
      2000Updated Jul 29, 2024Jul 29, 2024