Skip to content
Change the repository type filter

All

    Repositories list

    • TTRL

      Public
      [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
      Python
      MIT License
      811.1k170Updated Apr 15, 2026Apr 15, 2026
    • P1-VL

      Public
      P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads
      21500Updated Feb 11, 2026Feb 11, 2026
    • FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
      Python
      Apache License 2.0
      66620Updated Jan 26, 2026Jan 26, 2026
    • [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
      Python
      MIT License
      1051.6k461Updated Jan 6, 2026Jan 6, 2026
    • P1

      Public
      P1: Mastering Physics Olympiads with Reinforcement Learning
      48430Updated Dec 29, 2025Dec 29, 2025
    • The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
      Python
      1543520Updated Jul 11, 2025Jul 11, 2025
    • PRIME

      Public
      Scalable RL solution for advanced reasoning of language models
      Python
      Apache License 2.0
      1111.9k82Updated Mar 18, 2025Mar 18, 2025
    • Repo of paper "Free Process Rewards without Process Labels"
      Python
      Apache License 2.0
      11171120Updated Mar 14, 2025Mar 14, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.