|
1 | | -# RM-Gallery |
| 1 | +<div align="center"> |
2 | 2 |
|
3 | | -A unified platform for building, evaluating, and applying reward models. |
| 3 | +<picture> |
| 4 | + <source media="(prefers-color-scheme: dark)" srcset="https://readme-typing-svg.demolab.com?font=Inter&weight=700&size=72&duration=1&pause=1000000&color=22D3EE&background=FFFFFF00&center=true&vCenter=true&repeat=false&width=280&height=100&lines=RM"> |
| 5 | + <img alt="RM" src="https://readme-typing-svg.demolab.com?font=Inter&weight=700&size=72&duration=1&pause=1000000&color=22D3EE&background=FFFFFF00&center=true&vCenter=true&repeat=false&width=280&height=100&lines=RM"> |
| 6 | +</picture><picture> |
| 7 | + <source media="(prefers-color-scheme: dark)" srcset="https://readme-typing-svg.demolab.com?font=Inter&weight=700&size=72&duration=1&pause=1000000&color=8B5CF6&background=FFFFFF00&center=true&vCenter=true&repeat=false&width=500&height=100&lines=Gallery"> |
| 8 | + <img alt="Gallery" src="https://readme-typing-svg.demolab.com?font=Inter&weight=700&size=72&duration=1&pause=1000000&color=8B5CF6&background=FFFFFF00&center=true&vCenter=true&repeat=false&width=500&height=100&lines=Gallery"> |
| 9 | +</picture> |
| 10 | + |
| 11 | +<h3>A unified platform for building, evaluating, and applying reward models.</h3> |
4 | 12 |
|
5 | 13 | [](https://pypi.org/project/rm-gallery/) |
6 | 14 | [](https://pypi.org/project/rm-gallery/) |
7 | 15 | [](https://modelscope.github.io/RM-Gallery/) |
8 | 16 |
|
9 | 17 | [Documentation](https://modelscope.github.io/RM-Gallery/) | [Examples](./examples/) | [中文](./README_zh.md) |
10 | 18 |
|
| 19 | +</div> |
| 20 | + |
| 21 | +## News |
| 22 | + |
| 23 | +- **2025-10-20** - [Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling](https://arxiv.org/abs/2510.17314) |
| 24 | +- **2025-10-17** - [Taming the Judge: Deconflicting AI Feedback for Stable Reinforcement Learning](https://arxiv.org/abs/2510.15514) |
| 25 | +- **2025-07-09** - Released RM-Gallery v0.1.0 on [PyPI](https://pypi.org/project/rm-gallery/) |
| 26 | + |
11 | 27 | ## Installation |
12 | 28 |
|
13 | 29 | RM-Gallery requires Python 3.10 or higher (< 3.13). |
@@ -95,12 +111,6 @@ Evaluate models on standard benchmarks: |
95 | 111 | - [Training Guide](https://modelscope.github.io/RM-Gallery/tutorial/training_rm/overview/) |
96 | 112 | - [API Reference](https://modelscope.github.io/RM-Gallery/api_reference/) |
97 | 113 |
|
98 | | -## News |
99 | | - |
100 | | -- **2025-10-20** - [Auto-Rubric: Learning to Extract Generalizable Criteria for Reward Modeling](https://arxiv.org/abs/2510.17314) |
101 | | -- **2025-10-17** - [Taming the Judge: Deconflicting AI Feedback for Stable Reinforcement Learning](https://arxiv.org/abs/2510.15514) |
102 | | -- **2025-07-09** - Released RM-Gallery v0.1.0 on [PyPI](https://pypi.org/project/rm-gallery/) |
103 | | - |
104 | 114 | ## Contributing |
105 | 115 |
|
106 | 116 | We welcome contributions! Please install pre-commit hooks before submitting pull requests: |
|