README.md: 12 additions & 2 deletions
@@ -10,7 +10,11 @@
-ether0: a scientific reasoning model, dataset, and reward functions for chemistry.
+_ether0: a scientific reasoning model, dataset, and reward functions for chemistry._
+
+# ether0 Reward Model
+
+This repo contains the reward model for evaluating ether0 and similar models, along with utilities for working with the verifiable rewards in [our benchmark](https://huggingface.co/datasets/futurehouse/ether0).
 
 ## Overview
 
@@ -38,6 +42,12 @@ This repo contains several packages:
 - `ether0.remotes`: server code for ether0 reward functions involving
   exotic packages and/or third party models.
 
+> [!NOTE]
+> This repo does not contain training code,
+> although you can find open source repositories like [NeMo-RL](https://github.com/NVIDIA/NeMo-RL)
+> or [Hugging Face TRL](https://github.com/huggingface/trl)
+> that can do the SFT and RL phases of training.
+
 ### Open Weights
 
 Please see our open-source weights on Hugging Face: https://huggingface.co/futurehouse/ether0