We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
There was an error while loading. Please reload this page.
1 parent e100990 commit 60c917bCopy full SHA for 60c917b
README.md
@@ -4,7 +4,7 @@ Reward Model training framework for LLM RLHF. For in-depth understanding of Rewa
4
* Inference
5
```python
6
from transformers import AutoModelForSequenceClassification, AutoTokenizer
7
-MODEL = ""
+MODEL = "shahules786/Reward-model-gptneox-410M"
8
9
model = AutoModelForSequenceClassification.from_pretrained(MODEL)
10
tokenizer = AutoTokenizer.from_pretrained(MODEL)
0 commit comments