Commit 45f5ce6

fix: improve local eval config and doc (#1528)
Signed-off-by: Yuki Huang <[email protected]>
1 parent 6fc917f commit 45f5ce6

2 files changed: +12 -2 lines changed

docs/guides/eval.md

Lines changed: 6 additions & 2 deletions
@@ -64,8 +64,12 @@ uv run python examples/run_eval.py --config path/to/custom_config.yaml
 # Run evaluation script on one of the supported benchmarks (e.g., GPQA)
 uv run python examples/run_eval.py --config examples/configs/evals/gpqa_eval.yaml
 
-# Run evaluation script with a local dataset that is prefetched as a csv file.
-uv run python examples/run_eval.py --config examples/configs/evals/local_eval.yaml
+# Run evaluation script with a local dataset where the problem and solution keys are "Question" and "Answer" respectively.
+uv run python examples/run_eval.py \
+    --config examples/configs/evals/local_eval.yaml \
+    data.dataset_name=/path/to/local/dataset \
+    data.problem_key=Question \
+    data.solution_key=Answer
 
 # Override specific config values via command line
 # Example: Evaluation of DeepScaleR-1.5B-Preview on MATH-500 using 8 GPUs
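
The command in this hunk passes dotted `key=value` pairs after `--config` to override nested config values. The diff does not show how `run_eval.py` parses them, so the following is only a minimal sketch of the general idea: `apply_overrides` is a hypothetical helper, the base values `"problem"` and `"expected_answer"` are made up for illustration, and values are kept as plain strings.

```python
from typing import Any


def apply_overrides(config: dict[str, Any], overrides: list[str]) -> dict[str, Any]:
    """Apply dotted `key.path=value` overrides to a nested dict in place.

    Hypothetical helper for illustration; not the parser used by run_eval.py.
    """
    for item in overrides:
        dotted_key, _, value = item.partition("=")
        *parents, leaf = dotted_key.split(".")
        node = config
        for name in parents:
            # Create intermediate sections that the base config lacks.
            node = node.setdefault(name, {})
        node[leaf] = value  # Values stay strings in this sketch.
    return config


# Base config with placeholder key names (assumed, not taken from eval.yaml).
base = {
    "generation": {"model_name": "Qwen/Qwen2.5-7B-Instruct"},
    "data": {"problem_key": "problem", "solution_key": "expected_answer"},
}

merged = apply_overrides(
    base,
    [
        "data.dataset_name=/path/to/local/dataset",
        "data.problem_key=Question",
        "data.solution_key=Answer",
    ],
)
print(merged["data"])
```

Only the keys named on the command line change; everything else in the base config, such as `generation.model_name`, is left as loaded.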

examples/configs/evals/local_eval.yaml

Lines changed: 6 additions & 0 deletions
@@ -1,11 +1,17 @@
 # Evaluation Configuration from local files.
+# Other settings (e.g., eval metrics, vLLM, cluster, etc.) are inherited from examples/configs/evals/eval.yaml.
 defaults: "eval.yaml"
 
 generation:
   model_name: "Qwen/Qwen2.5-7B-Instruct"
+  vllm_cfg:
+    max_model_len: 2048
 
 data:
   prompt_file: "examples/prompts/cot.txt"
+  system_prompt_file: null
+  # You can also use custom datasets from a local dataset or HuggingFace.
+  # e.g., /path/to/local/dataset or hf_org/hf_dataset_name (HuggingFace)
   dataset_name: "https://openaipublic.blob.core.windows.net/simple-evals/math_500_test.csv"
   problem_key: "Question"
   solution_key: "Answer"
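
The `problem_key` and `solution_key` settings name the dataset columns that hold the question and the reference answer. As a rough illustration of that mapping (not the repository's actual loader), the sketch below reads CSV text and selects the two columns by configurable key. `load_pairs` and the sample rows are hypothetical.

```python
import csv
import io


def load_pairs(csv_text: str, problem_key: str, solution_key: str) -> list[dict[str, str]]:
    """Read CSV text and map the configured columns to problem/solution pairs.

    Hypothetical helper for illustration only.
    """
    reader = csv.DictReader(io.StringIO(csv_text))
    return [
        {"problem": row[problem_key], "solution": row[solution_key]}
        for row in reader
    ]


# Made-up sample data using the column names from the config above.
sample_csv = "Question,Answer\nWhat is 2 + 3?,5\nWhat is 7 * 6?,42\n"

pairs = load_pairs(sample_csv, problem_key="Question", solution_key="Answer")
for pair in pairs:
    print(pair["problem"], "->", pair["solution"])
```

A dataset whose columns use different names would need only different `problem_key` and `solution_key` values, which is what the command-line overrides in the docs change allow.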
