fix deploy error

JaredforReal · JaredforReal · commit 468fb15598c7 · 2025-09-15T19:25:57.000+08:00
Signed-off-by: JaredforReal &lt;w13431838023@gmail.com&gt;
diff --git a/website/docs/training/model_performance_eval.md b/website/docs/training/model_performance_eval.md
@@ -84,7 +84,7 @@ python mmlu_pro_vllm_eval.py \
 
 ### What it outputs per model:
 
-- **results/<model_name>_(direct|cot)/**
+- **results/Model_Name_(direct|cot)/**
   - **detailed_results.csv**: one row per question with is_correct and category
   - **analysis.json**: overall_accuracy, category_accuracy map, avg_response_time, counts
   - **summary.json**: condensed metrics
@@ -113,7 +113,7 @@ python arc_challenge_vllm_eval.py \
 
 ### What it outputs per model:
 
-- **results/<model_name>_(direct|cot)/**
+- **results/Model_Name_(direct|cot)/**
   - **detailed_results.csv**: one row per question with is_correct and category
   - **analysis.json**: overall_accuracy, avg_response_time
   - **summary.json**: condensed metrics
@@ -199,7 +199,7 @@ python src/training/model_eval/result_to_config.py \
 - Constructs a new config:
   - default_model: the best average performer across categories
   - categories: For each category present in results, ranks models by accuracy:
-    - category.model_scores = [{model: "<name>", score: <float>}, ...], highest first
+    - category.model_scores = `[{ model: "Model_Name", score: 0.87 }, ...]`, highest first
   - category reasoning settings: auto-filled from a built-in mapping (math, physics, chemistry, CS, engineering -> high reasoning; others default to low/medium; you can adjust after generation)
   - Leaves out any special “auto” placeholder models if present
 
diff --git a/website/sidebars.js b/website/sidebars.js
@@ -38,6 +38,7 @@ const sidebars = {
       label: 'Model Training',
       items: [
         'training/training-overview',
+        'training/model_performance_eval',
       ],
     },
     {

Original file line number	Diff line number	Diff line change
`@@ -38,6 +38,7 @@ const sidebars = {`
`38`	`38`	`label: 'Model Training',`
`39`	`39`	`items: [`
`40`	`40`	`'training/training-overview',`
	`41`	`+ 'training/model_performance_eval',`
`41`	`42`	`],`
`42`	`43`	`},`
`43`	`44`	`{`