Skip to content

Commit 01cd69f

Browse files
committed
feat: update e2e RAG eval
1 parent 48a9dc1 commit 01cd69f

File tree

4 files changed

+475
-379
lines changed

4 files changed

+475
-379
lines changed

docs/rag_evaluation_metrics_zh.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,7 +27,7 @@ python examples/rag/dataset_rag_eval_baseline.py
2727
python examples/rag/sdk_rag_eval.py
2828

2929
# 模拟RAG系统并评估
30-
python examples/rag/eval_with_mock_rag.py
30+
python examples/rag/e2e_RAG_eval_with_mockRAG_fiqa.py
3131
```
3232

3333
### 2. SDK方式 - 单个评估

examples/rag/dataset_rag_eval_baseline.py

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -79,7 +79,7 @@ def print_metrics_summary(summary: SummaryModel):
7979
metrics_summary = summary.get_metrics_score_summary(field_key)
8080
sorted_metrics = sorted(metrics_summary.items(), key=lambda x: x[1], reverse=True)
8181

82-
print(f"\n 📈 指标排名(从高到低):")
82+
print("\n 📈 指标排名(从高到低):")
8383
for i, (metric_name, avg_score) in enumerate(sorted_metrics, 1):
8484
display_name = metric_name.replace("LLMRAG", "")
8585
print(f" {i}. {display_name}: {avg_score:.2f}")

0 commit comments

Comments
 (0)