1 parent b8c85b1 commit 5cb6cc2
src/agentlab/analyze/error_analysis.py
@@ -133,7 +133,7 @@
 FEW-SHOT CLASSIFICATION EXAMPLES
 --------------------------------------------------------------------------------
 
-1) EXAMPLE A (Benchmarl Error - Benchmark Design Error)
+1) EXAMPLE A (Benchmark Error - Benchmark Design Error)
 • Context: The agent correctly finds a cheaper product meeting the user's criteria,
 but the benchmark expects a more expensive product and marks the solution as wrong.
 • Classification: ["Benchmark Design Error"]