Commit 0bceb96

introduction
1 parent de84a7c commit 0bceb96

1 file changed: +3 -3 lines changed

evals/introduction.mdx

Lines changed: 3 additions & 3 deletions
@@ -4,12 +4,12 @@ title: Introduction
 
 ## What are Evals?
 
-**Evals** are like tests for your Agents. Use them judiciously to evaluate the performance of your Agents and improve them over time.
+**Evals** are unit & integration tests for your Agents. Use them judiciously to evaluate and measure the performance of your Agents and improve it over time.
 
 We typically evaludate Agents on 3 dimensions:
 
-- **Accuracy:** How accurate is the Agent's response?
-- **Performance:** How fast does the Agent produce the output and what is the memory footprint?
+- **Accuracy:** How complete/correct/accurate is the Agent's response (LLM-as-a-judge)
+- **Performance:** How fast does the Agent respond and what's the memory footprint?
 - **Reliability:** Does the Agent make the expected tool calls?
 
 ### Accuracy
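
For readers of this change, the three dimensions named in the file (accuracy, performance, reliability) can be illustrated with a minimal, library-agnostic sketch. Everything here is an assumption made for illustration: `AgentRun`, `toy_agent`, and the `eval_*` helpers are hypothetical names, not the project's API, and the accuracy check uses exact matching as a placeholder where the file describes LLM-as-a-judge scoring.

```python
import time
import tracemalloc
from dataclasses import dataclass, field


@dataclass
class AgentRun:
    """Hypothetical record of one agent invocation (not a real library type)."""
    output: str
    tool_calls: list = field(default_factory=list)


def toy_agent(prompt: str) -> AgentRun:
    """Stand-in agent, assumed for illustration: always answers "4" via a calculator tool."""
    return AgentRun(output="4", tool_calls=["calculator"])


def eval_performance(agent, prompt: str) -> dict:
    """Performance: wall-clock latency and peak Python-level memory for one call."""
    tracemalloc.start()
    start = time.perf_counter()
    run = agent(prompt)
    seconds = time.perf_counter() - start
    _, peak_bytes = tracemalloc.get_traced_memory()
    tracemalloc.stop()
    return {"run": run, "seconds": seconds, "peak_bytes": peak_bytes}


def eval_reliability(run: AgentRun, expected_tools: list) -> bool:
    """Reliability: did the agent make every expected tool call?"""
    return all(tool in run.tool_calls for tool in expected_tools)


def eval_accuracy(run: AgentRun, expected: str) -> bool:
    """Accuracy: exact-match placeholder; a judge model would score the output here."""
    return run.output.strip() == expected.strip()


perf = eval_performance(toy_agent, "What is 2 + 2?")
report = {
    "accuracy": eval_accuracy(perf["run"], "4"),
    "reliability": eval_reliability(perf["run"], ["calculator"]),
    "seconds": perf["seconds"],
    "peak_bytes": perf["peak_bytes"],
}
print(report)
```

Keeping the three checks as separate functions mirrors the way the file lists them as independent dimensions, so each can be run or swapped out on its own.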
