-
Notifications
You must be signed in to change notification settings - Fork 12
Evaluating Chatbot responses
yangm2 edited this page Feb 16, 2026
·
2 revisions
| term | definition |
|---|---|
| dataset | tbd |
| evaluation | tbd |
| evaluator | tbd |
| experiment | tbd |
| llm-as-a-judge | tbd |
| single-turn conversation | tbd |
| multi-turn conversation | tbd |
| trajectory evaluation | tbd |
| simulated user | tbd |
| tech | description |
|---|---|
| langchain | tbd |
| langsmith | tbd |
Please see the EVALUATION.md in the repo for setting up and running experiments with LangSmith.
TBD