README.md: 8 additions & 58 deletions
@@ -6,17 +6,16 @@ This is a collection of reference implementations for Vector Institute's **Agent
## Reference Implementations
-This repository includes several modules, each showcasing a different aspect of agent-based RAG systems:
+This repository includes three modules, each demonstrating a different aspect of building and evaluating agent-based systems:
-**3. Evals: Automated Evaluation Pipelines**
-Contains scripts and utilities for evaluating agent performance using LLM-as-a-judge and synthetic data generation. Includes tools for uploading datasets, running evaluations, and integrating with [Langfuse](https://langfuse.com/) for traceability.
+A ReAct agent using Google ADK and Google Search to answer questions grounded in live web content. Evaluated on the DeepSearchQA benchmark using LLM-as-a-judge metrics.
+An agent that investigates Anti-Money Laundering cases by querying a SQLite database of financial transactions via a read-only SQL tool. Produces structured analysis and supports batch evaluation.
+An agent that accepts natural language queries and generates downloadable Excel reports from a relational database. Includes a Gradio demo UI and Langfuse-integrated evaluations.
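
The "read-only SQL tool" mentioned for the Anti-Money Laundering agent can be illustrated with a minimal sketch. This is not the repository's implementation: the function names, the `transactions` table, and its columns are hypothetical and chosen only for this example. It relies on the standard-library `sqlite3` module and SQLite's `mode=ro` URI parameter, so the database engine itself rejects writes even if the simple statement check is bypassed.

```python
import sqlite3
import tempfile
from pathlib import Path


def run_readonly_query(db_path: str, sql: str, max_rows: int = 100) -> list[tuple]:
    """Run one query against a SQLite file opened in read-only mode.

    Hypothetical helper for illustration; not taken from the repository.
    """
    # Cheap first filter: only accept statements that start like a read query.
    if not sql.lstrip().lower().startswith(("select", "with")):
        raise ValueError("only SELECT/WITH queries are allowed")
    # mode=ro makes SQLite itself refuse writes, independent of the check above.
    uri = f"{Path(db_path).resolve().as_uri()}?mode=ro"
    conn = sqlite3.connect(uri, uri=True)
    try:
        # Cap the number of rows returned so an agent cannot pull a whole table.
        return conn.execute(sql).fetchmany(max_rows)
    finally:
        conn.close()


def make_demo_db(db_path: str) -> None:
    """Create a tiny, made-up transactions table for the example."""
    conn = sqlite3.connect(db_path)
    with conn:  # commits on success
        conn.execute(
            "CREATE TABLE transactions ("
            "id INTEGER PRIMARY KEY, account TEXT, amount REAL)"
        )
        conn.executemany(
            "INSERT INTO transactions (account, amount) VALUES (?, ?)",
            [("A-1", 9500.0), ("A-1", 9800.0), ("B-2", 120.0)],
        )
    conn.close()


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp:
        db = str(Path(tmp) / "demo.db")
        make_demo_db(db)
        query = (
            "SELECT account, COUNT(*) FROM transactions "
            "GROUP BY account ORDER BY account"
        )
        print(run_readonly_query(db, query))
```

Opening the connection read-only is the part that actually enforces safety; the prefix check only gives the agent an early, readable error. A statement such as `WITH x AS (SELECT 1) DELETE FROM transactions` passes the prefix check but is still refused by SQLite because of `mode=ro`.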
## Getting Started
@@ -32,7 +31,7 @@ Run integration tests to validate that your API keys are set up correctly.
uv run --env-file .env pytest -sv tests/tool_tests/test_integration.py
```
-## Reference Implementations
+## Running the Implementations
For "Gradio App" reference implementations, running the script prints a "public URL" ending in `gradio.live` (it may take a few seconds to appear). To access the Gradio app with full streaming capabilities, copy and paste this `gradio.live` URL into a new browser tab.
These warnings can be safely ignored; they result from a bug in the upstream libraries. Your agent traces will be uploaded to Langfuse as configured.