Workshop at Big Data Conference Europe 2025
π Vilnius, Lithuania | π
November 18, 2025
In this hands-on workshop, you'll build an evaluation-driven RAG system that:
- Extracts financial data from Wikipedia
- Calculates Return on Investment (ROI) for films
- Classifies performance (Blockbuster, Profitable, Break-even, Flop)
- Evaluates against ground truth data
You'll learn to systematically improve agent performance through:
- Baseline measurement
- Error analysis
- Iterative improvements
- Metrics tracking (pass rate, cost, latency)
- Python 3.8+
- A Cohere API key (free tier works!)
-
Clone or download this repository
-
Run the setup script:
chmod +x setup.sh ./setup.sh
-
Add your API key:
- Edit
.envfile - Replace
your-api-key-herewith your actual Cohere API key
- Edit
-
Start Jupyter:
source venv/bin/activate jupyter notebook -
Open the notebook:
- Navigate to
notebooks/02_rag_with_eval.ipynb - Follow along with the instructor
- Navigate to
.
βββ README.md # This file
βββ agenda.md # Workshop schedule
βββ setup.sh # Automated setup script
βββ requirements.txt # Python dependencies
βββ .env.example # API key template
βββ data/
β βββ ground_truth/
β βββ film_box_office_ground_truth.csv # Evaluation dataset
βββ notebooks/
βββ 02_rag_with_eval.ipynb # Main workshop notebook
See agenda.md for the full schedule.
Highlights:
- Block 1 (09:20-11:15): Foundations - Build baseline agent & eval harness
- Block 2 (11:40-13:30): Iteration - Error analysis & systematic improvements
- Block 3 (14:15-15:35): Applied Lab - Choose your own use case
- Block 4 (16:00-17:00): Demos & Q&A
By the end of this workshop, you'll be able to:
β
Define task success criteria for agent workflows
β
Build gold-standard evaluation datasets
β
Implement deterministic and LLM-as-judge evaluators
β
Run error analysis to identify failure patterns
β
Iterate systematically to improve pass rates
β
Track quality metrics (pass rate) and operational metrics (cost, latency)
- Visit dashboard.cohere.com
- Sign up (free tier available)
- Generate an API key
- Add it to your
.envfile
- Ensure Python 3.8+ is installed:
python3 --version - Try manually:
python3 -m venv venv && source venv/bin/activate && pip install -r requirements.txt
- Activate the virtual environment:
source venv/bin/activate - Reinstall dependencies:
pip install -r requirements.txt
- Check
.envfile exists and contains your key - Verify format:
COHERE_API_KEY=your-actual-key(no quotes) - Get a key at dashboard.cohere.com/api-keys
- Ensure venv is activated:
source venv/bin/activate - Reinstall Jupyter:
pip install --upgrade jupyter notebook
Cohere Documentation:
Evaluation Resources:
Yann Stoneman
Staff Solutions Architect @ Cohere
Workshop materials provided for educational purposes.
Β© 2025 Big Data Conference Europe
Questions during the workshop? Ask away! π
After the workshop? Connect on LinkedIn or reach out via conference channels.