Think: Clippy... but for GNW traces. A suite of tools intended to make GNW traces accessible, regardless of technical capability.
ℹ️ What this tool does Tracey allows you quickly pull and explore traces from Langfuse.
- 📥 Fetch a single set of traces once
- 📊 Explore the same dataset across tabs
- 📋 Generate reports & understand user behaviour
- 🧪 Sample for human eval & product mining
- Python
>=3.11 - Recommended:
uv
Create a .env file in the repo root:
LANGFUSE_PUBLIC_KEY="..."
LANGFUSE_SECRET_KEY="..."
LANGFUSE_BASE_URL="..."
# Optional (only needed for Gemini-powered features)
GEMINI_API_KEY="..." # or GOOGLE_API_KEYuv syncpython -m venv .venv
source .venv/bin/activate
python -m pip install -U pip
python -m pip install -e .uv run streamlit run streamlit_app.pyThen open the URL Streamlit prints (usually http://localhost:8501).
- The app fetches traces once from the sidebar, then reuses the same dataset across tabs.
- Human eval exports are always available via the Download CSV button.
The 🧠 Product intelligence tab contains Product Development Mining, split into three sub-tabs:
- Evidence Mining: search for traces that support a hypothesis (LLM-scored relevance).
- Tagging: LLM-as-judge tagging for prompt topics/flavours and other criteria.
- Gap Analysis: generate a markdown report on user jobs, coverage, and gaps.
In that tab, open ⚙️ LLM Settings to configure:
- Gemini model
- Optional batching (Batch traces per Gemini request, batch size, max chars per trace)
Each sub-tab exposes an 📝 Edit system prompt expander so you can inspect and tweak the prompts used for Gemini.
Limited number of tests are provided for charts and user segments. To run them:
uv run --with pytest pytest tests/test_charts.py -v