A comprehensive toolkit for generating synthetic test data and evaluating LLM applications with RAG capabilities.
Open Evals is a modular evaluation framework designed to help developers test and improve their AI applications. It provides tools for:
- Synthetic Data Generation: Create realistic test datasets using knowledge graphs, personas, and scenarios
- Evaluation Metrics: Pre-built and custom metrics for assessing LLM performance
- RAG Utilities: Text splitters for retrieval-augmented generation
- Evaluation Framework: Core abstractions for running comprehensive evaluations
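To make the shape of these pieces concrete, here is a minimal sketch of how a dataset, a metric, and an evaluation loop can fit together. The interfaces and names below are illustrative assumptions, not the actual `@open-evals/core` API:

```typescript
// Hypothetical shapes for illustration (not the real @open-evals/core API):
// a dataset is a list of samples, a metric scores one sample in [0, 1],
// and an evaluation averages each metric over the dataset.
interface Sample {
  input: string;
  output: string;
  reference: string;
}

interface Metric {
  name: string;
  score(sample: Sample): number;
}

function evaluate(dataset: Sample[], metrics: Metric[]): Record<string, number> {
  const results: Record<string, number> = {};
  for (const metric of metrics) {
    const total = dataset.reduce((sum, s) => sum + metric.score(s), 0);
    results[metric.name] = dataset.length ? total / dataset.length : 0;
  }
  return results;
}

// Toy metric: exact match between model output and reference.
const exactMatch: Metric = {
  name: "exact_match",
  score: (s) => (s.output.trim() === s.reference.trim() ? 1 : 0),
};

const report = evaluate(
  [
    { input: "2+2?", output: "4", reference: "4" },
    { input: "Capital of France?", output: "Lyon", reference: "Paris" },
  ],
  [exactMatch],
);
// report.exact_match === 0.5
```

Real metrics are usually more involved (often LLM-judged), but the pattern of mapping metrics over a dataset and aggregating scores is the same.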
This monorepo contains the following packages:
Core evaluation framework with abstractions for datasets, metrics, and evaluation pipelines.
```shell
pnpm add @open-evals/core
```

Synthetic test data generation using knowledge graphs, personas, and query synthesis.
```shell
pnpm add @open-evals/generator
```

RAG utilities including recursive character and markdown text splitters.
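The recursive character splitting technique mentioned above can be sketched generically. This is a simplified illustration of the algorithm, not the `@open-evals/rag` implementation: try coarse separators first, recurse into oversized pieces with finer separators, then pack adjacent pieces into chunks no longer than the limit:

```typescript
// Generic recursive character splitter sketch (illustrative, not the
// @open-evals/rag code). Splits on separators from coarse to fine,
// then packs pieces into chunks of at most `chunkSize` characters.
function splitRecursive(
  text: string,
  chunkSize: number,
  separators: string[] = ["\n\n", "\n", " ", ""],
): string[] {
  if (text.length <= chunkSize) return text ? [text] : [];
  const [sep, ...rest] = separators;
  const parts = sep === "" ? text.split("") : text.split(sep);
  const pieces: string[] = [];
  for (const part of parts) {
    if (part.length > chunkSize && rest.length > 0) {
      // Piece still too big: retry with the next, finer separator.
      pieces.push(...splitRecursive(part, chunkSize, rest));
    } else if (part) {
      pieces.push(part);
    }
  }
  // Pack adjacent pieces back together (re-inserting the separator)
  // without exceeding chunkSize.
  const chunks: string[] = [];
  let current = "";
  for (const piece of pieces) {
    const joined = current ? current + sep + piece : piece;
    if (joined.length <= chunkSize) {
      current = joined;
    } else {
      if (current) chunks.push(current);
      current = piece;
    }
  }
  if (current) chunks.push(current);
  return chunks;
}
```

A markdown-aware splitter follows the same idea but uses structural boundaries (headings, paragraphs, fences) as the separator hierarchy.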
```shell
pnpm add @open-evals/rag
```

Pre-built evaluation metrics including faithfulness, factual correctness, and more.
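As a rough intuition for what a correctness-style metric computes, here is a deliberately simplified stand-in: token-overlap F1 between an answer and a reference. Real faithfulness and factual-correctness metrics typically use an LLM judge to compare claims; this sketch is only meant to show the scoring shape:

```typescript
// Simplified illustration only: real faithfulness / factual-correctness
// metrics usually rely on an LLM judge. As a cheap stand-in, compute
// token-overlap F1 between the answer and the reference.
function tokenF1(answer: string, reference: string): number {
  const tokenize = (s: string) => s.toLowerCase().match(/[a-z0-9]+/g) ?? [];
  const a = tokenize(answer);
  const r = tokenize(reference);
  if (a.length === 0 || r.length === 0) return 0;
  // Count reference tokens, then consume matches from the answer.
  const refCounts = new Map<string, number>();
  for (const t of r) refCounts.set(t, (refCounts.get(t) ?? 0) + 1);
  let overlap = 0;
  for (const t of a) {
    const n = refCounts.get(t) ?? 0;
    if (n > 0) {
      overlap++;
      refCounts.set(t, n - 1);
    }
  }
  if (overlap === 0) return 0;
  const precision = overlap / a.length;
  const recall = overlap / r.length;
  return (2 * precision * recall) / (precision + recall);
}

// tokenF1("Paris is the capital", "Paris is the capital") === 1
```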
```shell
pnpm add @open-evals/metrics
```

This project uses pnpm workspaces for managing multiple packages.
```shell
# Install dependencies
pnpm install

# Build all packages
pnpm build

# Run tests
pnpm test
```

The `agents/` directory contains example implementations:
- doc-assistant: A RAG-based documentation assistant demonstrating the full stack
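Conceptually, a documentation assistant like this follows a retrieve-then-generate loop. The sketch below uses hypothetical names and a naive term-overlap retriever purely for illustration; the real agent lives in `agents/doc-assistant`, and a production setup would use embeddings for retrieval:

```typescript
// Conceptual RAG loop sketch (hypothetical names, not the doc-assistant
// code). Rank chunks by how many query terms they contain, keep the
// top k, and assemble them into a grounded prompt.
function retrieve(query: string, chunks: string[], k: number): string[] {
  const terms = new Set(query.toLowerCase().split(/\W+/).filter(Boolean));
  return chunks
    .map((chunk) => ({
      chunk,
      score: chunk
        .toLowerCase()
        .split(/\W+/)
        .filter((w) => terms.has(w)).length,
    }))
    .sort((x, y) => y.score - x.score)
    .slice(0, k)
    .map((s) => s.chunk);
}

function buildPrompt(query: string, context: string[]): string {
  return `Answer using only this context:\n${context.join("\n---\n")}\n\nQuestion: ${query}`;
}
```

The prompt would then be sent to an LLM, and the resulting answers can be scored with the metrics package against a synthetic test set from the generator.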