This repository implements Ragas as an out-of-tree Llama Stack evaluation provider.
The goal is to provide all of Ragas' evaluation functionality over Llama Stack's eval API, while leveraging Llama Stack's built-in APIs for inference (LLMs and embeddings), datasets, and benchmarks.
There are two versions of the provider:
- `inline`: runs the Ragas evaluation in the same process as the Llama Stack server. This is always available with the base installation.
- `remote`: runs the Ragas evaluation in a remote process, using Kubeflow Pipelines. Only available when the remote dependencies are installed with `pip install llama-stack-provider-ragas[remote]`.
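Either way, evaluations are driven through Llama Stack's standard eval API. As a rough, hedged sketch of what a client-side call can look like with the `llama-stack-client` Python SDK (the benchmark ID, model name, and port below are illustrative assumptions, not values defined by this repository, and the eval API surface varies across Llama Stack versions):

```python
# Illustrative sketch only: benchmark id, model, and port are assumptions.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:8321")

# Submit an eval run; the registered Ragas provider performs the scoring.
job = client.eval.run_eval(
    benchmark_id="my-ragas-benchmark",  # hypothetical, registered beforehand
    benchmark_config={
        "eval_candidate": {
            "type": "model",
            "model": "ollama/llama3.2:3b",  # any model served by the stack
            "sampling_params": {
                "strategy": {"type": "greedy"},
                "max_tokens": 512,
            },
        },
    },
)
print(job)
```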
Prerequisites:

- Python 3.12
- uv
- The remote provider requires a running Kubeflow Pipelines server.
Installation:

- Clone this repository:

  ```bash
  git clone <repository-url>
  cd llama-stack-provider-ragas
  ```

- Create and activate a virtual environment:

  ```bash
  uv venv
  source .venv/bin/activate
  ```

- Install (optionally as an editable package). There are `distro`, `remote`, and `dev` optional dependency groups for running the sample LS distribution and the KFP-enabled remote provider; installing the `dev` dependencies will also install the `distro` and `remote` dependencies:

  ```bash
  uv pip install -e ".[dev]"
  ```
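As a quick sanity check of the installation (assuming the package exposes the module name `llama_stack_provider_ragas`; adjust if the actual module name differs):

```python
# Import check only; the module name is inferred from the package name.
import llama_stack_provider_ragas

print(llama_stack_provider_ragas.__name__, "imported OK")
```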
There are two sample LS distributions (one for the inline provider and one for the remote provider); each is a simple distribution that uses Ollama for inference and embeddings. See the provider-specific sections below for setup and run commands.
For the inline provider, create a `.env` file with the required environment variable:

```bash
EMBEDDING_MODEL=ollama/all-minilm:l6-v2
```

Run the server:

```bash
dotenv run uv run llama stack run distribution/run.yaml
```
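Once the server is up, you can point a client at it to confirm that the distribution is serving models and benchmarks; a minimal sketch, assuming the server listens on the default port 8321:

```python
# Minimal connectivity check against the local sample distribution.
from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:8321")

# These are the resources the Ragas eval provider builds on.
for model in client.models.list():
    print("model:", model.identifier)
for benchmark in client.benchmarks.list():
    print("benchmark:", benchmark.identifier)
```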
For the remote provider, first install the remote dependencies:

```bash
uv pip install -e ".[remote]"
```

Create a `.env` file with the following:

```bash
# Required for both inline and remote
EMBEDDING_MODEL=ollama/all-minilm:l6-v2
# Required for remote provider
KUBEFLOW_LLAMA_STACK_URL=<your-llama-stack-url>
KUBEFLOW_PIPELINES_ENDPOINT=<your-kfp-endpoint>
KUBEFLOW_NAMESPACE=<your-namespace>
KUBEFLOW_BASE_IMAGE=quay.io/diegosquayorg/my-ragas-provider-image:latest
KUBEFLOW_PIPELINES_TOKEN=<your-pipelines-token>
KUBEFLOW_RESULTS_S3_PREFIX=s3://my-bucket/ragas-results
KUBEFLOW_S3_CREDENTIALS_SECRET_NAME=<secret-name>
```

Where:
- `KUBEFLOW_LLAMA_STACK_URL`: The URL of the Llama Stack server that the remote provider will use to run the evaluation (LLM generations, embeddings, etc.). If you are running Llama Stack locally, you can use ngrok to expose it to the remote provider.
- `KUBEFLOW_PIPELINES_ENDPOINT`: You can get this via `kubectl get routes -A | grep -i pipeline` on your Kubernetes cluster.
- `KUBEFLOW_NAMESPACE`: The name of the data science project where the Kubeflow Pipelines server is running.
- `KUBEFLOW_PIPELINES_TOKEN`: Kubeflow Pipelines token with access to submit pipelines. If not provided, the token will be read from the local kubeconfig file.
- `KUBEFLOW_BASE_IMAGE`: The image used to run the Ragas evaluation in the remote provider. See `Containerfile` for details. There is a public version of this image at `quay.io/diegosquayorg/my-ragas-provider-image:latest`.
- `KUBEFLOW_RESULTS_S3_PREFIX`: S3 location (bucket and prefix folder) where evaluation results will be stored, e.g., `s3://my-bucket/ragas-results`.
- `KUBEFLOW_S3_CREDENTIALS_SECRET_NAME`: Name of the Kubernetes secret containing AWS credentials with write access to the S3 bucket. Create it with:

  ```bash
  oc create secret generic <secret-name> \
    --from-literal=AWS_ACCESS_KEY_ID=your-access-key \
    --from-literal=AWS_SECRET_ACCESS_KEY=your-secret-key \
    --from-literal=AWS_DEFAULT_REGION=us-east-1
  ```
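Before starting the server, a small pre-flight check like the following (illustrative, not part of the provider) can catch missing variables; `KUBEFLOW_PIPELINES_TOKEN` is left out because it falls back to the local kubeconfig:

```python
# Illustrative pre-flight check for the remote-provider environment,
# based on the variable list above.
import os

REQUIRED = [
    "EMBEDDING_MODEL",
    "KUBEFLOW_LLAMA_STACK_URL",
    "KUBEFLOW_PIPELINES_ENDPOINT",
    "KUBEFLOW_NAMESPACE",
    "KUBEFLOW_BASE_IMAGE",
    "KUBEFLOW_RESULTS_S3_PREFIX",
    "KUBEFLOW_S3_CREDENTIALS_SECRET_NAME",
]

missing = [name for name in REQUIRED if not os.environ.get(name)]
if missing:
    raise SystemExit(f"Missing environment variables: {missing}")
print("Remote provider environment looks complete.")
```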
Run the server:

```bash
dotenv run uv run llama stack run distribution/run.yaml
```

See the demos in the `demos` directory.
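At a high level, the demos submit an eval run through the same eval API shown earlier and then poll the job until it finishes (for the remote provider, until the Kubeflow pipeline completes). A hedged sketch of that polling loop (job-API method names and status values are assumptions that may differ between Llama Stack versions):

```python
# Illustrative follow-up to the run_eval sketch above: poll a submitted
# job and fetch its scores. Names and shapes are assumptions.
import time

from llama_stack_client import LlamaStackClient

client = LlamaStackClient(base_url="http://localhost:8321")
benchmark_id = "my-ragas-benchmark"  # hypothetical, as in the earlier sketch
job_id = "..."  # the id returned by client.eval.run_eval

# Remote runs wait on the Kubeflow pipeline, so poll with a generous interval.
while str(client.eval.jobs.status(job_id=job_id, benchmark_id=benchmark_id)) in (
    "scheduled",
    "in_progress",
):
    time.sleep(10)

result = client.eval.jobs.retrieve(job_id=job_id, benchmark_id=benchmark_id)
print(result.scores)  # per-metric Ragas scores
```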
