Extends ../CLAUDE.md
MIT-licensed AI tournament framework for decision synthesis through model competition and critique.

```bash
# Auto-discover Ollama models (generates config.yml)
make discover-ollama

# Run tournament
arbitrium --config config.yml

# Run YAML workflow
arbitrium workflow execute examples/workflow.yml
```

```bash
# Development
source venv/bin/activate
make dev           # install dev dependencies
make fmt           # format code
make lint          # lint + type check
make test          # run tests
pre-commit run -a  # all quality checks
```

Python API:

```python
import asyncio

from arbitrium_core import Arbitrium


async def main():
    arb = await Arbitrium.from_settings({
        "models": {
            "gpt": {"provider": "openai", "name": "gpt-4o"},
            "claude": {"provider": "anthropic", "name": "claude-3-5-sonnet-20241022"},
        }
    })
    result, metrics = await arb.run_tournament("What is the best approach to...")
    print(result)


asyncio.run(main())
```
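
Local Ollama models should plug into the same settings dict. The sketch below assumes an `ollama` provider key and a `llama3` model name; neither is confirmed by this file, so check the config.yml generated by `make discover-ollama` for the real shape.

```python
import asyncio

from arbitrium_core import Arbitrium


async def main():
    # Assumption: "ollama" is a valid provider key and "llama3" is a pulled
    # model; OLLAMA_BASE_URL (see the environment variables below) points at
    # the local Ollama server.
    arb = await Arbitrium.from_settings({
        "models": {"local": {"provider": "ollama", "name": "llama3"}},
    })
    result, _metrics = await arb.run_tournament("Compare SQLite and Postgres.")
    print(result)


asyncio.run(main())
```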

```text
┌─────────────────────────────────┐
│ arbitrium --config config.yml │
│ ┌───────────────────────────┐ │
│ │ Tournament Engine │ │
│ │ (src/arbitrium/core/) │ │
│ │ ├─ Competitors (LLMs) │ │
│ │ ├─ Judges (LLMs) │ │
│ │ ├─ Rubrics & Scoring │ │
│ │ └─ Knowledge Bank │ │
│ └───────────────────────────┘ │
│ ▼ │
│ Console Output + JSON Reports │
└─────────────────────────────────┘
```

```text
src/arbitrium/
├── core/
│   ├── tournament.py     # Tournament orchestration
│   ├── scorer.py         # Rubric-based scoring
│   ├── knowledge_bank.py # Insight extraction
│   ├── nodes/            # Workflow node system
│   └── executor/         # Graph executor
├── models/
│   └── litellm.py        # LLM provider adapters
├── cli/
│   └── main.py           # CLI entry point
├── config/               # Configuration loading
├── serialization/        # YAML workflow loader
└── utils/                # Utilities
```

```yaml
# config.yml
tournament:
  models: [claude-sonnet, gpt-4o, gemini-pro]
  judges: [claude-sonnet]
  rounds: 3
  rubrics:
    - accuracy: 0.4
    - reasoning: 0.3
    - completeness: 0.3
```

- Competitors: LLM instances generating solutions
- Judges: LLM instances scoring pairwise matchups
- Rubrics: Weighted scoring criteria
- Knowledge Bank: Preserved insights from eliminated models
- Champion: Winning solution after tournament rounds
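
As an illustration of how the weighted rubrics above combine into one score, here is a minimal sketch; the function name, 0-10 score range, and aggregation are assumptions for illustration, not Arbitrium's scorer (src/arbitrium/core/scorer.py).

```python
# Illustrative sketch of weighted rubric scoring; not Arbitrium's scorer.
# Weights mirror the config.yml example above; the 0-10 range is assumed.
RUBRIC_WEIGHTS = {"accuracy": 0.4, "reasoning": 0.3, "completeness": 0.3}


def weighted_score(judge_scores: dict[str, float]) -> float:
    """Combine one judge's per-rubric scores into a single weighted total."""
    return sum(RUBRIC_WEIGHTS[r] * s for r, s in judge_scores.items())


# One judge's scores for one competitor's answer:
print(weighted_score({"accuracy": 8.0, "reasoning": 7.5, "completeness": 9.0}))
# -> 8.15 (up to floating-point rounding)
```
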
YAML-based workflow execution for building custom AI pipelines:

```yaml
# examples/simple_workflow.yml
name: Simple LLM Pipeline
nodes:
  - id: input
    type: simple/text
    properties:
      text: "Explain quantum computing"
  - id: llm
    type: llm/completion
    properties:
      model: gpt-4o
edges:
  - source: input
    target: llm
    sourceHandle: output_text
    targetHandle: prompt
outputs: [llm]
```
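
As a mental model for how an edge carries a `sourceHandle` output into a `targetHandle` input, here is a tiny self-contained sketch of handle-based dataflow over this graph. It is illustrative only, not Arbitrium's executor, and the node callables are stand-ins.

```python
# Illustrative handle-based dataflow, not Arbitrium's executor.
from graphlib import TopologicalSorter

# Stand-in node implementations: each takes named inputs, returns named outputs.
nodes = {
    "input": lambda inputs: {"output_text": "Explain quantum computing"},
    "llm":   lambda inputs: {"text": f"LLM answer to: {inputs['prompt']}"},
}
edges = [{"source": "input", "target": "llm",
          "sourceHandle": "output_text", "targetHandle": "prompt"}]

# Each target node depends on its source nodes.
deps = {node: set() for node in nodes}
for e in edges:
    deps[e["target"]].add(e["source"])

results = {}
for node in TopologicalSorter(deps).static_order():
    # Gather this node's inputs from already-computed upstream outputs.
    inputs = {e["targetHandle"]: results[e["source"]][e["sourceHandle"]]
              for e in edges if e["target"] == node}
    results[node] = nodes[node](inputs)

print(results["llm"]["text"])
```

Nodes run in dependency order, so `llm` sees the text produced by `input` under its `prompt` handle.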

Available node types:

```bash
arbitrium workflow list-nodes
```

Integration tests only (no mocks). Tests run against real LLM providers:

```bash
pytest tests/integration/ -v
```
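
For orientation, here is a hedged sketch of what such a test could look like, assuming pytest-asyncio is installed and using the `Arbitrium.from_settings` / `run_tournament` API shown above; the file name, test body, and prompt are illustrative, not taken from the repo.

```python
# tests/integration/test_tournament.py -- illustrative sketch, not a real
# repo test. Assumes pytest-asyncio and a valid OPENAI_API_KEY in the env.
import pytest

from arbitrium_core import Arbitrium


@pytest.mark.asyncio
async def test_tournament_returns_a_result():
    arb = await Arbitrium.from_settings({
        "models": {"gpt": {"provider": "openai", "name": "gpt-4o"}},
    })
    result, metrics = await arb.run_tournament("Summarize the CAP theorem.")
    assert result  # a real completion from a real provider, no mocks
```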

Requires API keys in environment or `.env`:

```bash
# LLM API Keys
OPENAI_API_KEY=sk-...
ANTHROPIC_API_KEY=sk-ant-...
GOOGLE_API_KEY=AI...
XAI_API_KEY=xai-...
# Ollama (local models)
OLLAMA_BASE_URL=http://localhost:11434
# LiteLLM logging
LITELLM_LOG=INFO
```

```bash
# Run tournament with config file
arbitrium --config config.yml
# Run with specific models only
arbitrium --config config.yml --models gpt,claude
# Interactive mode
arbitrium --config config.yml --interactive
# Execute YAML workflow
arbitrium workflow execute workflow.yml
# Validate workflow
arbitrium workflow validate workflow.yml
# List available node types
arbitrium workflow list-nodes
```

Install:

```bash
pip install arbitrium-core
```