Parallel reasoning CLI and Python library using multiple LLM agents. TOXP spawns N independent reasoning agents to tackle complex queries, then synthesizes their outputs through a coordinator agent into a coherent, high-confidence answer.
- Parallel Reasoning: Spawn 2-32 independent reasoning agents
- Intelligent Synthesis: Coordinator analyzes agreements, contradictions, synthesizes best answer
- Confidence Levels: Low/Medium/High confidence ratings
- Streaming Output: Real-time coordinator synthesis
- Session Logging: Markdown logs with token counts and cost estimates
- Python API: Use
run_query()directly from your own code - 1M Context: Optional 1M token context window for supported models (Opus 4.6)
- Python 3.10+
- AWS credentials with Bedrock access:
aws configure --profile your-profile
- Claude model access in AWS Bedrock console
# Using uvx (no install needed)
uvx toxp "Your question"
# Or install permanently
uv tool install toxp # recommended
pipx install toxp # alternative
pip install toxp # in current envuv tool upgrade toxp # if installed with uv
pipx upgrade toxp # if installed with pipx
pip install -U toxp # if installed with piptoxp config set aws-profile your-profile
toxp config showtoxp "Solve: x^2 + 5x + 6 = 0"
echo "Explain recursion" | toxp
toxp -v "Analyze quicksort" # verbose
toxp --quiet "What is 2 + 2?" # only answer
toxp --output answer.txt "Question"
toxp --context-1m "Summarize this very long document..."from toxp import run_query, QueryResult, validate_credentials
# Simple usage
result = await run_query("What is 2+2?")
print(result.final_answer, result.confidence)
# With overrides and callbacks
result = await run_query(
"Explain recursion",
config_overrides={"num_agents": 8, "temperature": 0.7},
callbacks=MyCallbacks(),
cancel_token=my_cancel_event,
)
# Check credentials before running
validate_credentials()Stored at ~/.toxp/config.json:
toxp config show # view all
toxp config get model # get value
toxp config set num-agents 24 # set value
toxp config reset # reset defaults| Key | Default | Description |
|---|---|---|
aws-profile |
default |
AWS profile |
region |
us-east-1 |
AWS region |
num-agents |
15 |
Parallel agents (2-32) |
temperature |
0.9 |
Agent temperature |
coordinator-temperature |
0.7 |
Coordinator temperature |
model |
claude-sonnet-4-5 |
Model ID (supports Opus 4.6) |
max-tokens |
8192 |
Max tokens per response |
max-concurrency |
auto |
Max concurrent API requests |
context-1m |
false |
Enable 1M token context window beta |
log-enabled |
true |
Enable session logging |
log-retention-days |
30 |
Days to keep logs |
Environment variables: TOXP_AWS_PROFILE, TOXP_REGION, TOXP_NUM_AGENTS
toxp [OPTIONS] [QUERY]
Options:
-q, --query TEXT Query string
-n, --num-agents INT Agents (2-32)
-t, --temperature FLOAT
-c, --max-concurrency N Max concurrent API requests
--aws-profile TEXT
--region TEXT
--context-1m Enable 1M token context window beta
-o, --output FILE
-v, --verbose
--quiet
--help
- Query sent to N reasoning agents (T=0.9)
- At least 50% must succeed
- Coordinator synthesizes responses
- Final answer with confidence level
# Credentials expired
aws sso login --profile your-profile
# Rate limiting
toxp config set num-agents 8
# Debug
toxp -v "Your question"pip install -e ".[dev]"
pytest -vMIT