Real-world energy benchmarks for local LLMs.
🏆 View the Live Leaderboard: novadedog.github.io/energy-leaderboard-runner
This tool measures the actual hardware energy consumption (Wh) and CO2 emissions of Large Language Models running on your local machine. No estimates, no cloud APIs—just real data from your hardware sensors.
As LLMs become ubiquitous, their energy footprint grows. We believe in:
- Transparency: Real measurements, not theoretical estimates.
- Reproducibility: Standardized containerized benchmarks.
- Community: Crowdsourced data from diverse hardware.
- 🔌 Real Hardware Metering:
- macOS: Apple Silicon & Intel (via
powermetrics) - Linux: NVIDIA GPUs (NVML), AMD GPUs (ROCm), Intel/AMD CPUs (RAPL)
- macOS: Apple Silicon & Intel (via
- 🤖 Broad Support: Works with Ollama, vLLM, and OpenAI-compatible endpoints.
- 📊 Rich Metrics: Energy (Wh), CO2 (g), Tokens/Watt, and more.
- 🐳 Docker Ready: Consistent environments for reproducible testing.
- Python 3.10+
- Ollama running locally (e.g.,
ollama serve) - Pull a model:
ollama pull llama3
git clone https://github.com/NOVADEDOG/energy-leaderboard-runner.git
cd energy-leaderboard-runner
pip install -r requirements.txt# Run the full suite (Recommended for contributors)
python run_all_tests.py --model llama3:latest
# Or run a specific test set
python src/main.py run-test --model llama3:latest --test-set easyHelp us build the most comprehensive energy dataset.
- Your results are saved to
results/output_*.json. - Move this file to
energy-leaderboard-web/public/data/. - Submit a Pull Request with your new data file.
👉 See RUNBOOK.md for detailed instructions on running benchmarks and contributing data.
| Platform | Meter | Status |
|---|---|---|
| macOS | powermetrics |
✅ Native Support (Requires Sudo) |
| Linux + NVIDIA | NVML |
✅ Full Support |
| Linux + AMD | ROCm |
✅ Full Support |
| Linux CPU | RAPL |
✅ Full Support |
| Windows | - | 🚧 Docker Only (No Energy Data yet) |
We love contributions! Whether it's running benchmarks on new hardware, adding support for new providers, or improving the docs.
- Run Benchmarks: See RUNBOOK.md.
- Develop: See AI_DOCS/Project_Plan.md for architecture details.
- Legal: All contributors must agree to the CLA when submitting a Pull Request.
GNU GPLv3 - see LICENSE for details.