The Largest Coding Benchmark for Large Language Models [LLM]
all info about project, which llms used in benchmarks, prompts and etc is in docs/
- Request brevity: Add “Solution must be ≤ 300 lines” to each prompt
- Accept pseudocode/core proofs for very long tasks
- Encourage inclusion of minimal unit tests within responses
Distributed under the MIT License. See LICENSE file for details.