Skip to content

Commit 93a8a3b

Browse files
nanoeval release
1 parent cf29fa0 commit 93a8a3b

1,620 files changed

Lines changed: 58983 additions & 7 deletions

File tree

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.
Lines changed: 51 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,51 @@
1+
name: Run linting on swelancer
2+
3+
on:
4+
pull_request:
5+
paths:
6+
- 'project/swelancer/**'
7+
branches:
8+
- '**'
9+
types:
10+
- opened
11+
- synchronize
12+
- reopened
13+
- ready_for_review
14+
workflow_dispatch:
15+
inputs:
16+
test_branch:
17+
description: 'Branch to run the tests on'
18+
required: false
19+
default: 'main'
20+
21+
jobs:
22+
test:
23+
runs-on: ubuntu-latest
24+
env:
25+
OPENAI_API_KEY: ${{ secrets.OPENAI_API_KEY }}
26+
27+
steps:
28+
- uses: actions/checkout@v4
29+
with:
30+
lfs: false
31+
fetch-depth: 1
32+
33+
- name: Install uv
34+
uses: astral-sh/setup-uv@v5
35+
36+
- name: Install dependencies
37+
working-directory: ./project/swelancer
38+
env:
39+
UV_GIT_LFS: 1
40+
run: |
41+
uv sync
42+
43+
- name: run mypy
44+
working-directory: ./project/swelancer
45+
run: |
46+
uv run poe typecheck
47+
48+
- name: run ruff-strict
49+
working-directory: ./project/swelancer
50+
run: |
51+
uv run ruff check --output-format=github --no-fix

.pre-commit-config.yaml

Lines changed: 10 additions & 6 deletions
Original file line numberDiff line numberDiff line change
@@ -8,23 +8,24 @@ repos:
88
# picks up config from [tool.ruff] in pyproject.toml
99
- id: ruff-format
1010
name: ruff-format (black-style)
11-
files: (?x)^project/paperbench/
11+
files: (?x)^project/(paperbench|swelancer)/
12+
1213
# Lenient hook: runs auto‑fix on all files using a lenient config.
1314
- id: ruff
14-
name: ruff (auto-fix)
15+
name: ruff-lenient (auto-fix)
1516
args:
1617
- --config=./.ruff_autofix_minimal.toml
1718
- --unsafe-fixes
1819
- --exit-non-zero-on-fix
1920

2021
# Strict hook: runs lint only on files opted-in projects (using strict config pulled from pyproject.toml)
2122
- id: ruff
22-
name: ruff (lint)
23+
name: ruff-strict (lint)
2324
args:
2425
- --fix
2526
- --fix-only
2627
- --exit-non-zero-on-fix
27-
files: (?x)^project/(alcatraz|nanoeval|paperbench)/
28+
files: (?x)^project/(alcatraz|nanoeval|paperbench|swelancer)/
2829

2930
- repo: https://github.com/astral-sh/uv-pre-commit
3031
# uv version.
@@ -34,5 +35,8 @@ repos:
3435
name: uv-lock-paperbench
3536
args: [--project, project/paperbench]
3637
files: (?x)^project/paperbench/
37-
38-
exclude: 'project/paperbench/paperbench/agents/aisi-basic-agent/inspect_ai'
38+
- id: uv-lock
39+
name: uv-lock-swelancer
40+
args: [--project, project/swelancer]
41+
files: (?x)^project/swelancer/
42+
exclude: '(?x)^(project/paperbench/paperbench/agents/aisi-basic-agent/inspect_ai|project/swelancer/issues)/'

README.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -9,6 +9,6 @@ Python 3.11 (3.12 is untested; 3.13 will break [chz](https://github.com/openai/c
99
## Evals
1010

1111
- [PaperBench](./project/paperbench/README.md): based on [https://openai.com/index/paperbench/](https://openai.com/index/paperbench/)
12-
- SWELancer (Forthcoming): based on [https://openai.com/index/swe-lancer/](https://openai.com/index/swe-lancer/)
12+
- [SWELancer](./project/swelancer/README.md) : based on [https://openai.com/index/swe-lancer/](https://openai.com/index/swe-lancer/)
1313
- MLE-bench (Forthcoming): based on [https://openai.com/index/mle-bench/](https://openai.com/index/mle-bench/)
1414
- SWE-bench (Forthcoming): based on [https://openai.com/index/introducing-swe-bench-verified/](https://openai.com/index/introducing-swe-bench-verified/)

0 commit comments

Comments
 (0)