Skip to content

Commit f42d9fe

Browse files
authored
Run install and test on several Python versions (#7)
* Run pytest on many versions
1 parent 6bd0405 commit f42d9fe

File tree

3 files changed

+27
-7
lines changed

3 files changed

+27
-7
lines changed

.github/workflows/build.yml

Lines changed: 11 additions & 5 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,4 @@
1-
name: Python application
1+
name: Build & Test
22

33
on:
44
push:
@@ -13,19 +13,25 @@ jobs:
1313
build:
1414

1515
runs-on: ubuntu-latest
16+
strategy:
17+
fail-fast: false
18+
matrix:
19+
python-version: ["3.9", "3.10", "3.11", "3.12"]
1620

1721
steps:
1822
- uses: actions/checkout@v4
19-
- name: Set up Python 3.10
23+
- name: Set up Python ${{ matrix.python-version }}
2024
uses: actions/setup-python@v3
2125
with:
22-
python-version: "3.10"
26+
python-version: ${{ matrix.python-version }}
2327
- name: Install dependencies
2428
run: |
2529
python -m pip install --upgrade pip
26-
pip install pre-commit
27-
pip install -e .
30+
pip install -e ".[testing,develop]"
2831
- name: Run pre-commit checks
2932
run: |
3033
pre-commit install
3134
pre-commit run --all-files
35+
- name: Test with pytest
36+
run: |
37+
pytest

README.md

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,4 +1,17 @@
1+
<div align="center">
2+
<h1>SWT-Bench 🐛🔍</h1>
3+
4+
[![Build & Test](https://github.com/logic-star-ai/swt-bench/actions/workflows/build.yml/badge.svg)](https://github.com/logic-star-ai/swt-bench/actions/workflows/build.yml)
5+
<a href="https://www.python.org/">
6+
<img alt="Build" src="https://img.shields.io/badge/Python-3.9+-1f425f.svg?color=blue">
7+
</a>
8+
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](https://opensource.org/licenses/MIT)
9+
10+
</div>
11+
12+
113
## 👋 Overview
14+
215
SWT-bench is a benchmark for evaluating large language models on testing generation for real world software issues collected from GitHub.
316
Given a *codebase* and an *issue*, a language model is tasked with generating a *reproducing test* that fails in the original state of the code base and passes after a patch resolving the issue has been applied.
417

pyproject.toml

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -4,7 +4,7 @@ build-backend = "setuptools.build_meta"
44

55
[project]
66
name = "SWT-Bench"
7-
version = "0.0.1"
7+
version = "1.0.1"
88
authors = [
99
{name="Mark Mueller", email="mark@logicstar.ai"},
1010
{name="Niels Mündler", email="niels.muendler@inf.ethz.ch"},
@@ -24,5 +24,6 @@ dependencies = [
2424
]
2525

2626
[project.optional-dependencies]
27-
testing = ["pytest", "pytest-cov", "pytest-xdist", "git"]
27+
develop = ["pre-commit"]
28+
testing = ["pytest", "pytest-cov", "pytest-xdist"]
2829
figures = ["tiktoken", "numpy", "tabulate", "venny4py", "nltk"]

0 commit comments

Comments
 (0)