    Repositories list

    • SWE-bench

      Public
      SWE-bench: Can Language Models Resolve Real-world Github Issues?
      Python
      MIT License
      7954.5k5834 Updated Mar 17, 2026
    • SWE-smith

      Public
      [NeurIPS 2025 D&B Spotlight] Scaling Data for SWE-agents
      Python
      MIT License
      112597125 Updated Mar 16, 2026
    • swe-bench.github.io

      Public
      Landing page + leaderboard for SWE-Bench benchmark
      JavaScript
      Other
      151253 Updated Mar 4, 2026
    • SWE-smith-envs

      Public
      Artifacts for building environments (Docker images) for repositories represented in SWE-smith
      Dockerfile
      2500 Updated Mar 2, 2026
    • experiments

      Public
      Open sourced predictions, execution logs, trajectories, and results from model inference + evaluation runs on the SWE-bench task.
      Shell
      2952551227 Updated Feb 27, 2026
    • reading-list

      Public
      Academic papers and works related to SWE-bench and SWE-agents
      41000 Updated Dec 8, 2025
    • .github

      Public
      MIT License
      0000 Updated Nov 14, 2025
    • sb-cli

      Public
      Run SWE-bench evaluations remotely
      Python
      MIT License
      859100 Updated Aug 14, 2025
    • humanevalfix-results

      Public archive
      Evaluation data + results for SWE-agent inference on HumanEvalFix task
      Jupyter Notebook
      0100 Updated Jul 11, 2024