Skip to content

Copilot (app/copilot-swe-agent) EFO Ontology PR Analysis #2592

@dragon-ai-agent

Description

@dragon-ai-agent

This is a report of hits/failures for agent use in EFO, written by an agent on behalf of @cmungall. Feel free to close if not useful. For broader context see ai4curation/aidocs#60

Overall Statistics

Status Count Percentage
Merged 20 39.2%
Closed (not merged) 28 54.9%
Open 3 5.9%
Total 51 100%

Success rate of closed PRs: 20/48 = 41.7%

First-Try Success Rate

From sample analysis of extracted data, all 20 merged copilot PRs are categorized as merged_with_mods.

First-try success rate: ~0% (all merged PRs required modifications)

Note: Full analysis limited by file size (16GB extracted data file).

Failed PRs Checklist

Duplicate PRs (Same Issue, Multiple Attempts)

Issue #2490 - CHEBI terms for 'response to' EFO terms (13 PRs)

Issue #2546 - Bronchiectasis endotype terms (4 PRs)

Issue #2562 - GWAS drug response terms (4 PRs)

Issue #2445 - Slide-seq update (2 PRs)

Other Failed PRs

Failure Pattern Summary

# Failure Mode PRs Affected Count
1 Duplicate PRs for same issue 2491-2514, 2547-2554, 2558-2569 23
2 WIP placeholder PRs never completed 2452, 2495, 2547, 2553, 2563, 2569 6
3 Closed without clear resolution 2450, 2454, 2539, 2588 4
4 Scope/approach issues needing discussion 2532 1

Note: Many PRs appear in multiple categories (e.g., WIP and duplicate).

Detailed Failure Analysis Files

See individual analysis files:

Key Learnings

  1. CRITICAL: Never create multiple PRs for the same issue - Agent created 13 PRs for issue Add CHEBI terms to 'response to ...' EFO terms #2490 alone
  2. Update existing branches instead of creating new PRs when asked to revise
  3. Complete WIP PRs before creating new ones - many [WIP] placeholders were abandoned
  4. Discuss approach first for complex or contentious changes
  5. Check for existing similar terms before adding new ones

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions