chore(ui): move autopilot to first runner #25512

karanh37 · 2026-01-25T11:24:28Z

chore(ui): move autopilot to first runner

Summary by Gitar

New isolated test runner:
- Created dedicated IsolatedTests Playwright project for AutoPilot E2E tests in playwright.config.ts
CI pipeline configuration:
- Added IsolatedTests project to shard 1 in all CI workflows (MySQL/PostgreSQL nightly and E2E)
Test isolation improvement:
- AutoPilot tests now run separately with only setup dependency, excluded from main chromium runner
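
The isolation described above can be sketched as follows. This is a minimal illustration, not the PR's actual `playwright.config.ts`: the `ProjectSketch` interface is a local stand-in for the Playwright project fields `name`, `testMatch`, `testIgnore`, and `dependencies`, and the regular expressions, the `setup` file pattern, and the `projectsFor` helper are assumptions made for the example.

```typescript
// Local stand-in for the Playwright project fields used in this sketch.
interface ProjectSketch {
  name: string;
  testMatch?: RegExp;
  testIgnore?: RegExp;
  dependencies?: string[];
}

// Assumed pattern: the PR only states that the project matches AutoPilot.spec.ts.
const AUTOPILOT_SPEC = /AutoPilot\.spec\.ts$/;

const projects: ProjectSketch[] = [
  // Assumed setup project; the summary only says "setup dependency".
  { name: "setup", testMatch: /\.setup\.ts$/ },
  // Main runner: skips the AutoPilot spec.
  { name: "chromium", testIgnore: AUTOPILOT_SPEC, dependencies: ["setup"] },
  // Dedicated project: runs only the AutoPilot spec, depends only on setup.
  { name: "IsolatedTests", testMatch: AUTOPILOT_SPEC, dependencies: ["setup"] },
];

// Hypothetical helper: which projects would pick up a given test file?
// Projects without testMatch fall back to "any *.spec.ts file" here.
function projectsFor(file: string): string[] {
  return projects
    .filter((p) => {
      const matched = p.testMatch ? p.testMatch.test(file) : /\.spec\.ts$/.test(file);
      const ignored = p.testIgnore ? p.testIgnore.test(file) : false;
      return matched && !ignored;
    })
    .map((p) => p.name);
}
```

In a real config these objects would sit in the `projects` array; the point of the sketch is that the same pattern appears as `testIgnore` on the main runner and as `testMatch` on the isolated one, so each spec file lands in exactly one project.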

This will update automatically on new commits.

github-actions · 2026-01-25T11:56:11Z

Jest test Coverage

UI tests summary

Lines	Statements	Branches	Functions
	65.77% (55191/83909)	44.77% (28466/63584)	47.73% (8679/18185)

- Add IsolatedTests project to playwright.config.ts matching AutoPilot.spec.ts
- Update playwright-postgresql-e2e.yml to include IsolatedTests in shard 1
- Update mysql-nightly-e2e.yml to include IsolatedTests in shard 1
- Update postgresql-nightly-e2e.yml to include IsolatedTests in shard 1

Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
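
One way to picture "include IsolatedTests in shard 1" is as a per-shard choice of Playwright CLI arguments. The sketch below is an assumption about the shape of that logic, not a copy of the workflow files: the `playwrightArgs` helper is hypothetical, and only the `--project` and `--shard` flags are real Playwright CLI options.

```typescript
// Hypothetical helper: build the `npx` argument list for one CI shard.
// Assumption: every shard runs the main chromium project, and only
// shard 1 additionally runs the IsolatedTests project.
function playwrightArgs(shard: number, totalShards: number): string[] {
  const projectNames = ["chromium"];
  if (shard === 1) {
    projectNames.push("IsolatedTests");
  }
  return [
    "playwright",
    "test",
    ...projectNames.map((name) => `--project=${name}`),
    `--shard=${shard}/${totalShards}`,
  ];
}

// Example: the command line shard 1 of 5 would run under this assumption.
console.log(["npx", ...playwrightArgs(1, 5)].join(" "));
```

The real workflows may run the isolated project as a separate step rather than as an extra `--project` flag; the sketch only shows that the extra project is tied to the first shard.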

gitar-bot · 2026-01-25T12:10:47Z

🔍 CI failure analysis for c2fd3e6: CRITICAL: Retry also failed. Shard 1/5 now has 75% failure rate (3 of 4 attempts). Infrastructure issue persists with 215 failures. This is a systemic CI problem requiring immediate infrastructure fixes, NOT a PR code issue.

CRITICAL: Infrastructure Failure Persists Despite Retry

Shard 1/5 Complete History

Attempt	Result	Failures	Status
1	❌	249	Backend unresponsive
2	✅	3	SUCCESS (genuine failures only)
3	❌	206	Infrastructure (3.6h duration)
4 (Retry)	❌	215	Infrastructure persists (3.6h)

Success Rate: 25% (1 of 4 attempts)

Infrastructure Failure Rate: 75%
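
The two rates follow directly from the attempt table. A short sketch of the arithmetic, using the failure counts reported above (the `Attempt` type and `percent` helper are illustrative names, not part of any tool):

```typescript
// One row of the shard history table.
interface Attempt {
  id: number;
  failures: number;
  passed: boolean; // true only for the attempt the report marks as SUCCESS
}

// Values copied from the Shard 1/5 history table.
const attempts: Attempt[] = [
  { id: 1, failures: 249, passed: false },
  { id: 2, failures: 3, passed: true },
  { id: 3, failures: 206, passed: false },
  { id: 4, failures: 215, passed: false },
];

// Share of `part` in `total`, as a percentage.
function percent(part: number, total: number): number {
  return (part / total) * 100;
}

const successRate = percent(attempts.filter((a) => a.passed).length, attempts.length);
const infraFailureRate = percent(attempts.filter((a) => !a.passed).length, attempts.length);

console.log(`Success rate: ${successRate}%`);
console.log(`Infrastructure failure rate: ${infraFailureRate}%`);
```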

Attempt 4 Analysis (Latest Retry)

Status: 215 tests failed, 322 passed, 2 flaky

Duration: 3.6 hours

Critical Finding: Retry did NOT resolve the issue - failures actually increased slightly (206 → 215)

Error Escalation:

- page.goto timeouts: 14 → 31 (more than doubled)
- Same pattern: backend degradation over the 3.6-hour run
- Same error types: timeouts, elements not found, cleanup failures

Root Cause: Confirmed Systemic Issue

Three infrastructure failures in four attempts (1, 3, and 4), all with the same pattern, indicate this is NOT a random flake:

- Duration correlation: All 3 failures after 3.6-4 hours
- Progressive degradation: Service works initially, degrades over time
- Similar failure counts: 249 → 206 → 215
- Persistent across retries: Retry didn't help

Diagnosis: CI infrastructure cannot sustain 3.6-hour test runs for this shard

Likely causes:

- Memory leaks accumulating over hours
- Database connection pool exhaustion
- Resource limits insufficient
- No health checks or periodic restarts

Known Genuine Failures

From successful Attempt 2: Only 3 genuine test failures

All in ColumnBulkOperations (loader issue)
Pre-existing bugs

The 200+ failures in Attempts 1, 3, 4 are ALL infrastructure-related.

Relationship to PR

NOT caused by this PR. This PR only changes:

- AutoPilot test configuration (separate project)
- CI workflow organization

It touches no backend or infrastructure code.

This is a critical CI environment reliability issue.

Other Shard Status

Shard 3/5: Also had issues but improved 18→3 on retry. Currently has 3 genuine pre-existing failures.

Code Review ✅ Approved

Clean CI configuration change to isolate AutoPilot E2E tests. The changes are consistent across all workflow files and the Playwright config properly excludes the tests from the main runner while creating a dedicated isolated project.


sonarqubecloud · 2026-01-25T12:38:09Z

Quality Gate passed for 'open-metadata-ui'

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

move autopilot to first runner

b7e1f29

karanh37 requested a review from a team as a code owner January 25, 2026 11:24

karanh37 had a problem deploying to test January 25, 2026 11:24 — with GitHub Actions Error

github-actions bot added the "safe to test" and "UI" labels Jan 25, 2026

karanh37 requested review from akash-jain-10, harshach and tutte as code owners January 25, 2026 12:04

karanh37 had a problem deploying to test January 25, 2026 12:05 — with GitHub Actions Failure

karanh37 temporarily deployed to test January 25, 2026 12:05 — with GitHub Actions Inactive

karanh37 had a problem deploying to test January 25, 2026 15:57 — with GitHub Actions Failure

ShaileshParmar11 temporarily deployed to test January 26, 2026 05:14 — with GitHub Actions Inactive

ShaileshParmar11 had a problem deploying to test January 26, 2026 05:14 — with GitHub Actions Failure

ShaileshParmar11 temporarily deployed to test January 26, 2026 05:14 — with GitHub Actions Inactive

ShaileshParmar11 had a problem deploying to test January 26, 2026 09:22 — with GitHub Actions Failure
