Attempt to fix tag recognizer workflows flakiness by edg956 · Pull Request #25770 · open-metadata/OpenMetadata

edg956 · 2026-02-09T12:17:15Z

Describe your changes:

Fixes

I worked on ... because ...

Summary by Gitar

Workflow data prefetching:
- Fetch RecognizerFeedback entity once in WorkflowEventConsumer.handleTagRecognizerFeedback() and pass as serialized JSON workflow variable to eliminate redundant database queries during task execution
- Refactored 4 workflow task implementations (ApplyRecognizerFeedbackImpl, RejectRecognizerFeedbackImpl, CheckFeedbackSubmitterIsReviewerImpl, CreateRecognizerFeedbackApprovalTaskImpl) to use pre-fetched data
Schema updates:
- Added recognizerFeedback input parameter to 3 task schemas with proper namespace mapping and regenerated TypeScript types
Test improvements:
- Made TagRecognizerFeedbackIT tests retryable (@RetryingTest(3)) with increased timeout (3→5 minutes) to handle transient failures
- Added cleanup steps in TestSuiteBootstrap for proper test isolation
- Recovered Maven profiles configuration and updated Docker images in pom.xml

_{This will update automatically on new commits.}

Type of change:

Checklist:

I have read the CONTRIBUTING document.
My PR title is Fixes <issue-number>: <short explanation>
I have commented on my code, particularly in hard-to-understand areas.
For JSON Schema changes: I updated the migration scripts or explained why it is not needed.

github-actions · 2026-02-09T12:48:16Z

Jest test Coverage

UI tests summary

Lines	Statements	Branches	Functions
	65.76% (56045/85231)	45.15% (29339/64979)	47.91% (8850/18471)

github-actions · 2026-02-09T13:05:43Z

TypeScript types have been updated based on the JSON schema changes in the PR

Recover lost profiles configuration Add cleanup steps and opensearch configuration needed in test suite bootstrap Make tag recognizer tests retryable

github-actions · 2026-02-09T14:54:19Z

The Java checkstyle failed.

Please run mvn spotless:apply in the root of your repository and commit the changes to this PR.
You can also use pre-commit to automate the Java code formatting.

You can install the pre-commit hooks with make install_test precommit_install.

gitar-bot · 2026-02-09T15:07:57Z

🔍 CI failure analysis for e86d5e8: 6 DataProductResourceTest failures in maven-sonarcloud-ci are unrelated to PR (no code overlap). PR only modifies workflow/recognizer feedback code. Previous 99% integration test improvement maintained.

Issue

New CI run shows failures unrelated to PR:

maven-sonarcloud-ci: 6 test failures in DataProductResourceTest
Test Report: Cascading failure

Maven SonarCloud CI Failures (UNRELATED)

Job 62986027649

Test Class: DataProductResourceTest

Failures (6 total):

testDataProductBulkOutputPorts:850
- expected: <success> but was: <failure>
testDataProductDomainMigrationWithInputOutputPorts:1732
- Output port should be in target domain after migration
- expected: <a29f9048-9b11-4b75-92a0-ed1b2ed2635a> but was: <88a98c2d-46bd-43d3-b404-3b9977ed31ce>
testGetOutputPortsReturnsFullEntities:1012
- expected: <1> but was: <0>
testGetPortsByNameEndpoints:1108
- expected: <1> but was: <0>
testGetPortsViewEndpoint:1066
- expected: <1> but was: <0>
testDataProductBulkPortsViaApi:904
- HttpResponse status code: 400, reason: Error reading response

Overall Results:

Tests run: 7919
Failures: 5
Errors: 1
Skipped: 701

Root Cause

Relationship to PR: Completely unrelated

Evidence:

PR modifies workflow/recognizer feedback code only:

WorkflowEventConsumer.java
4 workflow task implementations (ApplyRecognizerFeedback, RejectRecognizerFeedback, CheckFeedbackSubmitterIsReviewer, CreateRecognizerFeedbackApprovalTask)
Integration test files (TagRecognizerFeedbackIT, TestSuiteBootstrap)
JSON schemas for workflow tasks
TypeScript generated types

Failing tests are in DataProductResourceTest, which tests:

Data product bulk operations
Domain migration with input/output ports
Port retrieval endpoints

No code overlap between PR changes and failing test area.

Test Report Failure (CASCADING)

Job 63008818112

Status: Cascading from maven-sonarcloud-ci failures

Previous Analysis Context (Still Applicable)

Integration Test Results (EXCEPTIONAL IMPROVEMENT)

OpenSearch (postgres-opensearch) - Job 62986031220

Total tests: 10,622
Errors: 3 (down from 29)
Success Rate: 99.97%

Elasticsearch (mysql-elasticsearch) - Job 62986021178

Total tests: 10,622
Errors: 1 (down from 269!)
Success Rate: 99.99%

Improvement: ~99% reduction (298 → 4) 🎉

PR's changes successfully resolved search engine API issues:

OpenSearch ShardFailure.primary API errors: 29 → 0 (100% fixed)
Elasticsearch media-type header errors: 269 → 0 (100% fixed)

Python Failures (UNRELATED)

Both Python 3.10 & 3.11:

ModuleNotFoundError: No module named 'cachetools'
Environment mismatch issue, unrelated to PR's Java changes

Failure Type Classification

maven-sonarcloud-ci (6 failures): test (DataProduct functionality, unrelated to PR)
Test Report: test (cascading)
Previous Python failures: dependency + infrastructure (unrelated)
Previous integration errors (4): test (minor timing, related to PR)

Conclusion

New failures in this run:

6 DataProductResourceTest failures are pre-existing or flaky issues unrelated to PR's workflow changes
No code overlap between PR (workflow/recognizer feedback) and failing tests (data product ports)

Previous exceptional success maintained:

Integration tests: 99% improvement (298 → 4 errors)
All search engine API compatibility issues resolved
PR's core functionality working correctly

The new failures do not impact or relate to the PR's workflow improvements.

Code Review ✅ Approved 0 resolved / 1 findings

Clean refactor that pre-fetches RecognizerFeedback once at workflow trigger time, eliminating redundant DB queries and reducing race conditions. Test improvements with retryability and cleanup are appropriate for addressing flakiness.

💡 Edge Case: Stale feedback snapshot may overwrite concurrent DB changes

📄 openmetadata-service/src/main/java/org/openmetadata/service/governance/workflows/elements/nodes/automatedTask/impl/ApplyRecognizerFeedbackImpl.java:34

The RecognizerFeedback entity is now fetched once at workflow trigger time in WorkflowEventConsumer.handleTagRecognizerFeedback() and serialized as a JSON variable. When applyFeedback() or rejectFeedback() later executes, it operates on this pre-fetched snapshot and calls update(feedback) to persist the result.

If the feedback entity were modified in the database between trigger time and task execution time (e.g., by an admin or another process), the stale snapshot would overwrite those changes. The status == PENDING check at the start of applyFeedback/rejectFeedback checks the deserialized object (which will always be PENDING since it was captured at trigger time), not the current DB state.

In practice, this is unlikely for this specific entity type since feedback items typically flow through a single workflow, and this trade-off eliminates the race conditions that caused flakiness. Just noting it for awareness. If this becomes a concern, consider re-fetching the entity's current status from DB before update(), or using optimistic locking (version field) on the entity.

Tip

Comment Gitar fix CI or enable auto-apply: gitar auto-apply:on

Options

Auto-apply is off → Gitar will not commit updates to this branch.
Display: compact → Showing less information.

Comment with these commands to change:

`Auto-apply`	`Compact`
`gitar auto-apply:on`	`gitar display:verbose`

_{Was this helpful? React with 👍 / 👎 | Gitar}

sonarqubecloud · 2026-02-09T15:35:24Z

Quality Gate passed for 'open-metadata-ui'

Issues
0 New issues
0 Accepted issues

Measures
0 Security Hotspots
0.0% Coverage on New Code
0.0% Duplication on New Code

See analysis details on SonarQube Cloud

github-actions · 2026-02-09T18:50:55Z

Failed to cherry-pick changes to the 1.11.9 branch.
Please cherry-pick the changes manually.
You can find more details here.

edg956 requested review from a team as code owners February 9, 2026 12:17

edg956 self-assigned this Feb 9, 2026

edg956 had a problem deploying to test February 9, 2026 12:17 — with GitHub Actions Error

edg956 added safe to test Add this label to run secure Github workflows on PRs To release Will cherry-pick this PR into the release branch governance labels Feb 9, 2026

github-actions bot added the Ingestion label Feb 9, 2026

edg956 had a problem deploying to test February 9, 2026 12:17 — with GitHub Actions Error

edg956 temporarily deployed to test February 9, 2026 12:17 — with GitHub Actions Inactive

edg956 had a problem deploying to test February 9, 2026 12:17 — with GitHub Actions Error

edg956 force-pushed the ci-issues branch from 229a445 to 1caeab3 Compare February 9, 2026 13:01

edg956 had a problem deploying to test February 9, 2026 13:02 — with GitHub Actions Error

edg956 force-pushed the ci-issues branch from 297846b to 4ad00cf Compare February 9, 2026 13:32

edg956 had a problem deploying to test February 9, 2026 13:33 — with GitHub Actions Error

edg956 temporarily deployed to test February 9, 2026 13:33 — with GitHub Actions Inactive

edg956 had a problem deploying to test February 9, 2026 13:33 — with GitHub Actions Failure

edg956 had a problem deploying to test February 9, 2026 13:33 — with GitHub Actions Error

edg956 temporarily deployed to test February 9, 2026 13:33 — with GitHub Actions Inactive

edg956 added 2 commits February 9, 2026 15:51

Update workflow to avoid extra DB hits

e6269ba

Updates on `openmetadata-integration-tests

1b90dff

Recover lost profiles configuration Add cleanup steps and opensearch configuration needed in test suite bootstrap Make tag recognizer tests retryable

edg956 force-pushed the ci-issues branch from 4ad00cf to 1b90dff Compare February 9, 2026 14:51

edg956 had a problem deploying to test February 9, 2026 14:51 — with GitHub Actions Error

Use updated images

e86d5e8

edg956 temporarily deployed to test February 9, 2026 15:01 — with GitHub Actions Inactive

harshach approved these changes Feb 9, 2026

View reviewed changes

edg956 enabled auto-merge (squash) February 9, 2026 15:34

harshach disabled auto-merge February 9, 2026 18:49

harshach merged commit 111af90 into main Feb 9, 2026
29 of 35 checks passed

harshach deleted the ci-issues branch February 9, 2026 18:49

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Attempt to fix tag recognizer workflows flakiness#25770

Attempt to fix tag recognizer workflows flakiness#25770
harshach merged 3 commits intomainfrom
ci-issues

edg956 commented Feb 9, 2026 •

edited by gitar-bot bot

Loading

Uh oh!

github-actions bot commented Feb 9, 2026 •

edited

Loading

Uh oh!

github-actions bot commented Feb 9, 2026

Uh oh!

github-actions bot commented Feb 9, 2026

Uh oh!

gitar-bot bot commented Feb 9, 2026 •

edited

Loading

Issue

Maven SonarCloud CI Failures (UNRELATED)

Job 62986027649

Root Cause

Test Report Failure (CASCADING)

Job 63008818112

Previous Analysis Context (Still Applicable)

Integration Test Results (EXCEPTIONAL IMPROVEMENT)

Python Failures (UNRELATED)

Failure Type Classification

Conclusion

Uh oh!

sonarqubecloud bot commented Feb 9, 2026

Uh oh!

Uh oh!

github-actions bot commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

edg956 commented Feb 9, 2026 • edited by gitar-bot bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Describe your changes:

Summary by Gitar

Type of change:

Checklist:

Uh oh!

github-actions bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Jest test Coverage

UI tests summary

Uh oh!

github-actions bot commented Feb 9, 2026

Uh oh!

github-actions bot commented Feb 9, 2026

Uh oh!

gitar-bot bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Issue

Maven SonarCloud CI Failures (UNRELATED)

Job 62986027649

Root Cause

Test Report Failure (CASCADING)

Job 63008818112

Previous Analysis Context (Still Applicable)

Integration Test Results (EXCEPTIONAL IMPROVEMENT)

Python Failures (UNRELATED)

Failure Type Classification

Conclusion

Uh oh!

sonarqubecloud bot commented Feb 9, 2026

Quality Gate passed for 'open-metadata-ui'

Uh oh!

Uh oh!

github-actions bot commented Feb 9, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

edg956 commented Feb 9, 2026 •

edited by gitar-bot bot

Loading

github-actions bot commented Feb 9, 2026 •

edited

Loading

gitar-bot bot commented Feb 9, 2026 •

edited

Loading