Test [Backport 7.75.x] Fix for Spark driver init readiness pending #22306
base: 7.75.x
Conversation
* Fix for Spark driver init readiness pending
* Add changelog
* Lint
* Add config param for retries
* Simplify description
* Address comments
* Update spark/assets/configuration/spec.yaml (Co-authored-by: Steven Yuen <[email protected]>)
* ddev sync models and config

---------

Co-authored-by: Steven Yuen <[email protected]>
(cherry picked from commit 7477060)
💡 Codex Review
Here are some automated review suggestions for this pull request.
Reviewed commit: 42b986748f
```python
# Reset startup retry counter on successful JSON parse
self._startup_retry_count = 0
```
Avoid global retry reset across different Spark drivers
_startup_retry_count is a single counter for all requests and gets reset on any successful JSON parse. In driver or YARN modes, the check makes multiple _rest_request_to_json calls to different tracking URLs in the same run; if one app responds with valid JSON, the counter resets even if another app keeps returning the “Spark is starting up” message. That means the retry limit can be perpetually skipped for a stuck driver, so its startup error is never surfaced as CRITICAL and metrics stay silently missing. Consider tracking retries per address/driver or only resetting the counter for the same URL that succeeded.
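For illustration, here is a minimal sketch of the per-address tracking the review suggests. The class, attribute, and method names below are hypothetical and not taken from the integration's code; the retry limit stands in for the config param added in this PR.

```python
from collections import defaultdict


class SparkStartupRetryTracker:
    """Sketch: track startup retries per tracking URL instead of globally."""

    def __init__(self, max_retries=3):
        # Hypothetical: in the real check this limit would come from the new config param
        self.max_retries = max_retries
        # One counter per tracking URL, so one healthy app cannot reset another's count
        self._startup_retry_counts = defaultdict(int)

    def record_success(self, url):
        # Only reset the counter for the URL that actually returned valid JSON
        self._startup_retry_counts[url] = 0

    def record_startup_pending(self, url):
        # Returns True once this specific driver has exhausted its retries,
        # so its startup error can be surfaced instead of silently skipped
        self._startup_retry_counts[url] += 1
        return self._startup_retry_counts[url] > self.max_retries
```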
Ignore this PR. Just used to test a workflow.
Backport 7477060 from #22252.
What does this PR do?
Prevents JSON decode errors when the Spark driver is starting but not quite ready.
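To illustrate the general idea (not the integration's actual implementation; the function name and the "starting up" marker string are assumptions), the check can treat the driver's not-ready response as transient for a bounded number of runs before letting the error propagate:

```python
import json


def parse_driver_response(raw_text, retry_count, max_retries=3):
    """Return parsed JSON, or None while the driver is still initializing."""
    try:
        return json.loads(raw_text)
    except json.JSONDecodeError:
        # The driver answers with a plain-text "starting up" page before its
        # REST API is ready; treat that as transient for a few check runs.
        if "starting up" in raw_text.lower() and retry_count < max_retries:
            return None
        # Retries exhausted, or an unrelated parse failure: propagate the error
        raise
```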
Motivation
Customer inquiry via support ticket.
Review checklist (to be filled by reviewers)
- Add the `qa/skip-qa` label if the PR doesn't need to be tested during QA.
- Add the `backport/<branch-name>` label to the PR and it will automatically open a backport PR once this one is merged.