@datadog-agent-integrations-bot datadog-agent-integrations-bot bot commented Jan 12, 2026

Ignore this PR. Just used to test a workflow.

Backport 7477060 from #22252.


What does this PR do?

Prevents JSON decode errors when the Spark driver is starting but not quite ready.
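As a rough illustration of the failure mode (not the code from this PR), the sketch below treats a non-JSON body containing the "Spark is starting up" message as "not ready yet" instead of letting the decode error propagate. The function name `parse_driver_response` and the constant `STARTUP_MARKER` are hypothetical, and the exact marker text is an assumption.

```python
import json

# Assumed marker text; the real startup message may be worded differently.
STARTUP_MARKER = "spark is starting up"


def parse_driver_response(body):
    """Return parsed JSON, or None if the driver says it is still starting.

    Hypothetical helper for illustration only.
    """
    try:
        return json.loads(body)
    except json.JSONDecodeError:
        # A plain-text startup notice is expected during driver init.
        if STARTUP_MARKER in body.lower():
            return None
        # Any other non-JSON body is still a genuine error.
        raise
```

A caller could skip metric collection for that run when `None` comes back, and report an error only after a configurable number of consecutive retries.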

Motivation

Customer inquiry via support ticket.

Review checklist (to be filled by reviewers)

  • Feature or bugfix MUST have appropriate tests (unit, integration, e2e)
  • Add the qa/skip-qa label if the PR doesn't need to be tested during QA.
  • If you need to backport this PR to another branch, you can add the backport/<branch-name> label to the PR and it will automatically open a backport PR once this one is merged

* Fix for Spark driver init readiness pending

* Add changelog

* Lint

* Add config param for retries

* Simplify description

* Address comments

* Update spark/assets/configuration/spec.yaml

Co-authored-by: Steven Yuen <[email protected]>

* ddev sync models and config

---------

Co-authored-by: Steven Yuen <[email protected]>
(cherry picked from commit 7477060)
@datadog-agent-integrations-bot datadog-agent-integrations-bot bot requested a review from a team as a code owner January 12, 2026 20:30
@steveny91 steveny91 changed the title from "[Backport 7.75.x] Fix for Spark driver init readiness pending" to "Test [Backport 7.75.x] Fix for Spark driver init readiness pending" Jan 12, 2026
@steveny91 steveny91 marked this pull request as draft January 12, 2026 20:34
codecov bot commented Jan 12, 2026

Codecov Report

❌ Patch coverage is 98.68421% with 1 line in your changes missing coverage. Please review.
⚠️ Please upload report for BASE (7.75.x@99d4ee0). Learn more about missing BASE report.

@chatgpt-codex-connector chatgpt-codex-connector bot left a comment

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 42b986748f

Comment on lines +739 to +740
# Reset startup retry counter on successful JSON parse
self._startup_retry_count = 0

P2: Avoid global retry reset across different Spark drivers

_startup_retry_count is a single counter for all requests and gets reset on any successful JSON parse. In driver or YARN modes, the check makes multiple _rest_request_to_json calls to different tracking URLs in the same run; if one app responds with valid JSON, the counter resets even if another app keeps returning the “Spark is starting up” message. That means the retry limit can be perpetually skipped for a stuck driver, so its startup error is never surfaced as CRITICAL and metrics stay silently missing. Consider tracking retries per address/driver or only resetting the counter for the same URL that succeeded.
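One way the per-address suggestion could look is sketched below. This is a minimal sketch, not the check's actual code: the class `StartupRetryTracker` and its methods are hypothetical names, and it assumes the caller knows which tracking URL each response came from.

```python
from collections import defaultdict


class StartupRetryTracker:
    """Hypothetical helper: counts startup retries per tracking URL."""

    def __init__(self, max_retries):
        self.max_retries = max_retries
        self._counts = defaultdict(int)

    def record_startup(self, url):
        """Record one 'still starting' response for url.

        Returns True once this URL has exceeded max_retries.
        """
        self._counts[url] += 1
        return self._counts[url] > self.max_retries

    def record_success(self, url):
        """Reset the counter for this URL only, leaving others untouched."""
        self._counts.pop(url, None)

    def count(self, url):
        """Current retry count for url (0 if never seen or already reset)."""
        return self._counts.get(url, 0)
```

With this shape, a healthy driver that keeps returning valid JSON resets only its own entry, so a stuck driver at another address still reaches the retry limit and can be surfaced as CRITICAL.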
