feat(crashtracking): capture unhandled exception with the crashtracker by gyuheon0h · Pull Request #5321 · DataDog/dd-trace-rb

gyuheon0h · 2026-02-05T20:51:57Z

What does this PR do?
This PR adds support for crash report collection and emission for ruby unhandled exceptions. We do this by hooking into at_exit and accessing the exception stack. We send the exception stack over from the Ruby side to the native code side, and use it to build a crash report. We also send a crash ping, mainly for parity.

Native stack collection planned to be implemented but is out of scope for this stage.

Motivation:
Nice to see non-signal based crashes (not captured by regular errortracking) and was a feature request from SSI team.

Ticket: PROF-13673
Change log entry
Yes. Crashtracking: unhandled exceptions are caught and reported by the crashtracker

Additional Notes:

How to test the change?
Unit tests

Run a test ruby program instrumented with the crashtracker and look at the report being sent.

{
  "data_schema_version": "1.4",
  "error": {
    "is_crash": true,
    "kind": "UnhandledException",
    "message": "Unhandled ArgumentError: Test argument crash",
    "source_type": "Crashtracking",
    "stack": {
      "format": "Datadog Crashtracker 1.0",
      "frames": [
        {
          "file": "/home/bits/go/src/github.com/DataDog/dd-trace-rb/spec/datadog/core/crashtracking/component_spec.rb",
          "function": "block (4 levels) in <top (required)>",
          "line": 161
        },
        {
          "file": "/home/bits/go/src/github.com/DataDog/dd-trace-rb/spec/datadog/core/crashtracking/component_spec.rb",
          "function": "block (6 levels) in <top (required)>",
          "line": 168
        },
        ...
        {
          "file": "/var/lib/gems/3.0.0/gems/rspec-core-3.13.6/lib/rspec/core/runner.rb",
          "function": "invoke",
          "line": 45
        },
        {
          "file": "/var/lib/gems/3.0.0/gems/rspec-core-3.13.6/exe/rspec",
          "function": "<top (required)>",
          "line": 4
        },
        {
          "file": "/usr/local/bin/rspec",
          "function": "load",
          "line": 25
        },
        {
          "file": "/usr/local/bin/rspec",
          "function": "<main>",
          "line": 25
        }
      ],
      "incomplete": false
    }
  },
  "incomplete": false,
  "metadata": {
    "library_name": "dd-trace-rb",
    "library_version": "2.29.0",
    "family": "ruby",
    "tags": [
      "tag1:value1",
      "tag2:value2",
      "language:ruby-testing-123",
      "service:ruby-testing-123"
    ]
  },
  "os_info": {
    "architecture": "x86_64",
    "bitness": "64-bit",
    "os_type": "Ubuntu",
    "version": "22.4.0"
  },
  "proc_info": {
    "pid": 220117
  },
  "timestamp": "2026-02-06 00:25:31.590807434 UTC",
  "uuid": "9082567b-686a-4897-95cb-e596c929ba78"
}

github-actions · 2026-02-05T20:52:08Z

Thank you for updating Change log entry section 👏

^{Visited at: 2026-02-10 01:22:21 UTC}

datadog-official · 2026-02-06T00:12:23Z

✅ Tests

🎉 All green!

❄️ No new flaky tests detected
🧪 All tests passed

🎯 Code Coverage
• Patch Coverage: 87.37%
• Overall Coverage: 95.14% (-0.04%)

View detailed report

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: ba2fa9e | Docs | Datadog PR Page | Was this helpful? Give us feedback!}

pr-commenter · 2026-02-06T00:27:47Z

Benchmarks

Benchmark execution time: 2026-02-10 14:30:49

Comparing candidate commit ba2fa9e in PR branch gyuheon0h/capture-non-signal-crash with baseline commit 7631952 in branch master.

Found 0 performance improvements and 0 performance regressions! Performance is the same for 44 metrics, 2 unstable metrics.

ivoanjo

I've given it a pass!

ext/libdatadog_api/ruby_crash_reporting.c

spec/datadog/core/crashtracking/component_spec.rb

ext/libdatadog_api/crashtracker.c

ext/libdatadog_api/crashtracker.h

ext/libdatadog_api/crashtracker_report_exception.c

lib/datadog/core/crashtracking/component.rb

p-datadog

I read the C code and while nothing jumped out at me I also don't know if everything there is correct.

I left comments for the Ruby code.

In general, since we do have a crash tracker for crashes, I would like to see "unhandled exceptions" (and more precisely, "unhandled exceptions on main thread") NOT be referred to as "crashes" in Ruby code or documentation. I understand that eventually the libdatadog data structures will be created that have "crash" in their name, but I would prefer to see everything upstream of that use correct terminology and refer to "unhandled exceptions".

lib/datadog/core.rb

lib/datadog/core/crashtracking/component.rb

ext/libdatadog_api/crashtracker_report_exception.c

spec/datadog/core/crashtracking/component_spec.rb

lib/datadog/core/crashtracking/component.rb

ivoanjo

Gave it another pass!

ext/libdatadog_api/crashtracker_report_exception.c

lib/datadog/core/crashtracking/component.rb

spec/datadog/core/crashtracking/component_spec.rb

ivoanjo · 2026-02-09T11:52:20Z

In general, since we do have a crash tracker for crashes, I would like to see "unhandled exceptions" (and more precisely, "unhandled exceptions on main thread") NOT be referred to as "crashes" in Ruby code or documentation. I understand that eventually the libdatadog data structures will be created that have "crash" in their name, but I would prefer to see everything upstream of that use correct terminology and refer to "unhandled exceptions".

+1 on this -- I suggest updating the PR title and changelog entry as well.

github-actions · 2026-02-09T15:56:01Z

Typing analysis

Note: Ignored files are excluded from the next sections.

Untyped methods

This PR introduces 1 partially typed method, and clears 1 partially typed method. It increases the percentage of typed methods from 59.87% to 59.96% (+0.09%).

Partially typed methods (+1-1)

❌ Introduced:

sig/datadog/core/crashtracking/component.rbs:15
└── def initialize: (
          tags: ::Hash[::String, ::String],
          agent_base_url: ::String,
          ld_library_path: ::String,
          path_to_crashtracking_receiver_binary: ::String,
          logger: untyped
        ) -> void

✅ Cleared:

sig/datadog/core/crashtracking/component.rbs:11
└── def initialize: (
          tags: ::Hash[::String, ::String],
          agent_base_url: ::String,
          ld_library_path: ::String,
          path_to_crashtracking_receiver_binary: ::String,
          logger: untyped
        ) -> void

Untyped other declarations

This PR introduces 1 untyped other declaration, and clears 1 untyped other declaration.

Untyped other declarations (+1-1)

❌ Introduced:

sig/datadog/core/crashtracking/component.rbs:35
└── attr_reader logger: untyped

✅ Cleared:

sig/datadog/core/crashtracking/component.rbs:31
└── attr_reader logger: untyped

If you believe a method or an attribute is rightfully untyped or partially typed, you can add # untyped:accept on the line before the definition to remove it from the stats.

Signal based crash report (crash done, need to do ping) Revert "Gitignore weird files that keep popping up (will pop this commit later)" This reverts commit aeb3017. Revert "Remove VS Code config files from tracking" This reverts commit 2b30b86. Use locations array Clean Lazy logging Fix memory leak

Fmt fmt Do work on ruby side, fix sus calls Remove noisy log Update symbol name Check result, build message in ruby unit test and test cleanup Inline + no order dependency + cleanup Number of frames logic on ruby side frame processing in helper Restore accidentally deleted comment Update tags on fork Fmt Fix potential mem leak move to core clean Extract into helper Fix more potential leaks Fmt Remove comment from Ruby exception crash reporting context Removed comment about Ruby exception crash reporting tests. Respond to oleg -(rescuing all exceptions) Flip negation No more do-while, crash vs exception naming, test sleep fix, minor refactoring Tag builder helper func, move all logic into ct component, move builder into build function

Trigger CI rbs file Trigger CI CI debug We need to explicitly check, not depend on order Be explicit with typing Trigger CI Incomplete stack Clarity in tests

ivoanjo

👍 LGTM, I like this latest iteration!

ext/libdatadog_api/crashtracker_report_exception.c

lib/datadog/core/crashtracking/component.rb

spec/datadog/core/crashtracking/component_spec.rb

gleocadie · 2026-02-10T16:12:05Z

ext/libdatadog_api/crashtracker_report_exception.c

@@ -0,0 +1,205 @@
+#include <datadog/common.h>


I know that it's crashing, but we could have call ddog_Error_drop on the error. (cleaner)

Uh, right, we are definitely leaking the error, if it ever gets triggered -- very worth cleaning it up. Can you look into it @gyuheon0h ?

(Again, this API is sooooooooo awkward and I'm looking forward to actually having libdatadog handle things in a much nicer way instead of dropping all this complexity on the Ruby side)

Remove VS Code config files from tracking

2b30b86

gyuheon0h marked this pull request as ready for review February 5, 2026 20:52

gyuheon0h requested review from a team as code owners February 5, 2026 20:52

gyuheon0h marked this pull request as draft February 5, 2026 20:52

github-actions bot added the core Involves Datadog core libraries label Feb 5, 2026

gyuheon0h force-pushed the gyuheon0h/capture-non-signal-crash branch 2 times, most recently from c5d3fce to e4b1623 Compare February 5, 2026 21:35

gyuheon0h marked this pull request as ready for review February 6, 2026 01:05

ivoanjo reviewed Feb 6, 2026

View reviewed changes

gyuheon0h requested a review from ivoanjo February 6, 2026 19:16

gyuheon0h force-pushed the gyuheon0h/capture-non-signal-crash branch from 6f5fc9b to 25077d0 Compare February 6, 2026 19:50

p-datadog reviewed Feb 6, 2026

View reviewed changes

gyuheon0h force-pushed the gyuheon0h/capture-non-signal-crash branch from 87fad47 to 808b3f6 Compare February 6, 2026 22:16

gleocadie reviewed Feb 6, 2026

View reviewed changes

ext/libdatadog_api/crashtracker_report_exception.c Outdated Show resolved Hide resolved

p-datadog reviewed Feb 7, 2026

View reviewed changes

ivoanjo reviewed Feb 9, 2026

View reviewed changes

gyuheon0h changed the title ~~feat(crashtracking): capture non signal based crashes~~ feat(crashtracking): capture unhandled exception based crashes Feb 9, 2026

gyuheon0h changed the title ~~feat(crashtracking): capture unhandled exception based crashes~~ feat(crashtracking): capture unhandled exception with the crashtracker Feb 9, 2026

gyuheon0h requested a review from ivoanjo February 9, 2026 15:57

gyuheon0h force-pushed the gyuheon0h/capture-non-signal-crash branch from ea94407 to 3fabed1 Compare February 9, 2026 18:33

gyuheon0h added 3 commits February 9, 2026 20:37

Comprehensive testing

2c143cb

Timestamp is handled by libdatadog

c0aec88

Trigger CI rbs file Trigger CI CI debug We need to explicitly check, not depend on order Be explicit with typing Trigger CI Incomplete stack Clarity in tests

gyuheon0h force-pushed the gyuheon0h/capture-non-signal-crash branch from 9050813 to c0aec88 Compare February 9, 2026 20:37

ivoanjo approved these changes Feb 10, 2026

View reviewed changes

Final cleanups

ba2fa9e

gyuheon0h merged commit 40f46ee into master Feb 10, 2026
1459 of 1466 checks passed

gyuheon0h deleted the gyuheon0h/capture-non-signal-crash branch February 10, 2026 14:57

github-actions bot added this to the 2.29.0 milestone Feb 10, 2026

gleocadie reviewed Feb 10, 2026

View reviewed changes

gyuheon0h mentioned this pull request Feb 10, 2026

chore(crashtracking): update max frames collected by unhandled exception collector to 512 and clarify comment #5345

Merged

Conversation

gyuheon0h commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

github-actions bot commented Feb 5, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

datadog-official bot commented Feb 6, 2026 • edited by datadog-datadog-prod-us1 bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

pr-commenter bot commented Feb 6, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Benchmarks

Uh oh!

ivoanjo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

p-datadog left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ivoanjo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

ivoanjo commented Feb 9, 2026

Uh oh!

github-actions bot commented Feb 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Typing analysis

Untyped methods

Untyped other declarations

Uh oh!

ivoanjo left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

gyuheon0h commented Feb 5, 2026 •

edited

Loading

github-actions bot commented Feb 5, 2026 •

edited

Loading

datadog-official bot commented Feb 6, 2026 •

edited by datadog-datadog-prod-us1 bot

Loading

pr-commenter bot commented Feb 6, 2026 •

edited

Loading

github-actions bot commented Feb 9, 2026 •

edited

Loading