Skip to content

Conversation

@OCHyams
Copy link
Contributor

@OCHyams OCHyams commented Sep 5, 2025

See #157090 description for motivation.

Printing the dap logs to stdout/err on failure sadly isn't enough (see #157130).

With this patch, we might be able to get clues about the flaky tests with well timed log inspection.

@github-actions
Copy link

github-actions bot commented Sep 5, 2025

✅ With the latest revision this PR passed the Python code formatter.

Copy link
Contributor

@SLTozer SLTozer left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Seems like the most robust way to get information on the ongoing intermittent failures, SGTM.

@OCHyams OCHyams merged commit a20fc93 into llvm:main Sep 8, 2025
9 checks passed
OCHyams added a commit that referenced this pull request Sep 9, 2025
OCHyams added a commit that referenced this pull request Sep 9, 2025
Reverts #157145

This was just temporary to help debug a flaky bot
OCHyams added a commit that referenced this pull request Sep 9, 2025
Buildbot cross-project-tests-sie-ubuntu has been unstable recently
(https://lab.llvm.org/buildbot/#/builders/181).

Dexter uses lldb-dap in these tests. Occasionally a one will fail with a
KeyError because of a missing "stackFrames" "source" "path".

I can't reproduce the failure locally, but with PRs #157130 and
#157145 I've got DAP logs from a pass and fail.

In a failure, "path" is missing for a step out of main (off the final brace),
and the passing test has one additional "module" event for libc.so.6.

```
<- {
  "body": {
    "module": {
      "addressRange": "0x7ffff7dd1000",
      "debugInfoSize": "4.9MB",
      "id": "5792732F-7831-58C6-6FB4-F3756458CA24-E46E827D",
      "name": "libc.so.6",
      "path": "/lib/x86_64-linux-gnu/libc.so.6",
      "symbolFilePath": "/usr/lib/debug/.build-id/57/92732f783158c66fb4f3756458ca24e46e827d.debug",
      "symbolStatus": "Symbols loaded."
    },
    "reason": "new"
  },
  "event": "module",
  "seq": 0,
  "type": "event"
}
```

That explains why we get a step that is missing a "path" component in the
failure. I don't understand why LLDB (or LLDB-DAP) is sometimes unable to load
the module (in time?). But "path" is an optional field anyway, so I think it's
worth handling in dexter even if LLDB's behaviour here is confusing.

This commit should stabilize the bot if the only time a module goes missing is
for steps outside main. (Ideally none would go missing, but those shouldn't
interfere with the tests).
llvm-sync bot pushed a commit to arm/arm-toolchain that referenced this pull request Sep 9, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants