Conversation
Pull request overview
This PR adds lightweight instrumentation to the gettrace endpoint to identify performance bottlenecks in the request processing pipeline. The instrumentation is focused on the _process_results function, which is hypothesized to be a bottleneck, while also adding spans to other key functions for high-level visibility.
Changes:
- Added @with_span(op="function") decorators to _build_query, _process_results, and _query_item_group for Sentry APM tracing
- Added detailed timing measurements within _process_results to track time spent adding and sorting attributes per row (see the sketch below)
- Wrapped the response assembly logic in _execute with a span for visibility
- Added span data to record rows processed and cumulative time spent on attribute processing operations
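To make the changes above concrete, here is a minimal sketch of the shape of the instrumentation. It uses sentry_sdk.start_span directly rather than the project's with_span decorator, and build_attributes / build_item are hypothetical stand-ins for the real attribute-building and GetTraceResponse.Item construction steps, so treat it as illustrative rather than the actual diff.

```python
import time
from operator import attrgetter
from types import SimpleNamespace

import sentry_sdk


def build_attributes(row):
    # Hypothetical stand-in for the per-row "add attributes" step.
    return {
        name: SimpleNamespace(key=SimpleNamespace(name=name), value=value)
        for name, value in row.items()
    }


def build_item(sorted_attributes):
    # Hypothetical stand-in for GetTraceResponse.Item construction.
    return {"attributes": sorted_attributes}


def _process_results(rows):
    with sentry_sdk.start_span(op="function", description="_process_results") as span:
        add_seconds = 0.0
        sort_seconds = 0.0
        items = []

        for row in rows:
            t0 = time.perf_counter()
            attributes = build_attributes(row)
            t1 = time.perf_counter()
            items.append(
                build_item(sorted(attributes.values(), key=attrgetter("key.name")))
            )
            # t2 also covers item construction, which is what one review
            # comment below is about.
            t2 = time.perf_counter()

            add_seconds += t1 - t0
            sort_seconds += t2 - t1

        # Aggregated timings are attached as span data rather than per-row spans.
        span.set_data("rows_processed", len(rows))
        span.set_data("add_attributes_seconds", add_seconds)
        span.set_data("sort_attributes_seconds", sort_seconds)
        return items
```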
@@ -527,8 +541,23 @@ def add_attribute(key: str, value: Any) -> None:
        key=attrgetter("key.name"),
    ),
)

# End timing sorting
t2 = time.perf_counter()
The timing measurement for "sort_attributes_seconds" includes not only the sorting operation but also the construction of the GetTraceResponse.Item object. If the goal is to specifically measure the sorting performance, consider moving t2 = time.perf_counter() immediately after the sorted() operation completes, before the GetTraceResponse.Item construction. This would provide more accurate timing for the sorting operation alone versus the entire item construction process.
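A toy illustration of that suggestion, using plain-Python stand-ins instead of the real attribute and GetTraceResponse.Item types so the snippet is self-contained; the only point is where t2 is captured:

```python
import time
from operator import attrgetter
from types import SimpleNamespace

attributes = {
    "b": SimpleNamespace(key=SimpleNamespace(name="b"), value=2),
    "a": SimpleNamespace(key=SimpleNamespace(name="a"), value=1),
}

t1 = time.perf_counter()
sorted_attributes = sorted(attributes.values(), key=attrgetter("key.name"))
t2 = time.perf_counter()  # end of the sort timing, before item construction
item = {"attributes": sorted_attributes}  # stand-in for GetTraceResponse.Item(...)
sort_attributes_seconds = t2 - t1
print(sort_attributes_seconds, [a.key.name for a in item["attributes"]])
```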
Protobuf message construction is negligible overhead
)

# End timing sorting
t2 = time.perf_counter()
Why don't we have 2 spans here instead of manually getting timings for this? That's what spans are for.
We do the attribute adding and sorting for every row, so if we had 2 spans here, we'd have 2 spans for every row, which is a lot of spans and would bloat the trace waterfall. We also don't care about the adding and sorting time for individual rows; we care about the aggregated times across all rows in the returned data.
A span represents a contiguous operation, but the loop interleaves attribute addition and attribute sorting across all the rows.
I could create two spans after the loop completes and manually set timestamps to reflect the aggregated durations, but that felt hacky.
Then separate the two operations into two different loops. It's important to be able to visualize where the time is spent in the trace waterfall, so we need spans for this. Attributes with some timing data don't fully get us there, and people will still have to ask where the time is spent.
Plus, if sorting ends up being a problem, it's easier to remove too.
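A rough sketch of what that restructuring could look like, assuming two passes over the rows, each wrapped in its own span via sentry_sdk.start_span; parse_row and build_item here are hypothetical helpers, not functions from this PR:

```python
import sentry_sdk


def parse_row(row):
    # Hypothetical stand-in for the "add attributes" pass: build the
    # attribute dict for one row.
    return dict(row)


def build_item(attributes):
    # Hypothetical stand-in for the "sort attributes + build item" pass.
    return {"attributes": sorted(attributes.items())}


def process_results(rows):
    # First pass: attribute building, visible as its own span.
    with sentry_sdk.start_span(op="function", description="add_attributes"):
        parsed_rows = [parse_row(row) for row in rows]

    # Second pass: sorting and item construction, visible as a separate span.
    with sentry_sdk.start_span(op="function", description="sort_attributes"):
        return [build_item(attributes) for attributes in parsed_rows]


print(process_results([{"b": 2, "a": 1}]))
```

Two sibling spans under _process_results would then show directly in the waterfall how the total time splits between the two passes.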
743ee28 to e1313eb
This reverts commit e1313eb.
385d9ba to aac6a31
        value=attribute_value,
    )
# First pass: parse rows and build attribute dicts
parsed_rows: list[tuple[str, Timestamp, dict[str, GetTraceResponse.Item.Attribute]]] = []
alternatively, we can just create the GetTraceResponse.Item with unsorted attributes, and then sort them later. I decided against this approach because Protobuf's RepeatedCompositeContainer.sort() internally copies elements to a Python list, sorts it, clears the container, and re-adds them all. So each attribute gets inserted into the protobuf container twice — once during construction (unsorted), and once during the sort (after clear + re-add).
Is that extra insertion cost going to be dwarfed by the sorting overhead itself? Maybe. Best to keep things the same as before (a single loop).
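For concreteness, here is a sketch of the single-loop, sort-before-construction ordering being defended here, with plain-Python stand-ins for the protobuf classes. The rejected alternative would construct the item first and then call sort(key=...) on its repeated attributes field, which copies, clears, and re-inserts every element.

```python
from operator import attrgetter
from types import SimpleNamespace


def make_attribute(name, value):
    # Stand-in for GetTraceResponse.Item.Attribute.
    return SimpleNamespace(key=SimpleNamespace(name=name), value=value)


def make_item(item_id, attributes):
    # Stand-in for GetTraceResponse.Item; the real constructor copies the
    # attributes into a repeated protobuf field exactly once.
    return {"id": item_id, "attributes": list(attributes)}


def process(rows):
    items = []
    for item_id, raw in rows:
        attrs = {name: make_attribute(name, value) for name, value in raw.items()}
        # Sort once, before construction, instead of sorting the repeated
        # field after the item is built.
        items.append(
            make_item(item_id, sorted(attrs.values(), key=attrgetter("key.name")))
        )
    return items


print(process([("row-1", {"b": 2, "a": 1})]))
```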
aac6a31 to 3a7151d
Approach:
- Goal: get a high-level signal on where the slowdown is (e.g., validate whether post-processing is the issue), not detailed diagnostics (e.g., the longest time it took to process a row, or how many attributes that row had).
- Plan to iterate across multiple PRs to progressively narrow down the bottleneck.
- Avoid instrumenting every part of the endpoint to prevent trace waterfall bloat.
- Hypothesis: _process_results is the likely bottleneck, so instrumentation is focused there.
- Instrumentation is intentionally lightweight until we have stronger evidence.
- Some instrumentation may be removed as we rule out potential bottlenecks.
execute
├── _query_item_group ×N (one per item)
│ ├── _build_query
│ ├── run_query
│ └── _process_results
│ ├── add_attributes
│ └── sort_attributes
└── assemble_response
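For reference, a with_span decorator of the kind this hierarchy relies on is typically a thin wrapper around sentry_sdk.start_span; the sketch below shows the general pattern only, not this project's implementation.

```python
import functools

import sentry_sdk


def with_span(op="function"):
    # Each decorated call opens a child span of whatever span is currently
    # active, which is what produces the nesting shown above.
    def decorator(func):
        @functools.wraps(func)
        def wrapper(*args, **kwargs):
            with sentry_sdk.start_span(op=op, description=func.__name__):
                return func(*args, **kwargs)

        return wrapper

    return decorator


@with_span(op="function")
def _build_query():
    return {}  # placeholder body
```

Because _build_query, run_query, and _process_results are called from inside _query_item_group, their spans nest under its span automatically.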