
Conversation

@duncanista (Contributor) commented on Feb 6, 2025

What?

Provides a proposal to get the tags from the incoming top-level span and attach them to the aws.lambda span generated in this project.

Motivation

SVLS-4645

  • Top level span gets custom tags directly set on the span
  • Missing metrics are set on top level span

Also refactored to add a quick constructor with `request_id`.
I don't like this type of state sharing, though.
It's against my will, but this would solve a lot of problems that we'd otherwise have to solve in more obscure ways.
This is so we can pass it to the trace agent.
@duncanista requested a review from a team as a code owner on February 6, 2025 22:52
}
if span.resource == INVOCATION_SPAN_RESOURCE {
    let mut guard = context_buffer.lock().expect("lock poisoned");
    if let Some(request_id) = span.meta.get("request_id") {
Contributor Author (@duncanista) commented:

We assume the top-level span includes the request_id, allowing us to match it to the data in the context buffer.

This is done in Ruby and Go.

.NET would have to send the top-level span first, since that one currently gets ditched and never arrives. It should be a quick fix, but it might need access to the context so we can tag the current span with the request_id.

Not sure how Java handles this.
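For illustration, a minimal sketch of that matching under simplified assumptions; the Span, Context, and ContextBuffer definitions below are illustrative stand-ins, not the crate's real types:

    use std::collections::HashMap;

    const INVOCATION_SPAN_RESOURCE: &str = "aws.lambda";

    // Simplified stand-ins for the real span/context types.
    struct Span {
        resource: String,
        meta: HashMap<String, String>,
    }

    #[derive(Default)]
    struct Context {
        // Tags copied from the tracer's top-level span.
        tags: HashMap<String, String>,
    }

    #[derive(Default)]
    struct ContextBuffer {
        contexts: HashMap<String, Context>,
    }

    impl ContextBuffer {
        // If the incoming span is the top-level invocation span and carries a
        // request_id, merge its tags into the buffered context for that request.
        fn enrich_from_top_level_span(&mut self, span: &Span) {
            if span.resource != INVOCATION_SPAN_RESOURCE {
                return;
            }
            if let Some(request_id) = span.meta.get("request_id") {
                if let Some(context) = self.contexts.get_mut(request_id) {
                    for (key, value) in &span.meta {
                        context.tags.insert(key.clone(), value.clone());
                    }
                }
            }
        }
    }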

@duncanista changed the title from "experimental: get tags from top level span from tracer" to "feat: get tags from top level span from tracer" on Mar 20, 2025
}
}

impl Default for Context {
Contributor commented:

Do we only need this for tests? If so, can we wrap it in a test macro? Defaults can cause subtle bugs.

Contributor Author (@duncanista) replied:

We do use it for the logs processor, where the context always starts out as the default (empty) value.
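A minimal sketch of that usage with illustrative fields (not the real Context definition): the logs processor starts from an empty default before the first invocation, so the impl is needed outside of tests.

    use std::collections::HashMap;

    struct Context {
        tags: HashMap<String, String>,
        runtime_duration_ms: f64,
    }

    // Needed in production: the logs processor starts with an empty context
    // before the first invocation is seen, not only in tests.
    impl Default for Context {
        fn default() -> Self {
            Self {
                tags: HashMap::new(),
                runtime_duration_ms: 0.0,
            }
        }
    }

    // If it were test-only, it could instead be gated:
    // #[cfg(test)]
    // impl Default for Context { ... }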

_ = offsets.process_chan_tx.send(());
}
}
drop(context_buffer);
Contributor commented:

Are we sure we need the manual drop here? Doesn't the closure on 307 help us by 581? Is the drop necessary? If we add new code after, we have to re-lock, right?

Contributor Author (@duncanista) replied:

Yeah, since we only await at the very end, I decided to lock in the last place we modify the context_buffer. In reality, I'm dropping it to be 100% sure that nothing strange happens. I'll add another comment to note that we drop because we await later, on 332.

Contributor commented:

A more idiomatic approach is to create

        let context_for_request = {
            let mut context_buffer = self.context_buffer.lock().expect("lock poisoned");
            context_buffer.add_runtime_duration(request_id, metrics.duration_ms);
            context_buffer.get(request_id)
        };

at line 278 and

        if let Some(context) = context_for_request {

at line 311.

This clearly drops the lock once it's no longer needed.
We have a number of useless drop() calls here and there in the code.

But, as in the previous comment, I would avoid holding the lock altogether.

Contributor Author (@duncanista) replied:

Instead, since I don't need to assign it to anything, I think I can refactor this by moving the first call down near where we actually use the get(...) method, and then use a scope to drop the lock directly 🤔
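A rough sketch of that scoping, with hypothetical names (on_runtime_done and some_async_send stand in for the real functions): the guard drops at the end of the block, before anything is awaited, with no manual drop().

    use std::collections::HashMap;
    use std::sync::{Arc, Mutex};

    #[derive(Default)]
    struct ContextBuffer {
        durations_ms: HashMap<String, f64>,
    }

    impl ContextBuffer {
        fn add_runtime_duration(&mut self, request_id: &str, duration_ms: f64) {
            self.durations_ms.insert(request_id.to_string(), duration_ms);
        }
    }

    async fn some_async_send() {}

    async fn on_runtime_done(
        context_buffer: Arc<Mutex<ContextBuffer>>,
        request_id: &str,
        duration_ms: f64,
    ) {
        // The guard only lives inside this block, so the lock is released when
        // the block ends -- before any .await below.
        {
            let mut buffer = context_buffer.lock().expect("lock poisoned");
            buffer.add_runtime_duration(request_id, duration_ms);
        }

        // Safe to await here: the MutexGuard is already out of scope.
        some_async_send().await;
    }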

Ok(())
}

#[allow(clippy::too_many_arguments)]
Contributor commented:

lol maybe it's builder time

Contributor Author (@duncanista) replied:

For real — when we move the crate to the new repo, it might be a good exercise to re-implement this as a builder and see how much it simplifies things.
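For reference, a rough builder sketch; the fields are made up for illustration and are not the real constructor arguments behind the #[allow(clippy::too_many_arguments)]:

    #[derive(Default)]
    struct ProcessorBuilder {
        function_arn: Option<String>,
        runtime: Option<String>,
        memory_size_mb: Option<u32>,
    }

    struct Processor {
        function_arn: String,
        runtime: String,
        memory_size_mb: u32,
    }

    impl ProcessorBuilder {
        fn function_arn(mut self, arn: impl Into<String>) -> Self {
            self.function_arn = Some(arn.into());
            self
        }

        fn runtime(mut self, runtime: impl Into<String>) -> Self {
            self.runtime = Some(runtime.into());
            self
        }

        fn memory_size_mb(mut self, mb: u32) -> Self {
            self.memory_size_mb = Some(mb);
            self
        }

        // Each setter is chainable, so the call site reads like named arguments:
        // ProcessorBuilder::default().function_arn("arn:...").runtime("rust").build()
        fn build(self) -> Processor {
            Processor {
                function_arn: self.function_arn.unwrap_or_default(),
                runtime: self.runtime.unwrap_or_default(),
                memory_size_mb: self.memory_size_mb.unwrap_or(128),
            }
        }
    }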

Also added a comment on why we drop the lock.

@alexgallotta (Contributor) left a review comment:

Instead of passing the context buffer around under a lock all over the place,
if only the request_id is needed, can't it just be cloned as a String and passed?

If anything else is needed as well, the trace_agent already has the ServerlessTraceProcessor, which has the context buffer too.
It seems a bit convoluted to pass it around everywhere like that.
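Something like this rough sketch (handle_span and send_to_trace_agent are hypothetical names, not functions in this crate): clone the request_id once, and only the owned String travels onward.

    use std::collections::HashMap;
    use std::sync::{Arc, Mutex};

    #[derive(Default)]
    struct ContextBuffer {
        tags_by_request: HashMap<String, HashMap<String, String>>,
    }

    fn send_to_trace_agent(_request_id: String) {}

    fn handle_span(context_buffer: &Arc<Mutex<ContextBuffer>>, request_id_from_span: &str) {
        // Clone the request_id into an owned String up front.
        let request_id = request_id_from_span.to_string();

        // Lock only at the point of use, scoped so the guard drops immediately.
        {
            let mut buffer = context_buffer.lock().expect("lock poisoned");
            buffer.tags_by_request.entry(request_id.clone()).or_default();
        }

        // Downstream code gets the owned String; no lock or buffer reference
        // needs to be passed around.
        send_to_trace_agent(request_id);
    }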

pub struct Processor {
    // Buffer containing context of the previous 5 invocations
-   pub context_buffer: ContextBuffer,
+   pub context_buffer: Arc<Mutex<ContextBuffer>>,
Contributor commented:

Why is this public?
I tried removing the pub and running the tests, and they still pass.

Considering that the Processor is already guarded by an Arc<Mutex>, I'm wondering why we're wrapping this in another mutex and exposing it outside.

Contributor Author (@duncanista) replied:

I can remove the pub, and I'd like to remove the lock altogether, but I can't find a way to get the desired outcome without it.

The lifecycle processor is just another component with access to the context buffer, and the trace processor should have access to it too. They both need access because, if the tracer happens to send a top-level span, we need to make sure we add it to the current context for the given request_id; that way, whenever there's a platform runtime done event, we can attach the tracer's top-level span information onto the span we create. WDYT?
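A small sketch of that sharing with simplified structs (the real processors hold more state than this): both sides clone the same Arc, so tags added by the trace processor are visible when the lifecycle processor handles the platform runtime done event.

    use std::collections::HashMap;
    use std::sync::{Arc, Mutex};

    #[derive(Default)]
    struct ContextBuffer {
        tags_by_request: HashMap<String, HashMap<String, String>>,
    }

    struct LifecycleProcessor {
        context_buffer: Arc<Mutex<ContextBuffer>>,
    }

    struct TraceProcessor {
        context_buffer: Arc<Mutex<ContextBuffer>>,
    }

    fn main() {
        let shared = Arc::new(Mutex::new(ContextBuffer::default()));

        // Both processors hold a clone of the same Arc; a top-level span seen by
        // the trace processor enriches the context the lifecycle processor reads
        // when the platform runtime done event arrives.
        let _lifecycle = LifecycleProcessor { context_buffer: Arc::clone(&shared) };
        let _traces = TraceProcessor { context_buffer: Arc::clone(&shared) };
    }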

@duncanista merged commit 7724ddb into main on Mar 27, 2025
27 of 35 checks passed
@duncanista deleted the jordan.gonzalez/enrich-tags-from-tracer-top-level-span branch on March 27, 2025 19:08