fix: improve telemetry spans #1472
Ran on local devnet and observed the Aggregator spans even when
The problem with the events is that they are associated with a parent span, but if for some reason we don't finalize it, we lose all the associated events.
MarcosNicolau
left a comment
Tested on macOS and it is working well.
Addressed in #1670.


Improve Telemetry Spans
Motivation
We found that the Batcher sometimes tries to cancel batches that were actually included in the network, calling the batcherTaskCreationFailed endpoint, which finalizes the trace and prevents the Aggregator from registering its spans in the trace.
Description
batcherTaskCreationFailed occurs.
Observations
On a real batcherTaskCreationFailed, the Aggregator won't receive the new task, and the trace will remain unfinished. Furthermore, the trace metadata won't be removed from the Telemetry server store. Despite that, we will be able to visualize the orphan spans with a warning that their parent ID is invalid.
#1477 was created to address this issue.
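To make the observation about orphan spans concrete, here is a minimal toy model in Rust. It is not the Telemetry server code: ToyCollector, its methods, and the span names are all hypothetical. It only assumes that finished spans are exported on their own, while events are buffered on their parent span and exported when that parent is finished.

```rust
use std::collections::HashMap;

/// A span as the toy collector sees it. `parent_id` is only a reference,
/// so a child can be exported even if its parent never is.
#[derive(Debug, Clone, PartialEq)]
pub struct Span {
    pub id: u64,
    pub parent_id: Option<u64>,
    pub name: String,
}

/// Hypothetical collector: spans are exported when finished; events are
/// buffered on their open span and exported only together with it.
#[derive(Default)]
pub struct ToyCollector {
    open: HashMap<u64, (Span, Vec<String>)>,
    exported_spans: Vec<Span>,
    exported_events: Vec<(u64, String)>,
}

impl ToyCollector {
    /// Open a new span, optionally referencing a parent by id.
    pub fn start_span(&mut self, id: u64, parent_id: Option<u64>, name: &str) {
        let span = Span { id, parent_id, name: name.to_string() };
        self.open.insert(id, (span, Vec::new()));
    }

    /// Attach an event to an open span. Returns false if the span is not open.
    pub fn add_event(&mut self, span_id: u64, event: &str) -> bool {
        match self.open.get_mut(&span_id) {
            Some((_, events)) => {
                events.push(event.to_string());
                true
            }
            None => false,
        }
    }

    /// Finish a span: export it together with its buffered events.
    pub fn finish_span(&mut self, id: u64) -> bool {
        match self.open.remove(&id) {
            Some((span, events)) => {
                self.exported_events
                    .extend(events.into_iter().map(|e| (id, e)));
                self.exported_spans.push(span);
                true
            }
            None => false,
        }
    }

    /// Number of events that actually reached the export side.
    pub fn exported_event_count(&self) -> usize {
        self.exported_events.len()
    }

    /// Exported spans whose parent was never exported (orphans).
    pub fn orphan_spans(&self) -> Vec<&Span> {
        self.exported_spans
            .iter()
            .filter(|s| match s.parent_id {
                Some(p) => !self.exported_spans.iter().any(|o| o.id == p),
                None => false,
            })
            .collect()
    }
}

/// Scenario from the observation: the root span is never finished,
/// but a child span is started and finished.
pub fn unfinished_trace_demo() -> ToyCollector {
    let mut c = ToyCollector::default();
    c.start_span(1, None, "root");
    c.add_event(1, "event on root");
    c.start_span(2, Some(1), "aggregator-child");
    c.finish_span(2);
    c // span 1 is intentionally left open
}
```

In this model the child span is still visible as an orphan with a dangling parent ID, while the event recorded on the unfinished root never reaches the export side. That matches the trade-off discussed in the conversation: independent spans survive an unfinished trace, events attached to the parent do not.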
How To Test
Run anvil and all Aligned components with one or more operators, then start telemetry.
Go to Jaeger and explore the generated traces.
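The next step injects a failure after the receipt has been received. As a rough sketch of that idea only: the function, types, and flag below are hypothetical stand-ins and do not reproduce the real create_new_task_retryable signature or the Batcher's error types.

```rust
/// Hypothetical stand-in for a transaction receipt.
#[derive(Debug, Clone, PartialEq)]
pub struct Receipt {
    pub tx_hash: String,
}

/// Hypothetical error type for this sketch.
#[derive(Debug, PartialEq)]
pub enum TaskError {
    /// The send itself failed, so no receipt was obtained.
    SendFailed(String),
    /// The task was actually included, but we pretend it failed.
    InjectedAfterReceipt(String),
}

/// Runs `send`, and if `inject_failure` is set, returns an error even
/// though a receipt was obtained. This mimics a Batcher that reports a
/// task creation failure for a batch that was in fact included.
pub fn create_task_with_injected_failure<F>(
    send: F,
    inject_failure: bool,
) -> Result<Receipt, TaskError>
where
    F: FnOnce() -> Result<Receipt, String>,
{
    // A genuine send failure is propagated unchanged.
    let receipt = send().map_err(TaskError::SendFailed)?;

    // Test-only fault: discard the successful receipt and report an error.
    if inject_failure {
        return Err(TaskError::InjectedAfterReceipt(receipt.tx_hash));
    }

    Ok(receipt)
}
```

The point of injecting the error after the receipt, rather than before sending, is that the task still reaches the Aggregator, so its spans can be checked against a trace the Batcher has already reported as failed.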
Change the Batcher create_new_task_retryable function in batcher/aligned-batcher/src/retry/batcher_retryables.rs:165 to return an error after receiving the receipt.
Then, start all components again and you should be able to see the Aggregator spans even when the Batcher sends Batcher - Task Creation Failed.
Type of change
Please delete options that are not relevant.
Checklist
testnet, everything else to staging