Better root span management #999

Dwij1704 · 2025-05-22T10:49:09Z

📥 Pull Request

📘 Description
Closes #991

Enhance AgentOps SDK with tracing capabilities

Introduced start_trace and end_trace functions for user-managed tracing, allowing concurrent traces.
Updated init method to handle auto-starting traces and improved session management.
Refactored legacy session handling to integrate with new tracing architecture.
Deprecated the use of global active session in favor of trace context management.
Improved error handling and logging during SDK initialization and trace lifecycle.
Added trace decorator. Updated log_trace_url to include titles for improved logging context.

This is a hybrid approach:

Default Behavior (auto_start_session=True, which is the default for init):
agentops.init() will start a main trace (let's call it the "init trace" or "auto-trace").
This "init trace" will be automatically managed and ended when the program exits. This means we need to reinstate a form of atexit handling for this specific trace if and only if it was started by init and auto_start_session was true.
agentops.init() will not return this automatically managed session/trace object to the user. If it did, and the user also tried to end it, it could lead to double-ending.
Explicit Trace Management (agentops.start_trace, agentops.end_trace):
Users can still call agentops.start_trace() to create additional, independent traces.
agentops.start_trace() will return a Trace (or Session-like) object.
Users will be responsible for explicitly ending these traces using agentops.end_trace(trace_object).
These explicit traces will allow for multiple concurrent, user-managed traces, independent of the "init trace."
auto_start_session=False:
If agentops.init(auto_start_session=False) is called, no "init trace" is automatically started.
init() would return None in this case.
The user is then fully responsible for starting and ending all traces using agentops.start_trace() and agentops.end_trace().

🧪 Testing
View the test code gist

…racing, allowing concurrent traces.

…ving session handling. Added `trace`, `session`, `agent`, `task`, `workflow`, and `operation` decorators for better instrumentation. Updated `log_trace_url` to include titles for improved logging context. Refactored `Client` initialization trace name and adjusted end trace state handling. Improved error handling during trace logging in `TracingCore` and removed deprecated session decorator usage.

…gs`, `auto_init`, `skip_auto_end_session`, and `fail_safe`. Updated documentation to reflect changes and merged `tags` with `default_tags` for improved session management. Refactored client initialization to accommodate new options.

bboynton97 · 2025-05-23T15:57:36Z

closes #999

agentops/__init__.py

agentops/client/client.py

bboynton97

definitely want tests to pass before merging. we should also add tests for the multi-session functionality as well as a test script / example :)

very nice work Dwij!!

…ession` in `agentops` module. This change improves code clarity and aligns with the updated session management approach.

…cessor. Added tests for start and end trace URL logging, handling failures gracefully, and verifying root span tracking. Improved test coverage for session decorators and ensured proper handling of unsampled spans. Refactored existing tests for clarity and consistency.

…nd attribute tracking. Updated span creation to use dynamic workflow names and improved error handling. Adjusted span kinds from CLIENT to INTERNAL for better clarity in tracing. Streamlined attribute setting for agents and tasks, ensuring accurate logging of results and metrics.

…r, simplifying the flush process. Updated logging to indicate completion of the flush operation.

…ditions for the API key. Only trigger a warning if a different non-None API key is provided during re-initialization, enhancing the clarity of client behavior.

… legacy session wrapper. Update unit tests to enhance coverage for new session management functionality, including explicit trace handling and decorator behavior. Ensure proper integration between new and legacy APIs for session and trace management.

codecov · 2025-05-27T19:40:59Z

Codecov Report

Attention: Patch coverage is 67.47788% with 147 lines in your changes missing coverage. Please review.

Files with missing lines	Patch %	Lines
agentops/legacy/__init__.py	64.07%	37 Missing ⚠️
agentops/sdk/core.py	70.83%	35 Missing ⚠️
agentops/sdk/decorators/factory.py	70.21%	28 Missing ⚠️
agentops/client/client.py	64.38%	26 Missing ⚠️
agentops/client/api/versions/v3.py	29.41%	12 Missing ⚠️
agentops/__init__.py	69.56%	7 Missing ⚠️
agentops/config.py	75.00%	1 Missing ⚠️
agentops/instrumentation/crewai/instrumentation.py	0.00%	1 Missing ⚠️

📢 Thoughts on this report? Let us know!

…xception handling for token fetching and response processing, ensuring clearer error logging and re-raising of exceptions for better testability. Updated integration tests to reset client state between tests.

…on and error handling. Mock API client and tracing core to avoid real authentication during tests. Simplify concurrency test descriptions and ensure proper cleanup of client state between tests.

…ng all active traces when no context is provided. Refactor end_all_sessions to utilize the new end_trace functionality, ensuring legacy global state is cleared. Introduce thread-safe handling of active traces in TracingCore with locking mechanisms for improved concurrency.

… for invalid trace IDs. This ensures robustness when dealing with mocked spans or non-integer trace IDs, improving overall trace management reliability.

…agentops into better-root-span-management

…figuration and initialization. This allows for customizable trace/session naming, improving trace management and clarity in logs. Updated relevant classes and methods to utilize the new parameter.

…agentops into better-root-span-management

Copilot

Pull Request Overview

This PR improves root span management in the AgentOps SDK by enhancing trace lifecycle handling and updating the tracing API. Key changes include:

Introducing explicit start_trace and end_trace functions with improved configuration (including a trace name parameter)
Refactoring session management in the client and deprecating legacy session handling
Updating decorator behavior and telemetry shutdown procedures

Reviewed Changes

Copilot reviewed 16 out of 16 changed files in this pull request and generated 1 comment.

Show a summary per file

File	Description
tests/integration/test_session_concurrency.py	Updated test cases to reflect removal of legacy session decorators and improved error handling
tests/integration/test_auth_flow.py	Added client reset fixture for consistent auth flow testing
agentops/semconv/span_attributes.py	Added new span attribute for session end state
agentops/sdk/processors.py	Removed duplicate log_trace_url calls to streamline span logging
agentops/sdk/decorators/factory.py	Introduced type hints and parameter “tags” in decorators, along with enhanced logging messages
agentops/sdk/core.py	Added start_trace/end_trace methods with TraceContext management and improved shutdown flushing
agentops/instrumentation/crewai/instrumentation.py	Adapted workflow span naming to use configurable trace_name
agentops/helpers/dashboard.py	Updated log_trace_url to include an optional title for better logging context
agentops/config.py	Extended configuration to support default trace name
agentops/client/client.py	Refactored auto-start trace logic and re-initialization handling, deprecating global variables for legacy sessions
agentops/client/api/versions/v3.py	Improved error logging in auth token processing
agentops/init.py	Added start_trace/end_trace API and integrated new trace_name configuration in SDK init

Comments suppressed due to low confidence (1)

agentops/init.py:193

Auto-initializing the SDK within start_trace may lead to side effects; consider requiring explicit initialization to ensure predictable behavior and clearer control flow for SDK consumers.

if not tracing_core.initialized { ... init()  // Attempt to initialize with environment variables / defaults

agentops/client/client.py

…ne code and improve clarity.

YADA YADA YADA

dot-agi

Dwij1704 · 2025-05-27T22:15:53Z

For anyone who is wondering, thats me :)

Dwij1704 added 4 commits May 22, 2025 15:12

Introduced start_trace and end_trace functions for user-managed t…

6fa9dd5

…racing, allowing concurrent traces.

cleanup

a369761

bboynton97 self-requested a review May 23, 2025 15:49

bboynton97 mentioned this pull request May 23, 2025

add end_trace() #917

Closed

bboynton97 reviewed May 23, 2025

View reviewed changes

agentops/__init__.py Outdated Show resolved Hide resolved

bboynton97 reviewed May 23, 2025

View reviewed changes

agentops/client/client.py Show resolved Hide resolved

bboynton97 approved these changes May 23, 2025

View reviewed changes

bboynton97 previously requested changes May 23, 2025

View reviewed changes

dot-agi and others added 7 commits May 26, 2025 01:07

Merge branch 'main' into better-root-span-management

79c7637

Refactor legacy session handling by replacing LegacySession with `S…

6f284c7

…ession` in `agentops` module. This change improves code clarity and aligns with the updated session management approach.

Refactor force_flush method in TracingCore to remove timeout paramete…

cd17b13

…r, simplifying the flush process. Updated logging to indicate completion of the flush operation.

Refactor Client initialization logic to clarify re-initialization con…

1faf0e3

…ditions for the API key. Only trigger a warning if a different non-None API key is provided during re-initialization, enhancing the clarity of client behavior.

Dwij1704 and others added 11 commits May 28, 2025 01:14

Improve authentication error handling in Client and V3Client. Added e…

67b2f80

…xception handling for token fetching and response processing, ensuring clearer error logging and re-raising of exceptions for better testability. Updated integration tests to reset client state between tests.

Refactor integration tests for session concurrency to improve isolati…

fa82e41

…on and error handling. Mock API client and tracing core to avoid real authentication during tests. Simplify concurrency test descriptions and ensure proper cleanup of client state between tests.

Merge branch 'main' into better-root-span-management

c49191c

Enhance trace ID handling in TracingCore by adding exception handling…

fa6eca3

… for invalid trace IDs. This ensures robustness when dealing with mocked spans or non-integer trace IDs, improving overall trace management reliability.

Merge branch 'better-root-span-management' of github.com:AgentOps-AI/…

310aeed

…agentops into better-root-span-management

revert crewai

27f1701

Merge branch 'main' into better-root-span-management

abd53c3

Merge branch 'main' into better-root-span-management

ce4ab1a

Enhance tracing functionality by adding trace_name parameter to con…

51bbf7f

…figuration and initialization. This allows for customizable trace/session naming, improving trace management and clarity in logs. Updated relevant classes and methods to utilize the new parameter.

Merge branch 'better-root-span-management' of github.com:AgentOps-AI/…

89767bb

…agentops into better-root-span-management

dot-agi requested a review from Copilot May 27, 2025 22:04

Copilot AI reviewed May 27, 2025

View reviewed changes

agentops/client/client.py Show resolved Hide resolved

Remove unused span variables in entity decorator function to streamli…

4fe79d9

…ne code and improve clarity.

Dwij1704 requested a review from dot-agi May 27, 2025 22:11

dot-agi approved these changes May 27, 2025

View reviewed changes

Dwij1704 merged commit 5b419d3 into main May 27, 2025
9 of 10 checks passed

Dwij1704 deleted the better-root-span-management branch May 27, 2025 22:15

dot-agi mentioned this pull request May 27, 2025

Rename root span #1000

Closed

Dwij1704 mentioned this pull request May 31, 2025

[Feature]: Ability to rename root span names #988

Closed

3 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Better root span management #999

Better root span management #999

Uh oh!

Dwij1704 commented May 22, 2025

Uh oh!

bboynton97 commented May 23, 2025

Uh oh!

Uh oh!

Uh oh!

bboynton97 left a comment

Uh oh!

codecov bot commented May 27, 2025 •

edited

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

dot-agi left a comment

Uh oh!

Uh oh!

Dwij1704 commented May 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

Better root span management #999

Better root span management #999

Uh oh!

Conversation

Dwij1704 commented May 22, 2025

📥 Pull Request

Uh oh!

bboynton97 commented May 23, 2025

Uh oh!

Uh oh!

Uh oh!

bboynton97 left a comment

Choose a reason for hiding this comment

Uh oh!

codecov bot commented May 27, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull Request Overview

Reviewed Changes

Uh oh!

Uh oh!

dot-agi left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Dwij1704 commented May 27, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

4 participants

codecov bot commented May 27, 2025 •

edited

Loading