Retrieval Span Support #2924

JWinermaSplunk · 2025-10-16T17:14:38Z

Changes

Add retrieval span support to db spans

Important

Pull request acceptance is subject to the triage process as described in Issue and PR Triage Management.
PRs that do not follow the guidance above may be automatically rejected and closed.

Merge requirement checklist

CONTRIBUTING.md guidelines followed.
Change log entry added, according to the guidelines in When to add a changelog entry.
- If your PR does not need a change log, start the PR title with [chore]
Links to the prototypes or existing instrumentations (when adding or changing conventions)

lmolkova

Retrieval is in many cases based on database retrieval, e.g. a Postgres or MongoDB instrumentation has no knowledge that it's used in the context of a GenAI application.

So retrieval in a general case is just a DB call with semantics described in https://github.com/open-telemetry/semantic-conventions/blob/main/docs/database/README.md

There is a question of whether search engines are databases or should be covered by a separate or additional set of conventions (#1869); this is where the OpenAI retrieval API should probably belong.

lmolkova · 2025-10-28T00:13:45Z

Adding more context from GenAI SIG call:

- langchain, llamaindex, and haystack offer a retriever abstraction.
- The question is: should the corresponding implementations be instrumented and, if so, which conventions should they follow?

My take:

Reading through the langchain, llamaindex, and haystack docs, a retriever is in most cases a thin layer on top of a database or a search client, which may be instrumented using database conventions and/or hypothetical search conventions.

Retrievers could be more complicated and combine multiple sources of data or perform additional logic. In these cases, the spans they emit might be significantly different from the underlying DB calls, and multiple layers may be instrumented at the same time. In the case of a thin abstraction layer / wrapper, having two spans does not improve observability but increases cost and noise.

When it comes to abstractions, there is a classic problem of instrumentation layers (ORM vs database spans, langchain LLM vs underlying model-client spans, etc.): both layers could be instrumented, and there are pros and cons (the DB layer has more low-level info, the framework layer represents the caller's perspective better). The duplication is a common problem. Solutions may include:

- users decide to enable/disable the corresponding instrumentation (by installing the instrumentation library or by enabling/disabling retrieval spans in the langchain / llamaindex / etc. instrumentation)
- suppressing internal DB spans in the scope of a higher-level retrieval span (similar to HTTP suppression or, more broadly, Instrumentation layers and suppressing duplicates oteps#172)
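
To make the second option concrete, here is a minimal sketch of suppressing nested DB spans while a higher-level retrieval span is active. It does not use the OpenTelemetry API: `start_span`, `recorded_spans`, the `layer` argument, and the span names are all hypothetical stand-ins, and the suppression flag is just a `contextvars` variable. Treat it as an illustration of the idea, not as how any existing instrumentation works.

```python
import contextvars
from contextlib import contextmanager

# Hypothetical flag: True while a retrieval span that asked for suppression is active.
_suppress_db_spans = contextvars.ContextVar("suppress_db_spans", default=False)

recorded_spans = []  # stand-in for an exporter; holds the names of recorded spans


@contextmanager
def start_span(name, layer, suppress_nested_db=False):
    """Record a span unless it is a DB span nested under a suppressing span."""
    if layer == "db" and _suppress_db_spans.get():
        yield None  # suppressed: nothing is recorded for this DB call
        return
    recorded_spans.append(name)
    # Keep suppression on if an outer span already enabled it.
    token = _suppress_db_spans.set(suppress_nested_db or _suppress_db_spans.get())
    try:
        yield name
    finally:
        _suppress_db_spans.reset(token)


def query_database():
    # Stands in for an instrumented DB client call.
    with start_span("SELECT documents", layer="db"):
        return ["doc-1", "doc-2"]


def retrieve(suppress):
    # Stands in for an instrumented framework retriever wrapping the DB call.
    with start_span("retrieval my_index", layer="retrieval", suppress_nested_db=suppress):
        return query_database()
```

With `retrieve(suppress=True)` only the retrieval span is recorded; with `retrieve(suppress=False)` both layers are recorded, which is the duplication described above.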

I think the path forward for this PR is:

- separate retrieval from the GenAI domain. Most of the attributes defined here can be generic db or search attributes. Please follow the discussion in Should search engines follow database semantic conventions? #1869
- langchain / llamaindex / etc. instrumentation would cover retrieval following those db or search conventions and would provide an option to disable their retrieval instrumentation (e.g. when the user prefers to use the existing underlying DB client instrumentation)
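
A rough sketch of what such an opt-out could look like on the framework-instrumentation side. Everything here is hypothetical: the environment variable name `EXAMPLE_INSTRUMENT_RETRIEVAL_SPANS`, the helper names, and the `spans` list standing in for real span creation are assumptions made for illustration, not an existing option in any instrumentation library.

```python
import os


def retrieval_spans_enabled(environ=os.environ):
    """Return False when the (hypothetical) opt-out variable is set to a false-like value."""
    value = environ.get("EXAMPLE_INSTRUMENT_RETRIEVAL_SPANS", "true")
    return value.strip().lower() not in ("false", "0", "no")


def instrument_retriever(retrieve_fn, spans, environ=os.environ):
    """Wrap retrieve_fn so each call records a retrieval span, unless disabled."""
    if not retrieval_spans_enabled(environ):
        # Leave the retriever untouched; any DB client instrumentation still applies.
        return retrieve_fn

    def wrapper(query):
        spans.append("retrieval")  # stand-in for starting a real span
        return retrieve_fn(query)

    return wrapper
```

When the variable is set to `false`, the retriever is returned unwrapped, so only the underlying DB client instrumentation (if installed) would emit spans.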

lmolkova · 2025-10-28T16:56:55Z

Also related #1231

JWinermaSplunk · 2025-10-29T22:21:27Z

Hi @lmolkova,

Here are a few examples of proposed retrieval spans from Traceloop and our instrumentation, also updated in the Google doc linked to the issue:
https://docs.google.com/document/d/1DVE3Ht686nuxww-Z1JNyy2VInn9LeCJQYw8NffbjGtg/edit?tab=t.0.

I believe we also discussed that we are fine moving retrieval spans from the genai to the db space, but would it also be possible to make the genai attributes optional (enable/disable), similar to enabling/disabling retrievals as a whole? That way a retrieval could parent a db + embedding operation.
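
To illustrate the shape being asked about, here is a small sketch of a retrieval span that parents an embedding operation and a DB operation, with the genai attribute on the retrieval span behind a switch. The `Span` class, the span names, and the attribute values (in particular `"retrieval"` as an operation name) are assumptions for illustration and are not taken from the PR diff or the linked doc.

```python
from dataclasses import dataclass, field


@dataclass
class Span:
    """Tiny stand-in for a span: a name, attributes, and child spans."""
    name: str
    attributes: dict = field(default_factory=dict)
    children: list = field(default_factory=list)


def build_retrieval_trace(include_genai_attributes):
    # Child 1: embedding the query text (hypothetical span name and value).
    embedding = Span("embeddings", {"gen_ai.operation.name": "embeddings"})
    # Child 2: the vector/DB lookup, described with db attributes.
    db_query = Span("query", {"db.operation.name": "query"})
    # Parent: the retrieval span itself, described with db attributes by default.
    retrieval = Span("retrieval", {"db.operation.name": "retrieval"}, [embedding, db_query])
    if include_genai_attributes:
        # Optional genai attribute, only added when the user opts in.
        retrieval.attributes["gen_ai.operation.name"] = "retrieval"
    return retrieval
```

Calling `build_retrieval_trace(False)` yields the same span tree without the genai attribute on the parent, which is the opt-out behaviour the question describes.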

github-actions · 2025-11-14T03:34:05Z

This PR was marked stale due to lack of activity. It will be closed in 7 days.

github-actions · 2025-11-21T19:07:42Z

This PR contains changes to area(s) that do not have an active SIG/project and will be auto-closed:

database

Such changes may be rejected or put on hold until a new SIG/project is established.

Please refer to the Semantic Convention Areas document to see the current active SIGs and also to learn how to kick start a new one.

JWinermaSplunk requested review from a team as code owners October 16, 2025 17:14

github-project-automation bot added this to Semantic Conventions Triage Oct 16, 2025

github-project-automation bot moved this to Untriaged in Semantic Conventions Triage Oct 16, 2025

github-actions bot added the area:gen-ai label Oct 16, 2025

lmolkova requested changes Oct 19, 2025

github-project-automation bot moved this from Untriaged to Blocked in Semantic Conventions Triage Oct 19, 2025

jsuereth moved this from Blocked to Awaiting codeowners approval in Semantic Conventions Triage Oct 27, 2025

github-actions bot added the Stale label Nov 14, 2025

breedx-splk removed the Stale label Nov 19, 2025

JWinermaSplunk requested a review from a team as a code owner November 21, 2025 19:07

github-actions bot added the triage:rejected:declined label Nov 21, 2025

github-actions bot closed this Nov 21, 2025

lmolkova added triage:accepted:ready and removed triage:rejected:declined labels Nov 21, 2025

lmolkova reopened this Nov 21, 2025

lmolkova mentioned this pull request Nov 21, 2025

[chore] Rename */database to */db to match area names #3108

Merged

JWinermaSplunk added 2 commits November 21, 2025 12:00

initial commit

912d590

# Conflicts: # docs/registry/attributes/gen-ai.md

updates for db retrieval span

64ff6d9

JWinermaSplunk force-pushed the retrieval-span-support branch from 080e240 to 64ff6d9 Compare November 21, 2025 20:02

update change log for db

bf22f30

github-actions bot added the area:db label Nov 21, 2025

updates

3b69cd5
