Skip to content

Conversation

@dciprian-petrisor
Copy link

@dciprian-petrisor dciprian-petrisor requested a review from a team as a code owner November 28, 2024 18:04
@welcome
Copy link

welcome bot commented Nov 28, 2024

Thanks for opening your first pull request! If you haven't yet signed our Contributor License Agreement (CLA), then please do so that we can accept your contribution. A link should appear shortly in this PR if you have not already signed one.

@linux-foundation-easycla
Copy link

linux-foundation-easycla bot commented Nov 28, 2024

CLA Signed

The committers listed above are authorized under a signed CLA.

  • ✅ login: dciprian-petrisor / name: Petrisor Ciprian Daniel (24e30e1)

@dciprian-petrisor dciprian-petrisor force-pushed the fix-symfony-auto-instrumentation-boundaries branch from e78c015 to 19f787e Compare November 28, 2024 18:15
?\Throwable $exception
): void {
$scope = Context::storage()->scope();
if (null === $scope || null === $exception) {
Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

So if an exception occurs, this post callback will run, otherwise the terminate one will? If so, we should document this.

Copy link
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

That is correct.
I'm assuming this should be documented in the README.md, right?

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

The examples on this page seem to suggest that the handle and terminate can both run.

So it looks like you'd expect 2 spans to be created and then with these changes only close one of them? I'm not so sure that behaviour is correct.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

There appears to be a parameter to configure whether handle should throw here.

So when $catch = false the handle post-hook might have an exception and terminate will not have been reached yet. I think this would be fine as it handles the recording of the exception and detaches the scope.

However, if $catch = false and there is no exception, then terminate() would be called after handle and the terminate post-hook could try to close a scope which does not belong to it.

Copy link
Author

@dciprian-petrisor dciprian-petrisor Nov 29, 2024

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@ChrisLightfootWild

  1. If handle throws an exception which is never caught, then terminate will never be reached either, according to the HttpKernelRunner implementation.
    In this situation, it is correct to close the scope on the post hook of handle.

  2. If handle does not throw an exception (or has $catch = true), then terminate would run afterwards, closing the scope.

The only problem I see is there is not a 100% guarantee that terminate will always be executed after handle.

For example, something else might go wrong in HttpKernelRunner after handle was called, e.g. fastcgi_finish_request()/litespeed_finish_request() might crash, flush() might crash, etc, leaving the scope still open.

Unsure how much of a real issue this is, in most cases.

Normally this would be solved by hooking into HttpKernelRunner::run with both pre and post hooks, as it covers both handle and terminate, however $request and $response are not available as parameters there, so I am at a loss right now.

The main issue is that the parent span is lost in between handle and terminate, due to handle detaching it, thus producing two different traces for the same request:

  • one containing everything from the start of the HTTP request up to the end of handle
  • one containing just the dispatch of TerminateEvent done by $this->kernel->terminate afterwards

the second should be part of the first, as it is still logically part of the same HTTP request lifecycle.

We need either a compromise or another solution entirely.

Copy link
Contributor

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Thanks for that @dciprian-petrisor - I had another look with less bleary eyes and I finally caught up to you guys 💪

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I am having the same problem for Drupal, but opted to do an auto root span for now, which obviously duplicates the logic to do that.

That way I have a duplicated span for the Kernel::handle() + Kernel::terminate(), but that does not matter as it's still within one root trace.

The Auto Root Span uses the SDK ShutdownHandler::register so that might be also an option here.

The other thing is:

You could hook into KernelRunner::run(), but not open the scope here, but instead close it just during the post() [if ::run() was called in pre].

While that is more logic to handle, because handle() needs to check that ::run was actually called, so that it can close it, cross hook frontiers are a good way to handle this.

::run pre -> $runCalled = 1
::handle pre -> Open root scope like normal
::handle post -> if ($runCalled) {} -> Do not bother to close scope, else fallback to current logic
::terminate pre -> Open span like normal
::terminate post -> Close span like normal
::run post -> Close scope

In general I personally will still call the Auto Root Scope hook as early as possible (e.g. DrupalKernel::__construct in my case), because else there is always an ::IO extension call, which escapes.

Until that's converted to SPI obviously there are still race conditions.

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

This is my implementation:

<?php

namespace OpenTelemetry\Contrib\Instrumentation\Drupal;

use Drupal\Core\DrupalKernel;
use OpenTelemetry\SDK\Trace\AutoRootSpan;
use function OpenTelemetry\Instrumentation\hook;

class DrupalAutoRootSpan {

  public static function register(): void {

    hook(
      DrupalKernel::class,
      '__construct',
      static::registerAutoRootSpan(),
      NULL,
    );
  }

  /**
   * @param \OpenTelemetry\API\Instrumentation\CachedInstrumentation $instrumentation
   *
   * @return \Closure
   */
  public static function registerAutoRootSpan(): \Closure {
    return static function (DrupalKernel $kernel, array $params, string $class, string $function, ?string $filename, ?int $lineno) {
      $request = AutoRootSpan::createRequest();

      if ($request) {
        AutoRootSpan::create($request);
        AutoRootSpan::registerShutdownHandler();
      }
    };
  }

}

It works really well.

@codecov
Copy link

codecov bot commented Dec 3, 2024

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 82.81%. Comparing base (3d6f839) to head (24e30e1).

Additional details and impacted files

Impacted file tree graph

@@             Coverage Diff              @@
##               main     #317      +/-   ##
============================================
- Coverage     82.85%   82.81%   -0.04%     
+ Complexity     2353     2309      -44     
============================================
  Files           164      159       -5     
  Lines          9563     9256     -307     
============================================
- Hits           7923     7665     -258     
+ Misses         1640     1591      -49     
Flag Coverage Δ
Aws 93.41% <ø> (ø)
Context/Swoole 0.00% <ø> (ø)
Exporter/Instana 49.42% <ø> (ø)
Instrumentation/AwsSdk 81.13% <ø> (ø)
Instrumentation/CakePHP 20.40% <ø> (ø)
Instrumentation/CodeIgniter 78.99% <ø> (ø)
Instrumentation/Curl 87.66% <ø> (ø)
Instrumentation/Doctrine 92.92% <ø> (ø)
Instrumentation/ExtAmqp 88.48% <ø> (ø)
Instrumentation/ExtRdKafka 86.11% <ø> (ø)
Instrumentation/Guzzle 75.58% <ø> (ø)
Instrumentation/HttpAsyncClient 78.04% <ø> (ø)
Instrumentation/IO 0.00% <ø> (ø)
Instrumentation/Laravel 71.66% <ø> (ø)
Instrumentation/MongoDB 74.28% <ø> (ø)
Instrumentation/MySqli 95.81% <ø> (ø)
Instrumentation/OpenAIPHP 87.21% <ø> (ø)
Instrumentation/PDO 94.21% <ø> (ø)
Instrumentation/PostgreSql 93.54% <ø> (ø)
Instrumentation/Psr14 76.47% <ø> (ø)
Instrumentation/Psr15 89.15% <ø> (ø)
Instrumentation/Psr16 97.50% <ø> (ø)
Instrumentation/Psr18 77.46% <ø> (ø)
Instrumentation/Psr3 67.01% <ø> (ø)
Instrumentation/Psr6 97.61% <ø> (ø)
Instrumentation/ReactPHP 99.45% <ø> (ø)
Instrumentation/Session 94.52% <ø> (ø)
Instrumentation/Slim 84.28% <ø> (ø)
Instrumentation/Symfony ?
Instrumentation/Yii 83.05% <ø> (ø)
Logs/Monolog 100.00% <ø> (ø)
Propagation/CloudTrace 89.77% <ø> (ø)
Propagation/Instana 98.11% <ø> (ø)
Propagation/ServerTiming 94.73% <ø> (ø)
Propagation/TraceResponse 94.73% <ø> (ø)
ResourceDetectors/Azure 91.66% <ø> (ø)
ResourceDetectors/Container 93.02% <ø> (ø)
ResourceDetectors/DigitalOcean 100.00% <ø> (ø)
Sampler/RuleBased 33.51% <ø> (ø)
Sampler/Xray 78.23% <ø> (ø)
Shims/OpenTracing 92.45% <ø> (ø)
Symfony 87.81% <ø> (ø)
Utils/Test 87.53% <ø> (ø)

Flags with carried forward coverage won't be shown. Click here to find out more.
see 5 files with indirect coverage changes


Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update 3d6f839...24e30e1. Read the comment docs.

🚀 New features to boost your workflow:
  • ❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

@stale
Copy link

stale bot commented Apr 26, 2025

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Apr 26, 2025
@jamespavett
Copy link

Just running into the exact same issue in a project I am working on. Is this on the horizon to be merged any time soon?

@stale stale bot removed the stale label Jun 9, 2025
@brettmc
Copy link
Contributor

brettmc commented Jun 11, 2025

Just running into the exact same issue in a project I am working on. Is this on the horizon to be merged any time soon?

No. I'm not sure of the current state. It at least needs to be rebased given its age, and there's some discussion that didn't conclude (and I don't know enough Symfony to make a call myself)
@dciprian-petrisor ?

@dciprian-petrisor
Copy link
Author

Just running into the exact same issue in a project I am working on. Is this on the horizon to be merged any time soon?

No. I'm not sure of the current state. It at least needs to be rebased given its age, and there's some discussion that didn't conclude (and I don't know enough Symfony to make a call myself)
@dciprian-petrisor ?

Hi, I'll look into this over the weekend and if you all agree we could merge it.

@dciprian-petrisor dciprian-petrisor force-pushed the fix-symfony-auto-instrumentation-boundaries branch from 19f787e to 8472a06 Compare June 16, 2025 16:00
@dciprian-petrisor
Copy link
Author

I've rebased this PR.

After a brief look with fresh eyes, given it's been some time since this PR had activity, here are my conclusions:

  1. The implementation in this PR covers the 'normal' execution of any Symfony HTTP Request, minus the situation where any of the following calls in HttpKernelRunner would fail miserably:
fastcgi_finish_request();

or

litespeed_finish_request();

or

Response::closeOutputBuffers(0, true);
flush();

All three of these failing indicate a severe problem and I have personally never encountered failures of them.
These calls failing would indicate issues with FASTCGI/PHP internals or the SAPI.

  1. In any other situation, one of the following two outcomes can happen:
  • the post hook of handle runs whenever an exception occurs/is thrown while processing the Symfony request.
    otherwise, a Response object is returned from handle.
  • the terminate call is always made when handle does not throw.

Both of these will finalize the trace.

In summary, I believe this implementation is good as-is.
Regarding the documentation, if you can give me a few hints as to where to document it, I would gladly do it.

@stale
Copy link

stale bot commented Jul 19, 2025

This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your contributions.

@stale stale bot added the stale label Jul 19, 2025
@jamespavett
Copy link

In there any chance of getting some progress on this? Realise @dciprian-petrisor rebased this a couple of months back.

This issue currently causes quite a bit of pain when using the Symfony autoinstrumentation libaries.

@brettmc
Copy link
Contributor

brettmc commented Sep 29, 2025

Very sorry, I didn't see the update. @dciprian-petrisor are you able to (again) resolve conflicts?

@stale stale bot removed the stale label Sep 29, 2025
@dciprian-petrisor dciprian-petrisor force-pushed the fix-symfony-auto-instrumentation-boundaries branch from 8472a06 to 24e30e1 Compare September 29, 2025 16:34
@dciprian-petrisor
Copy link
Author

@brettmc just did

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

[opentelemetry-php-contrib] Incorrect trace boundaries for Symfony HttpKernel auto instrumentation

5 participants