Intermittent timeouts during langfuse.api.trace.get requests #11801

atul-alf · 2026-01-30T00:04:10Z

atul-alf
Jan 30, 2026

Describe your question

We are currently implementing a logic to calculate the total execution cost of our agents, including all associated sub-tasks. Our current approach involves iterating through sub-tasks and utilizing the langfuse.api.trace.get method to retrieve and aggregate total_costs.

We have observed intermittent timeouts during these API requests, which are affecting the reliability of our cost computations.

Could you please provide recommendations on how to stabilize these calls? We would appreciate any guidance on retry strategies or alternative methods for efficient data extraction.

Langfuse Cloud or Self-Hosted?

Langfuse Cloud

If Self-Hosted

v3.150.0

If Langfuse Cloud

v3.150.0

SDK and integration versions

langfuse 3.12.1

Pre-Submission Checklist

I have checked for existing issues/discussions and consulted Langfuse AI.

2026-01-30T00:04:40Z

inkeep-ai-assistant[bot]
bot Jan 30, 2026

Intermittent timeouts during langfuse.api.trace.get requests in self-hosted Langfuse v3 can stem from several infrastructure bottlenecks. Based on the available sources, here are key recommendations to stabilize your API calls:

Infrastructure Optimization

Backend Resource Scaling: The minimum spec of 2 vCPU/4 GiB for web containers is often insufficient under load(1). Scale up your web container resources and monitor performance during peak API request periods(1).

S3/Blob Storage Performance: If S3 uploads are slow or throttled, consider increasing LANGFUSE_S3_CONCURRENT_WRITES above the default 50(1). Enable S3 SlowDown error detection with LANGFUSE_S3_RATE_ERROR_SLOWDOWN_ENABLED=true(1).

Redis and ClickHouse: High Redis CPU or ClickHouse connection limits can block API requests(1). Use at least 4 CPUs for Redis and enable cluster mode if needed(1). For ClickHouse, increase resources or reduce batch size/intervals if you experience performance issues under load(1).

Load Balancer Configuration: Ensure your keep-alive timeout is at least 5 seconds higher than the load balancer idle timeout(1). This prevents premature connection closures that can cause timeouts.

API Usage Best Practices

Retry Strategy: The Python SDK provides strongly-typed access to the public REST API via the api property on the Langfuse client instance(2). Implement exponential backoff retry logic in your application code when calling langfuse.api.trace.get().

Batch Operations: Instead of making individual trace.get calls for each sub-task, consider using the Observations API for more efficient data retrieval(2). The v2 API offers high-performance data retrieval with cursor-based pagination and selective field retrieval(2).

Alternative Approach - Metrics API: For aggregating cost data across multiple traces, the Metrics API may be more efficient than individual trace lookups(2). It provides aggregated analytics and metrics with customizable dimensions, metrics, filters, and time granularity(2).

Debugging Steps

Enable debug logging with LANGFUSE_LOG_LEVEL=debug to get detailed information about API performance(1). Monitor your web and worker container logs for warnings about trace processing, database connectivity, or S3 upload failures(3).

Check the health of your Langfuse instance using the built-in endpoints(3):

/api/public/health
/api/public/ready

If you continue experiencing timeouts, review your ClickHouse and Redis connectivity from the web container, as backend processing bottlenecks often involve these components(1)(3).

📚 Sources:

Have another question? Just tag @inkeep.

0 replies

jannikmaierhoefer · 2026-01-30T10:34:15Z

jannikmaierhoefer
Jan 30, 2026
Maintainer

Hi @atul-alf, thanks. I would recommend testing our new observations endpoint (v2) for this use case as we improved performance an reliability here: https://langfuse.com/docs/api-and-data-platform/features/observations-api

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Langfuse

Intermittent timeouts during langfuse.api.trace.get requests #11801

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{editor}}'s edit

{{editor}}'s edit

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

Langfuse

Intermittent timeouts during langfuse.api.trace.get requests #11801

Uh oh!

Uh oh!

atul-alf Jan 30, 2026

Describe your question

Langfuse Cloud or Self-Hosted?

If Self-Hosted

If Langfuse Cloud

SDK and integration versions

Pre-Submission Checklist

Replies: 2 comments

Uh oh!

inkeep-ai-assistant[bot] bot Jan 30, 2026

Infrastructure Optimization

API Usage Best Practices

Debugging Steps

Uh oh!

jannikmaierhoefer Jan 30, 2026 Maintainer

atul-alf
Jan 30, 2026

inkeep-ai-assistant[bot]
bot Jan 30, 2026

jannikmaierhoefer
Jan 30, 2026
Maintainer