You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: troubleshoot/ingest/opentelemetry/edot-collector/trace-export-errors.md
+14-10Lines changed: 14 additions & 10 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -28,16 +28,17 @@ These errors indicate the Collector is overwhelmed and unable to export data fas
28
28
This issue typically occurs when the `sending_queue` configuration or the Elasticsearch cluster scaling is misaligned with the incoming telemetry volume.
29
29
30
30
:::{important}
31
-
The sending queue is disabled by default in versions earlier than **v0.138.0** and enabled by default from **v0.138.0** onward. If you're using an earlier version, verify that `enabled: true` is explicitly set — otherwise any queue configuration will be ignored.
31
+
{applies_to}`stack: ga 9.0, deprecated 9.3`
32
+
The sending queue is turned off by default. Verify that `enabled: true` is explicitly set — otherwise any queue configuration will be ignored.
32
33
:::
33
34
34
35
Common contributing factors include:
35
36
36
37
* Underscaled Elasticsearch cluster is the most frequent cause of persistent export failures. If Elasticsearch cannot index data fast enough, the Collector’s queue fills up.
37
-
*`sending_queue.block_on_overflow` is disabled in **pre-v0.138.0** releases (defaults to `false`), which can lead to silent data drops. Starting from **v0.138.0**, the Elasticsearch exporter enables this setting by default.
38
-
*`num_consumers` is too low to keep up with the incoming data volume.
39
-
*The queue size (`queue_size`) is too small for the traffic load.
40
-
*Export batching is disabled, increasing processing overhead.
38
+
*{applies_to}`stack: ga 9.0, deprecated 9.3``sending_queue.block_on_overflow` is turned off (defaults to `false`), which can lead to data drops.
39
+
*Sending queue is enabled but `num_consumers` is too low to keep up with the incoming data volume.
40
+
*Sending queue size (`queue_size`) is too small for the traffic load.
41
+
*Both internal and sending queue batching are disabled, increasing processing overhead.
41
42
* EDOT Collector resources (CPU, memory) are insufficient for the traffic volume.
42
43
43
44
:::{note}
@@ -46,11 +47,12 @@ Increasing the `timeout` value (for example from 30s to 90s) doesn't help if the
46
47
47
48
## Resolution
48
49
49
-
The resolution approach depends on which EDOT Collector version you're using.
50
+
The resolution approach depends on your {{stack}} version and Collector configuration.
50
51
51
-
### For EDOT Collector versions earlier than v0.138.0
52
+
### When the sending queue is not enabled by default
53
+
{applies_to}`stack: ga 9.0, deprecated 9.3`
52
54
53
-
Enable the sending queue and block on overflow to prevent silent data drops:
55
+
Enable the sending queue and block on overflow to prevent data drops:
54
56
55
57
```yaml
56
58
sending_queue:
@@ -60,7 +62,8 @@ sending_queue:
60
62
block_on_overflow: true
61
63
```
62
64
63
-
### For EDOT Collector v0.138.0 and later
65
+
### When the sending queue is enabled by default
66
+
{applies_to}`stack: ga 9.3`
64
67
65
68
The Elasticsearch exporter provides default `sending_queue` parameters (including `block_on_overflow: true`) but these can and often should be tuned for specific workloads.
66
69
@@ -97,7 +100,8 @@ Address indexing delays, rejected bulk requests, or shard imbalances that limit
97
100
:::::
98
101
99
102
:::{tip}
100
-
For **v0.138.0+**, focus tuning efforts on Elasticsearch performance, Collector resource allocation, and queue sizing informed by the internal telemetry metrics above.
103
+
{applies_to}`stack: ga 9.3`
104
+
Focus tuning efforts on {{es}} performance, Collector resource allocation, and queue sizing informed by the internal telemetry metrics above.
0 commit comments