You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Sep 2, 2025. It is now read-only.
Copy file name to clipboardExpand all lines: gdi/get-data-in/connect/aws/aws-prereqs.rst
+2-1Lines changed: 2 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -426,7 +426,8 @@ Read more at the official AWS documentation:
426
426
427
427
* :new-page:`AWS Organization Service Control Policies <https://docs.aws.amazon.com/organizations/latest/userguide/orgs_manage_policies_scps.html>`
428
428
* :new-page:`Permissions boundaries for IAM entities <https://docs.aws.amazon.com/IAM/latest/UserGuide/access_policies_boundaries.html>`
429
-
* :new-page:`Troubleshooting IAM permission access denied or unauthorized errors <https://repost.aws/knowledge-center/troubleshoot-iam-permission-errors>`
429
+
430
+
.. tip:: Search for specific troubleshooting at AWS' knowledge center.
Copy file name to clipboardExpand all lines: gdi/opentelemetry/splunk-collector-troubleshooting.rst
+83-80Lines changed: 83 additions & 80 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -9,14 +9,16 @@ Troubleshoot the Splunk OpenTelemetry Collector
9
9
10
10
See the following issues and workarounds for the Splunk Distribution of the OpenTelemetry Collector.
11
11
12
-
.. note:: See also the :new-page:`OpenTelemetry Project troublehooting docs in GitHub <https://github.com/open-telemetry/opentelemetry-collector/blob/main/docs/troubleshooting.md>`.
12
+
.. note:: See also the :new-page:`OpenTelemetry Project troublehooting docs <https://opentelemetry.io/docs/collector/troubleshooting>`.
13
13
14
-
Collector isn't behaving as expected
15
-
=========================================
14
+
.. caution:: Splunk only provides best-effort support for the upstream OpenTelemetry Collector.
15
+
16
+
The Collector isn't behaving as expected
17
+
=================================================
16
18
17
19
The Collector might experience the issues described in this section.
The process might fail to start in a custom built, Windows-based Docker container, resulting in a "The service process could not connect to the service controller" error message.
45
47
46
48
In this case, the ``NO_WINDOWS_SERVICE=1`` environment variable must be set to force the Splunk Distribution of OpenTelemetry Collector to start as if it were running in an interactive terminal, without attempting to run as a Windows service.
47
49
48
-
Collector is experiencing data issues
50
+
Extract a running configuration
51
+
=========================================
52
+
53
+
Extracting a running configuration saves or stores the contents of a configuration file to logs that you can use to troubleshoot issues. You can extract a running configuration by accessing these ports:
For Linux, the support bundle script captures this information. See :ref:`otel-install-linux` for the installer script. This capability is primarily useful if you are using remote configuration options such as Zookeeper where the startup configuration can change during operation.
59
+
60
+
The Collector is experiencing data issues
49
61
============================================
50
62
51
-
You can monitor internal Collector metrics tracking parameters such as data loss or CPU resources in Splunk Observability Cloud's default dashboards at :guilabel:`Dashboards > OpenTelemetry Collector > OpenTelemetry Collector`. To learn more about these metrics, see :new-page:`Monitoring <https://github.com/open-telemetry/opentelemetry-collector/blob/main/docs/monitoring.md>` in the OpenTelemetry GitHub repo.
63
+
You can monitor internal Collector metrics tracking parameters such as data loss or CPU resources in Splunk Observability Cloud's default dashboards at :guilabel:`Dashboards > OpenTelemetry Collector > OpenTelemetry Collector`.
52
64
53
-
The Collector might experience the issues described in this section.
65
+
To learn more see:
66
+
67
+
* :ref:`metrics-internal-collector`
68
+
* :new-page:`Internal telemetry <https://opentelemetry.io/docs/collector/internal-telemetry>` in the OpenTelemetry project documentation
54
69
55
-
Collector is dropping data
70
+
The Collector is dropping data
56
71
--------------------------------
57
72
58
73
Data might drop for a variety of reasons, but most commonly for the following reasons:
59
74
60
-
* The collector is improperly sized, resulting in the Splunk Distribution of OpenTelemetry Collector being unable to process and export the data as fast as it is received. See :ref:`otel-sizing` for sizing guidelines.
75
+
* The Collector is improperly sized, resulting in the Splunk Distribution of the OpenTelemetry Collector being unable to process and export the data as fast as it is received. See :ref:`otel-sizing` for sizing guidelines.
61
76
* The exporter destination is unavailable or accepting the data too slowly. To mitigate drops, configure the ``batch`` processor. In addition, you might also need to configure the queued retry options on activated exporters.
62
77
63
-
Collector isn't receiving data
78
+
The Collector isn't receiving data
64
79
-------------------------------------
65
80
66
-
The collector might not receive data for the following reasons:
81
+
The Collector might not receive data for the following reasons:
67
82
68
83
* Network configuration issues
69
84
* Receiver configuration issues
70
85
* The receiver is defined in the receivers section, but not activated in any pipelines
71
86
* The client configuration is incorrect
72
87
73
-
Check the logs and :new-page:`Troubleshooting zPages <https://github.com/open-telemetry/opentelemetry-collector/blob/main/docs/troubleshooting.md#zpages>` in the OpenTelemetry project GitHub repositories for more information. Note that Splunk only provides best-effort support for the upstream OpenTelemetry Collector.
74
-
75
-
Collector can't process data
88
+
The Collector can't process data
76
89
-----------------------------------
77
90
78
-
The collector might not process data for the following reasons:
91
+
The Collector might not process data for the following reasons:
79
92
80
93
* The attributes processors work only for "tags" on spans. The span name is handled by the span processor.
81
94
* Processors for trace data (except tail sampling) only work on individual spans. Make sure your collector is configured properly.
82
95
83
-
Collector can't export data
96
+
The Collector can't export data
84
97
------------------------------------
85
98
86
-
The collector might be unable to export data for the following reasons:
99
+
The Collector might be unable to export data for the following reasons:
87
100
88
101
* Network configuration issues, such as firewall, DNS, or proxy support
89
102
* Incorrect exporter configuration
@@ -92,8 +105,6 @@ The collector might be unable to export data for the following reasons:
92
105
93
106
If you need to use a proxy, see :ref:`configure-proxy-collector`.
94
107
95
-
Check the logs and :new-page:`Troubleshooting zPages <https://github.com/open-telemetry/opentelemetry-collector/blob/main/docs/troubleshooting.md#zpages>` in the OpenTelemetry project GitHub repositories for more information. Note that Splunk only provides best-effort support for the upstream OpenTelemetry Collector.
96
-
97
108
.. _collector-gateway-metrics-issue:
98
109
99
110
Metrics and metadata not available in data forwarding (gateway) mode
@@ -149,15 +160,6 @@ For example:
149
160
value: staging
150
161
key: deployment.environment
151
162
152
-
Extract a running configuration
153
-
=========================================
154
-
Extracting a running configuration saves or stores the contents of a configuration file to logs that you can use to troubleshoot issues. You can extract a running configuration by accessing these ports:
For Linux, the support bundle script captures this information. See :ref:`otel-install-linux` for the installer script. This capability is primarily useful if you are using remote configuration options such as Zookeeper where the startup configuration can change during operation.
160
-
161
163
Check metric data from the command line
162
164
==============================================
163
165
@@ -168,56 +170,37 @@ To check whether host metrics are being collected and processed correctly, you c
168
170
169
171
You can then pipe the output to ``grep`` (Linux) or ``Select-String`` (Windows) to filter the data. For example, ``curl http://localhost:8888/metrics | grep service_instance_id`` retrieves the service instance ID.
170
172
171
-
You're getting a "bind: address already in use" error message
If you see an error message such as "bind: address already in use", another resource is already using the port that the current configuration requires. This resource could be another application, or a tracing tool such as Jaeger or Zipkin. You can modify the configuration to use another port.
175
-
176
-
You can modify any of these endpoints or ports:
177
-
178
-
* Receiver endpoint
179
-
* Extensions endpoint
180
-
* Metrics address (if port 8888)
181
-
182
-
Conflicts with port 8888
183
-
-----------------------------------
184
-
185
-
If you encounter a conflict with port 8888, you will need to change to port 8889, making adjustments in these two areas:
186
-
187
-
1. Add telemetry configuration under the service section:
You can test the Collector to make sure it can receive spans without instrumenting an application. By default, the Collector activates the Zipkin receiver, which is capable of receiving trace data over JSON.
191
180
192
-
service:
193
-
telemetry:
194
-
metrics:
195
-
address: ":8889"
181
+
To test the UI, you can submit a POST request or paste JSON in this directory, as shown in the following example.
196
182
183
+
.. code-block:: bash
197
184
198
-
2. Update the port for ``receivers.prometheus/internal`` from 8888 to 8889:
curl -X POST localhost:9411/api/v2/spans -H'Content-Type: application/json' -d @yelp.json
199
187
200
-
.. code-block:: yaml
188
+
.. note::
201
189
190
+
Update the ``localhost`` field as appropriate to reach the Collector.
202
191
203
-
receivers:
204
-
prometheus/internal:
205
-
config:
206
-
scrape_configs:
207
-
- job_name: 'otel-collector'
208
-
scrape_interval: 10s
209
-
static_configs:
210
-
- targets: ['0.0.0.0:8889']
192
+
No response means the request was sent successfully. You can also pass ``-v`` to the curl command to confirm.
211
193
212
-
If you see this error message on Kubernetes and you're using Helm charts, modify the configuration by updating the chart values for both configuration and exposed ports.
If you see an error message such as "pattern not matched", this message is from Fluentd, and means that the ``<parser>`` was unable to match based on the log message. As a result, the log message is not collected. Check the Fluentd configuration and update as required.
Here are some common issues related to trace collection on the Collector.
225
+
If you see an error message such as "bind: address already in use", another resource is already using the port that the current configuration requires. This resource could be another application, or a tracing tool such as Jaeger or Zipkin. You can modify the configuration to use another port.
You can test the Collector to make sure it can receive spans without instrumenting an application. By default, the Collector activates the Zipkin receiver, which is capable of receiving trace data over JSON.
229
+
* Receiver endpoint
230
+
* Extensions endpoint
231
+
* Metrics address (if port 8888)
248
232
249
-
To test the UI, you can submit a POST request or paste JSON in this directory, as shown in the following example.
233
+
Conflicts with port 8888
234
+
-----------------------------------
250
235
251
-
.. code-block:: bash
236
+
If you encounter a conflict with port 8888, you will need to change to port 8889, making adjustments in these two areas:
curl -X POST localhost:9411/api/v2/spans -H'Content-Type: application/json' -d @yelp.json
238
+
1. Add telemetry configuration under the service section:
255
239
256
-
.. note::
240
+
.. code-block:: yaml
257
241
258
-
Update the ``localhost`` field as appropriate to reach the Collector.
259
242
260
-
No response means the request was sent successfully. You can also pass ``-v`` to the curl command to confirm.
243
+
service:
244
+
telemetry:
245
+
metrics:
246
+
address: ":8889"
247
+
248
+
249
+
2. Update the port for ``receivers.prometheus/internal`` from 8888 to 8889:
250
+
251
+
.. code-block:: yaml
252
+
253
+
254
+
receivers:
255
+
prometheus/internal:
256
+
config:
257
+
scrape_configs:
258
+
- job_name: 'otel-collector'
259
+
scrape_interval: 10s
260
+
static_configs:
261
+
- targets: ['0.0.0.0:8889']
262
+
263
+
If you see this error message on Kubernetes and you're using Helm charts, modify the configuration by updating the chart values for both configuration and exposed ports.
Copy file name to clipboardExpand all lines: gdi/opentelemetry/troubleshoot-logs.rst
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -8,7 +8,7 @@ Troubleshoot Collector logs
8
8
:description: Describes known issues when collecting logs with the Splunk Distribution of OpenTelemetry Collector.
9
9
10
10
11
-
.. note:: To activate the Collector's debug logging, see the :new-page:`OpenTelemetry project documentation in GitHub <https://github.com/open-telemetry/opentelemetry-collector/blob/main/docs/troubleshooting.md#logs>`.
11
+
.. note:: See also the :new-page:`OpenTelemetry Project troublehooting docs <https://opentelemetry.io/docs/collector/troubleshooting>` for more information about debugging.
12
12
13
13
Here are some common issues related to log collection on the Collector.
0 commit comments