You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
{{ message }}
This repository was archived by the owner on Sep 2, 2025. It is now read-only.
Copy file name to clipboardExpand all lines: gdi/opentelemetry/components/apache-spark-receiver.rst
+26-15Lines changed: 26 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,12 +11,16 @@ The Apache Spark receiver metrics for an Apache Spark cluster through the Apache
11
11
12
12
.. note:: Out-of-the-box dashboards and navigators aren't supported for the Apache Web Server receiver yet, but are planned for a future release.
13
13
14
+
The receiver uses the following endpoints: ``/metrics/json``, ``/api/v1/applications/[app-id]/stages``, ``/api/v1/applications/[app-id]/executors``, and ``/api/v1/applications/[app-id]/jobs endpoints``.
15
+
16
+
Purpose
17
+
The purpose of this component is to allow monitoring of Apache Spark clusters and the applications running on them through the collection of performance metrics like memory utilization, CPU utilization, shuffle operations, garbage collection time, I/O operations, and more.
18
+
14
19
Prerequisites
15
20
======================
16
21
17
-
This receiver supports Apache Web Server version 2.4 or higher.
22
+
This receiver supports Apache Spark versions 3.3.2 or higher.
18
23
19
-
In order to receive server statistics, you must configure the server's ``httpd.conf`` file to enable status support. Learn more at Apache's official documentation :new-page:`Module mod_status <https://httpd.apache.org/docs/2.4/mod/mod_status.html>`.
20
24
21
25
Get started
22
26
======================
@@ -35,14 +39,17 @@ Follow these steps to configure and activate the component:
35
39
Sample configuration
36
40
--------------------------------
37
41
38
-
To activate the Apache Web Server receiver, add ``apache`` to the ``receivers`` section of your configuration file:
42
+
To activate the Apache Spark receiver, add ``apachespark`` to the ``receivers`` section of your configuration file:
To complete the configuration, include the receiver in the ``metrics`` pipeline of the ``service`` section of your configuration file:
48
55
@@ -51,29 +58,33 @@ To complete the configuration, include the receiver in the ``metrics`` pipeline
51
58
service:
52
59
pipelines:
53
60
metrics:
54
-
receivers: [apache]
61
+
receivers: [apachespark]
55
62
56
63
Configuration options
57
64
-----------------------
58
65
59
-
The following settings are required:
66
+
The following settings are optional:
60
67
61
-
* ``endpoint``. ``"http://localhost:8080/server-status?auto"`` by default. The URL of the httpd status endpoint.
68
+
* ``collection_interval``. ``60s`` by default. Sets the interval this receiver collects metrics on.
69
+
70
+
* This value must be a string readable by Golang's ``time.ParseDuration``. Learn more at Go's official documentation :new-page:`ParseDuration function <https://pkg.go.dev/time#ParseDuration>`.
71
+
72
+
* Valid time units are ``ns``, ``us`` (or ``µs``), ``ms``, ``s``, ``m``, ``h``.
* ``application_names``. An array of Spark application names for which metrics are collected from. If no application names are specified, metrics are collected for all Spark applications running on the cluster at the specified endpoint.
68
79
69
80
Settings
70
81
======================
71
82
72
-
The following table shows the configuration options for the Apache Web Server receiver:
83
+
The following table shows the configuration options for the Apache Spark receiver:
0 commit comments