This repository was archived by the owner on Sep 2, 2025. It is now read-only.

Commit 65fbdb8

Draft
1 parent 5240c5f commit 65fbdb8

1 file changed

+26
-15
lines changed

gdi/opentelemetry/components/apache-spark-receiver.rst

Lines changed: 26 additions & 15 deletions
@@ -11,12 +11,16 @@ The Apache Spark receiver metrics for an Apache Spark cluster through the Apache
1111

1212
.. note:: Out-of-the-box dashboards and navigators aren't supported for the Apache Spark receiver yet, but are planned for a future release.
1313

14+
The receiver uses the following endpoints: ``/metrics/json``, ``/api/v1/applications/[app-id]/stages``, ``/api/v1/applications/[app-id]/executors``, and ``/api/v1/applications/[app-id]/jobs``.
15+
16+
Purpose
17+
This component monitors Apache Spark clusters and the applications running on them by collecting performance metrics such as memory utilization, CPU utilization, shuffle operations, garbage collection time, and I/O operations.
18+
1419
Prerequisites
1520
======================
1621

17-
This receiver supports Apache Web Server version 2.4 or higher.
22+
This receiver supports Apache Spark version 3.3.2 or higher.
1823

19-
In order to receive server statistics, you must configure the server's ``httpd.conf`` file to enable status support. Learn more at Apache's official documentation :new-page:`Module mod_status <https://httpd.apache.org/docs/2.4/mod/mod_status.html>`.
2024

2125
Get started
2226
======================
@@ -35,14 +39,17 @@ Follow these steps to configure and activate the component:
3539
Sample configuration
3640
--------------------------------
3741

38-
To activate the Apache Web Server receiver, add ``apache`` to the ``receivers`` section of your configuration file:
42+
To activate the Apache Spark receiver, add ``apachespark`` to the ``receivers`` section of your configuration file:
3943

4044
.. code-block:: yaml
4145
4246
  receivers:
43-
    apache:
44-
      endpoint: "http://localhost:8080/server-status?auto"
45-
      collection_interval: 10s
47+
    apachespark:
48+
      collection_interval: 60s
49+
      endpoint: http://localhost:4040
50+
      application_names:
51+
        - PythonStatusAPIDemo
52+
        - PythonLR
4653
4754
To complete the configuration, include the receiver in the ``metrics`` pipeline of the ``service`` section of your configuration file:
4855

@@ -51,29 +58,33 @@ To complete the configuration, include the receiver in the ``metrics`` pipeline
5158
  service:
5259
    pipelines:
5360
      metrics:
54-
        receivers: [apache]
61+
        receivers: [apachespark]
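
As a minimal sketch, the receiver and pipeline fragments might be combined in a single configuration file as follows. The ``debug`` exporter is an assumption used only to make the pipeline complete; it isn't part of this change, so substitute the exporter your deployment already uses.

.. code-block:: yaml

  receivers:
    apachespark:
      collection_interval: 60s
      endpoint: http://localhost:4040

  exporters:
    # Assumption: placeholder exporter for illustration only.
    debug:

  service:
    pipelines:
      metrics:
        receivers: [apachespark]
        exporters: [debug]
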
5562
5663
Configuration options
5764
-----------------------
5865

59-
The following settings are required:
66+
The following settings are optional:
6067

61-
* ``endpoint``. ``"http://localhost:8080/server-status?auto"`` by default. The URL of the httpd status endpoint.
68+
* ``collection_interval``. ``60s`` by default. The interval at which this receiver collects metrics.
69+
70+
  * This value must be a string readable by Go's ``time.ParseDuration``. Learn more at Go's official documentation :new-page:`ParseDuration function <https://pkg.go.dev/time#ParseDuration>`.
71+
72+
  * Valid time units are ``ns``, ``us`` (or ``µs``), ``ms``, ``s``, ``m``, and ``h``.
6273

63-
The following settings are optional:
74+
* .. include:: /_includes/gdi/collector-settings-initialdelay.rst
6475

65-
.. include:: /_includes/gdi/collector-settings-collectioninterval.rst
76+
* ``endpoint``. ``http://localhost:4040`` by default. The Apache Spark endpoint to connect to, in the form ``[http][://]{host}[:{port}]``.
6677

67-
.. include:: /_includes/gdi/collector-settings-initialdelay.rst
78+
* ``application_names``. An array of names of the Spark applications to collect metrics for. If no application names are specified, metrics are collected for all Spark applications running on the cluster at the specified endpoint.
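
As a hedged illustration of these options, the following sketch sets a compound duration and omits ``application_names``. The values are illustrative only and aren't recommendations.

.. code-block:: yaml

  receivers:
    apachespark:
      # Compound durations such as 2m30s are accepted by time.ParseDuration.
      collection_interval: 2m30s
      endpoint: http://localhost:4040
      # application_names is omitted, so metrics are collected for all
      # Spark applications running at this endpoint.
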
6879

6980
Settings
7081
======================
7182

72-
The following table shows the configuration options for the Apache Web Server receiver:
83+
The following table shows the configuration options for the Apache Spark receiver:
7384

7485
.. raw:: html
7586

76-
<div class="metrics-standard" category="included" url="https://raw.githubusercontent.com/splunk/collector-config-tools/main/cfg-metadata/receiver/apache.yaml"></div>
87+
<div class="metrics-standard" category="included" url="https://raw.githubusercontent.com/splunk/collector-config-tools/main/cfg-metadata/receiver/apachespark.yaml"></div>
7788

7889
Metrics
7990
======================
@@ -84,7 +95,7 @@ The following metrics, resource attributes, and attributes are available.
8495

8596
.. raw:: html
8697

87-
<div class="metrics-component" category="included" url="https://raw.githubusercontent.com/splunk/collector-config-tools/main/metric-metadata/apachereceiver.yaml"></div>
98+
<div class="metrics-component" category="included" url="https://raw.githubusercontent.com/splunk/collector-config-tools/main/metric-metadata/apachesparkreceiver.yaml"></div>
8899

89100
.. include:: /_includes/activate-deactivate-native-metrics.rst
90101
