Spark History Server support for Azure Data Lake Storage Gen2 (ADLS) #464

@siegfriedweber

Description

Spark application logs should be storable in Azure Data Lake Storage Gen2 (ADLS).

The hadoop-azure module must be added to the Spark image, and the spec.logFileDirectory structure could be extended with an adls variant; currently only s3 is supported. Alternatively, it should be possible to specify a custom log directory.

The SAS token should be read from a Secret.
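As a rough illustration of the custom log directory alternative, the sketch below builds an ABFS URI and the Spark properties that would point event logging and the history server at it. The helper names, the container `spark-logs`, the account `myaccount`, and the path are hypothetical; the property keys are the standard Spark ones. Authentication (for example, passing the SAS token from a Secret to hadoop-azure) is deliberately left out, and how the operator would wire these values in is not shown here.

```python
def abfs_uri(container: str, account: str, path: str = "", secure: bool = True) -> str:
    """Build an ADLS Gen2 URI of the form abfs[s]://<container>@<account>.dfs.core.windows.net/<path>."""
    scheme = "abfss" if secure else "abfs"
    clean_path = path.strip("/")  # avoid doubled or trailing slashes
    return f"{scheme}://{container}@{account}.dfs.core.windows.net/{clean_path}"


def application_event_log_properties(log_dir: str) -> dict:
    """Spark properties a Spark application needs in order to write event logs to log_dir."""
    return {
        "spark.eventLog.enabled": "true",
        "spark.eventLog.dir": log_dir,
    }


def history_server_properties(log_dir: str) -> dict:
    """Spark property the history server needs in order to read event logs from log_dir."""
    return {"spark.history.fs.logDirectory": log_dir}


# Hypothetical container, account, and path, for illustration only.
log_dir = abfs_uri("spark-logs", "myaccount", "/eventlogs/")
app_props = application_event_log_properties(log_dir)
history_props = history_server_properties(log_dir)
```

The application and the history server must be given the same directory, which is why both helpers take the same `log_dir` value.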

Documentation

https://docs.stackable.tech/home/nightly/spark-k8s/usage-guide/history-server#_custom_log_directory

Release Notes

New / extended platform features

Other product features

  • Apache Spark: A custom log directory can be specified for the event logs, so that a location other than an S3 bucket can be used, e.g. HDFS or ABFS.

Metadata

Projects

Status

Done
