Skip to content

Commit 48587c2

Browse files
msmygitsnyk-bot
andauthored
[Snyk] Security upgrade org.apache.spark:spark-sql_2.13 from 3.5.3 to 3.5.4 (#339)
* fix: pom.xml to reduce vulnerabilities The following vulnerabilities are fixed with an upgrade: - https://snyk.io/vuln/SNYK-JAVA-ORGECLIPSEJETTY-8186142 * Bump Spark to 3.5.4 * Upgrade docker cassandra to 5 --------- Co-authored-by: snyk-bot <[email protected]>
1 parent 433df0a commit 48587c2

File tree

5 files changed

+14
-10
lines changed

5 files changed

+14
-10
lines changed

Dockerfile

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -9,9 +9,9 @@ RUN mkdir -p /assets/ && cd /assets && \
99
curl -OL https://downloads.datastax.com/enterprise/cqlsh-astra.tar.gz && \
1010
tar -xzf ./cqlsh-astra.tar.gz && \
1111
rm ./cqlsh-astra.tar.gz && \
12-
curl -OL https://archive.apache.org/dist/spark/spark-3.5.3/spark-3.5.3-bin-hadoop3-scala2.13.tgz && \
13-
tar -xzf ./spark-3.5.3-bin-hadoop3-scala2.13.tgz && \
14-
rm ./spark-3.5.3-bin-hadoop3-scala2.13.tgz
12+
curl -OL https://archive.apache.org/dist/spark/spark-3.5.4/spark-3.5.4-bin-hadoop3-scala2.13.tgz && \
13+
tar -xzf ./spark-3.5.4-bin-hadoop3-scala2.13.tgz && \
14+
rm ./spark-3.5.4-bin-hadoop3-scala2.13.tgz
1515

1616
RUN apt-get update && apt-get install -y openssh-server vim python3 --no-install-recommends && \
1717
rm -rf /var/lib/apt/lists/* && \
@@ -44,7 +44,7 @@ RUN chmod +x ./get-latest-maven-version.sh && \
4444
rm -rf "$USER_HOME_DIR/.m2"
4545

4646
# Add all migration tools to path
47-
ENV PATH="${PATH}:/assets/dsbulk/bin/:/assets/cqlsh-astra/bin/:/assets/spark-3.5.3-bin-hadoop3-scala2.13/bin/"
47+
ENV PATH="${PATH}:/assets/dsbulk/bin/:/assets/cqlsh-astra/bin/:/assets/spark-3.5.4-bin-hadoop3-scala2.13/bin/"
4848

4949
EXPOSE 22
5050

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88
Migrate and Validate Tables between Origin and Target Cassandra Clusters.
99

1010
> [!IMPORTANT]
11-
> Please note this job has been tested with spark version [3.5.3](https://archive.apache.org/dist/spark/spark-3.5.3/)
11+
> Please note this job has been tested with spark version [3.5.4](https://archive.apache.org/dist/spark/spark-3.5.4/)
1212
1313
## Install as a Container
1414
- Get the latest image that includes all dependencies from [DockerHub](https://hub.docker.com/r/datastax/cassandra-data-migrator)
@@ -20,14 +20,14 @@ Migrate and Validate Tables between Origin and Target Cassandra Clusters.
2020
### Prerequisite
2121
- **Java11** (minimum) as Spark binaries are compiled with it.
2222
- **Spark `3.5.x` with Scala `2.13` and Hadoop `3.3`**
23-
- Typically installed using [this binary](https://archive.apache.org/dist/spark/spark-3.5.3/spark-3.5.3-bin-hadoop3-scala2.13.tgz) on a single VM (no cluster necessary) where you want to run this job. This simple setup is recommended for most one-time migrations.
23+
- Typically installed using [this binary](https://archive.apache.org/dist/spark/spark-3.5.4/spark-3.5.4-bin-hadoop3-scala2.13.tgz) on a single VM (no cluster necessary) where you want to run this job. This simple setup is recommended for most one-time migrations.
2424
- However we recommend using a Spark Cluster or a Spark Serverless platform like `Databricks` or `Google Dataproc` (that supports the above mentioned versions) for large (e.g. several terabytes) complex migrations OR when CDM is used as a long-term data-transfer utility and not a one-time job.
2525

2626
Spark can be installed by running the following: -
2727

2828
```
29-
wget https://archive.apache.org/dist/spark/spark-3.5.3/spark-3.5.3-bin-hadoop3-scala2.13.tgz
30-
tar -xvzf spark-3.5.3-bin-hadoop3-scala2.13.tgz
29+
wget https://archive.apache.org/dist/spark/spark-3.5.4/spark-3.5.4-bin-hadoop3-scala2.13.tgz
30+
tar -xvzf spark-3.5.4-bin-hadoop3-scala2.13.tgz
3131
```
3232

3333
> [!CAUTION]

RELEASE.md

Lines changed: 4 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -1,5 +1,9 @@
11
# Release Notes
22

3+
## [5.2.0] - 2025-xx-xx
4+
- Upgraded to use Spark `3.5.4`.
5+
- Cassandra Docker image tag is now set to `cassandra:5`.
6+
37
## [5.1.4] - 2024-12-04
48
- Bug fix: Any run started with a `previousRunId` that is not found in the `cdm_run_info` table (for whatever reason), will be executed as a fresh new run instead of doing nothing.
59

SIT/environment.sh

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -68,7 +68,7 @@ fi
6868
###
6969
# These variables are hard-coded for now
7070
SUBNET=$(echo ${CIDR} | cut -d. -f1-3)
71-
CASS_VERSION=5.0
71+
CASS_VERSION=5
7272
CDM_VERSION=latest
7373
#==============================================================================================================================
7474
# Helper Functions

pom.xml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@
1010
<project.build.sourceEncoding>UTF-8</project.build.sourceEncoding>
1111
<scala.version>2.13.15</scala.version>
1212
<scala.main.version>2.13</scala.main.version>
13-
<spark.version>3.5.3</spark.version>
13+
<spark.version>3.5.4</spark.version>
1414
<connector.version>3.5.1</connector.version>
1515
<cassandra.version>5.0-rc1</cassandra.version>
1616
<junit.version>5.9.1</junit.version>

0 commit comments

Comments
 (0)