Skip to content

Commit b49b00c

Browse files
authored
Update to Spark 3.5.5 (#345)
1 parent 8fe5329 commit b49b00c

File tree

3 files changed

+9
-9
lines changed

3 files changed

+9
-9
lines changed

Dockerfile

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -9,9 +9,9 @@ RUN mkdir -p /assets/ && cd /assets && \
99
curl -OL https://downloads.datastax.com/enterprise/cqlsh-astra.tar.gz && \
1010
tar -xzf ./cqlsh-astra.tar.gz && \
1111
rm ./cqlsh-astra.tar.gz && \
12-
curl -OL https://archive.apache.org/dist/spark/spark-3.5.4/spark-3.5.4-bin-hadoop3-scala2.13.tgz && \
13-
tar -xzf ./spark-3.5.4-bin-hadoop3-scala2.13.tgz && \
14-
rm ./spark-3.5.4-bin-hadoop3-scala2.13.tgz
12+
curl -OL https://archive.apache.org/dist/spark/spark-3.5.5/spark-3.5.5-bin-hadoop3-scala2.13.tgz && \
13+
tar -xzf ./spark-3.5.5-bin-hadoop3-scala2.13.tgz && \
14+
rm ./spark-3.5.5-bin-hadoop3-scala2.13.tgz
1515

1616
RUN apt-get update && apt-get install -y openssh-server vim python3 --no-install-recommends && \
1717
rm -rf /var/lib/apt/lists/* && \
@@ -44,7 +44,7 @@ RUN chmod +x ./get-latest-maven-version.sh && \
4444
rm -rf "$USER_HOME_DIR/.m2"
4545

4646
# Add all migration tools to path
47-
ENV PATH="${PATH}:/assets/dsbulk/bin/:/assets/cqlsh-astra/bin/:/assets/spark-3.5.4-bin-hadoop3-scala2.13/bin/"
47+
ENV PATH="${PATH}:/assets/dsbulk/bin/:/assets/cqlsh-astra/bin/:/assets/spark-3.5.5-bin-hadoop3-scala2.13/bin/"
4848

4949
EXPOSE 22
5050

README.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -8,7 +8,7 @@
88
Migrate and Validate Tables between Origin and Target Cassandra Clusters.
99

1010
> [!IMPORTANT]
11-
> Please note this job has been tested with spark version [3.5.4](https://archive.apache.org/dist/spark/spark-3.5.4/)
11+
> Please note this job has been tested with spark version [3.5.5](https://archive.apache.org/dist/spark/spark-3.5.5/)
1212
1313
## Install as a Container
1414
- Get the latest image that includes all dependencies from [DockerHub](https://hub.docker.com/r/datastax/cassandra-data-migrator)
@@ -22,14 +22,14 @@ Migrate and Validate Tables between Origin and Target Cassandra Clusters.
2222
### Prerequisite
2323
- **Java11** (minimum) as Spark binaries are compiled with it.
2424
- **Spark `3.5.x` with Scala `2.13` and Hadoop `3.3`**
25-
- Typically installed using [this binary](https://archive.apache.org/dist/spark/spark-3.5.4/spark-3.5.4-bin-hadoop3-scala2.13.tgz) on a single VM (no cluster necessary) where you want to run this job. This simple setup is recommended for most one-time migrations.
25+
- Typically installed using [this binary](https://archive.apache.org/dist/spark/spark-3.5.5/spark-3.5.5-bin-hadoop3-scala2.13.tgz) on a single VM (no cluster necessary) where you want to run this job. This simple setup is recommended for most one-time migrations.
2626
- However we recommend using a Spark Cluster or a Spark Serverless platform like `Databricks` or `Google Dataproc` (that supports the above mentioned versions) for large (e.g. several terabytes) complex migrations OR when CDM is used as a long-term data-transfer utility and not a one-time job.
2727

2828
Spark can be installed by running the following: -
2929

3030
```
31-
wget https://archive.apache.org/dist/spark/spark-3.5.4/spark-3.5.4-bin-hadoop3-scala2.13.tgz
32-
tar -xvzf spark-3.5.4-bin-hadoop3-scala2.13.tgz
31+
wget https://archive.apache.org/dist/spark/spark-3.5.5/spark-3.5.5-bin-hadoop3-scala2.13.tgz
32+
tar -xvzf spark-3.5.5-bin-hadoop3-scala2.13.tgz
3333
```
3434

3535
> [!CAUTION]

RELEASE.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -1,7 +1,7 @@
11
# Release Notes
22

33
## [5.2.0] - 2025-xx-xx
4-
- Upgraded to use Spark `3.5.4`.
4+
- Upgraded to use Spark `3.5.5`.
55
- Cassandra Docker image tag is now set to `cassandra:5`.
66

77
## [5.1.4] - 2024-12-04

0 commit comments

Comments
 (0)