2
2
3
3
Migrate and Validate Tables between Origin and Target Cassandra Clusters.
4
4
5
- > :warning : Please note this job has been tested with spark version [ 2.4.8 ] ( https://archive.apache.org/dist/spark/spark-2.4.8 / )
5
+ > :warning : Please note this job has been tested with spark version [ 3.3.1 ] ( https://archive.apache.org/dist/spark/spark-3.3.1 / )
6
6
7
7
## Container Image
8
8
- Get the latest image that includes all dependencies from [ DockerHub] ( https://hub.docker.com/r/datastax/cassandra-data-migrator )
@@ -16,15 +16,15 @@ Migrate and Validate Tables between Origin and Target Cassandra Clusters.
16
16
- Install single instance of spark on a node where you want to run this job. Spark can be installed by running the following: -
17
17
18
18
```
19
- wget https://downloads .apache.org/spark/spark-2.4.8/
19
+ wget https://archive .apache.org/dist/ spark/spark-3.3.1/spark-3.3.1-bin-hadoop3.tgz
20
20
tar -xvzf <spark downloaded file name>
21
21
```
22
22
23
23
### Build
24
24
1 . Clone this repo
25
25
2 . Move to the repo folder ` cd cassandra-data-migrator `
26
26
3 . Run the build ` mvn clean package `
27
- 4 . The fat jar (` cassandra-data-migrator-2 .x.x.jar ` ) file should now be present in the ` target ` folder
27
+ 4 . The fat jar (` cassandra-data-migrator-3 .x.x.jar ` ) file should now be present in the ` target ` folder
28
28
29
29
# Steps for Data-Migration:
30
30
@@ -36,7 +36,7 @@ tar -xvzf <spark downloaded file name>
36
36
```
37
37
./spark-submit --properties-file sparkConf.properties /
38
38
--master "local[*]" /
39
- --class datastax.astra.migrate.Migrate cassandra-data-migrator-2 .x.x.jar &> logfile_name.txt
39
+ --class datastax.astra.migrate.Migrate cassandra-data-migrator-3 .x.x.jar &> logfile_name.txt
40
40
```
41
41
42
42
Note: Above command also generates a log file ` logfile_name.txt ` to avoid log output on the console.
@@ -49,7 +49,7 @@ Note: Above command also generates a log file `logfile_name.txt` to avoid log ou
49
49
```
50
50
./spark-submit --properties-file sparkConf.properties /
51
51
--master "local[*]" /
52
- --class datastax.astra.migrate.DiffData cassandra-data-migrator-2 .x.x.jar &> logfile_name.txt
52
+ --class datastax.astra.migrate.DiffData cassandra-data-migrator-3 .x.x.jar &> logfile_name.txt
53
53
```
54
54
55
55
- Validation job will report differences as “ERRORS” in the log file as shown below
@@ -78,7 +78,7 @@ spark.target.autocorrect.mismatch true|false
78
78
```
79
79
./spark-submit --properties-file sparkConf.properties /
80
80
--master "local[*]" /
81
- --class datastax.astra.migrate.MigratePartitionsFromFile cassandra-data-migrator-2 .x.x.jar &> logfile_name.txt
81
+ --class datastax.astra.migrate.MigratePartitionsFromFile cassandra-data-migrator-3 .x.x.jar &> logfile_name.txt
82
82
```
83
83
84
84
When running in above mode the tool assumes a ` partitions.csv ` file to be present in the current folder in the below format, where each line (` min,max ` ) represents a partition-range
0 commit comments