You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+21-18Lines changed: 21 additions & 18 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -31,7 +31,7 @@ tar -xvzf spark-3.5.3-bin-hadoop3-scala2.13.tgz
31
31
```
32
32
33
33
> [!CAUTION]
34
-
> If the above Spark and Scala version does not match, you may see an exception similar like below when running the CDM jobs,
34
+
> If the above Spark and Scala version does not match, you may see an exception like below when running the CDM jobs,
35
35
```
36
36
Exception in thread "main" java.lang.NoSuchMethodError: scala.runtime.Statics.releaseFence()V
37
37
```
@@ -41,24 +41,24 @@ Exception in thread "main" java.lang.NoSuchMethodError: scala.runtime.Statics.re
41
41
42
42
# Steps for Data-Migration:
43
43
44
-
1.`cdm.properties` file needs to be configured as applicable for the environment. Parameter descriptions and defaults are described in the file. The file can have any name, it does not need to be `cdm.properties`.
45
-
> * A simplified sample properties file configuration can be found here as [cdm.properties](./src/resources/cdm.properties)
46
-
> * A complete sample properties file configuration can be found here as [cdm-detailed.properties](./src/resources/cdm-detailed.properties)
44
+
1.`cdm.properties` file needs to be configured as applicable for the environment. The file can have any name, it does not need to be `cdm.properties`.
45
+
> * A sample properties file with default values can be found here as [cdm.properties](./src/resources/cdm.properties)
46
+
> * A complete reference properties file with default values can be found here as [cdm-detailed.properties](./src/resources/cdm-detailed.properties)
47
47
2. Place the properties file where it can be accessed while running the job via spark-submit.
48
-
3. Run the below job using `spark-submit` command as shown below:
48
+
3. Run the job using `spark-submit` command as shown below:
-The tool can be used to identify large fields from a table that may break you cluster guardrails (e.g. AstraDB has a 10MB limit for a single large field), use class option `--class com.datastax.cdm.job.GuardrailCheck` as shown below
124
+
-This mode can help identify large fields on an `origin` table that may break you cluster guardrails (e.g. AstraDB has a 10MB limit for a single large field), use class option `--class com.datastax.cdm.job.GuardrailCheck` as shown below
- Including counter table [Counter tables](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_using/useCountersConcept.html)
137
-
- Rerun job from where the previous job had stopped for any reason (killed, had exceptions, etc.)
140
+
- Rerun/Resume a previous job that may have stopped for any reason (killed, had exceptions, etc.)
138
141
- If you rerun a `validation` job, it will include any token-ranges that had differences in the previous run
139
142
- Preserve [writetimes](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_reference/cql_commands/cqlSelect.html#cqlSelect__retrieving-the-datetime-a-write-occurred-p) and [TTLs](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_reference/cql_commands/cqlSelect.html#cqlSelect__ref-select-ttl-p)
140
143
- Supports migration/validation of advanced DataTypes ([Sets](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_reference/refDataTypes.html#refDataTypes__set), [Lists](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_reference/refDataTypes.html#refDataTypes__list), [Maps](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_reference/refDataTypes.html#refDataTypes__map), [UDTs](https://docs.datastax.com/en/dse/6.8/cql/cql/cql_reference/refDataTypes.html#refDataTypes__udt))
@@ -184,7 +187,7 @@ Below recommendations may only be useful when migrating large tables where the d
184
187
1. Clone this repo
185
188
2. Move to the repo folder `cd cassandra-data-migrator`
186
189
3. Run the build `mvn clean package` (Needs Maven 3.9.x)
187
-
4. The fat jar (`cassandra-data-migrator-4.x.x.jar`) file should now be present in the `target` folder
190
+
4. The fat jar (`cassandra-data-migrator-5.x.x.jar`) file should now be present in the `target` folder
188
191
189
192
# Contributors
190
193
Checkout all our wonderful contributors [here](./CONTRIBUTING.md#contributors).
Copy file name to clipboardExpand all lines: RELEASE.md
+3Lines changed: 3 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,4 +1,7 @@
1
1
# Release Notes
2
+
## [5.1.2] - 2024-11-26
3
+
- Bug fix: SCB file on some Spark worker nodes may get deleted before the connection is established, which may cause connection exception on that worker node. Added a static async SCB delete delay to address such issues.
4
+
2
5
## [5.1.1] - 2024-11-22
3
6
- Bug fix: Writetime filter does not work as expected when custom writetimestamp is also used (issue #327).
4
7
- Removed deprecated properties `printStatsAfter` and `printStatsPerPart`. Run metrics should now be tracked using the `trackRun` feature instead.
0 commit comments