You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+5-4Lines changed: 5 additions & 4 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -26,8 +26,9 @@ tar -xvzf spark-3.3.1-bin-hadoop3.tgz
26
26
27
27
> :warning: Note that Version 4 of the tool is not backward-compatible with .properties files created in previous versions, and that package names have changed.
28
28
29
-
1.`cdm.properties` file needs to be configured as applicable for the environment. Parameter descriptions and defaults are described in the file.
30
-
> A sample properties file configuration can be [found here](./src/resources/cdm.properties)
29
+
1.`cdm.properties` file needs to be configured as applicable for the environment. Parameter descriptions and defaults are described in the file. The file can have any name, it does not need to be `cdm.properties`.
30
+
> * A simplified sample properties file configuration can be found here as [cdm.properties](./src/resources/cdm.properties)
31
+
> * A complete sample properties file configuration can be found here as [cdm-detailed.properties](./src/resources/cdm-detailed.properties)
31
32
2. Place the properties file where it can be accessed while running the job via spark-submit.
32
33
3. Run the below job using `spark-submit` command as shown below:
33
34
@@ -119,12 +120,12 @@ When running in above mode the tool assumes a `partitions.csv` file to be presen
119
120
# Perform large-field Guardrail violation checks
120
121
- The tool can be used to identify large fields from a table that may break you cluster guardrails (e.g. AstraDB has a 10MB limit for a single large field) `--class datastax.astra.migrate.Guardrail` as shown below
0 commit comments