modules/ROOT/pages/tools/neo4j-admin/neo4j-admin-import.adoc
32 additions & 20 deletions
@@ -136,7 +136,15 @@ The additional parameter `--input-type [csv|parquet]` is introduced to explicitl
 Its value defaults to CSV if it is not defined.
 
 Most of the parameters that can be used to configure the import are also valid for the Parquet format.
-There are indicators in the <<full-import-options-table, `neo4j-admin database import full` options>> table to point out which parameters are not supported.
+The following parameters are not supported (see <<full-import-options-table, `neo4j-admin database import full` options>> table for more details):
+
+- `--auto-skip-subsequent-headers`
+- `--delimiter`
+- `--ignore-extra-columns`
+- `--input-encoding`
+- `--multiline-fields`
+- `--quote`
+- `--trim-strings`
 
 The xref:tools/neo4j-admin/neo4j-admin-import.adoc#import-tool-examples[examples] for CSV can also be used with Parquet.
 ====
@@ -168,15 +176,15 @@ For horizontal tabulation (HT), use `\t` or the Unicode character ID `\9`.
 Unicode character ID can be used if prepended by `\`.
 |;
 
-| --auto-skip-subsequent-headers[=true\|false]^1^
+| --auto-skip-subsequent-headers[=true\|false]
 |Automatically skip accidental header lines in subsequent files in file groups with more than one file.
 |false
 
 |--bad-tolerance=<num>
 |Number of bad entries before the import is aborted. The import process is optimized for error-free data. Therefore, cleaning the data before importing it is highly recommended. If you encounter any bad entries during the import process, you can set the number of bad entries to a specific value that suits your needs. However, setting a high value may affect the performance of the tool.
 |1000
 
-|--delimiter=<char>^1^
+|--delimiter=<char>
 |Delimiter character between values in CSV data. Also accepts `TAB` and e.g. `U+20AC` for specifying a character using Unicode.
 
 ====
@@ -225,11 +233,11 @@ Possible values are:
 |Whether or not empty string fields, i.e. "" from input source are ignored, i.e. treated as null.
 |false
 
-|--ignore-extra-columns[=true\|false]^1^
+|--ignore-extra-columns[=true\|false]
 |If unspecified columns should be ignored during the import.
 |false
 
-|--input-encoding=<character-set>^1^
+|--input-encoding=<character-set>
 |Character set that input data is encoded in.
 |UTF-8
 
@@ -249,7 +257,7 @@ Values can be plain numbers, such as `10000000`, or `20G` for 20 gigabytes.
 It can also be specified as a percentage of the available memory, for example `70%`.
 |90%
 
-|--multiline-fields[=true\|false]^1^
+|--multiline-fields[=true\|false]
 |Whether or not fields from an input source can span multiple lines, i.e. contain newline characters.
 
 Setting `--multiline-fields=true` can severely degrade the performance of the importer.
@@ -277,7 +285,7 @@ For an example, see <<import-tool-multiple-input-files-regex-example>>.
 |Delete any existing database files prior to the import.
 |false
 
-|--quote=<char>^1^
+|--quote=<char>
 |Character to treat as quotation character for values in CSV data.
 
 Quotes can be escaped as per link:{rfc-4180}[RFC 4180] by doubling them, for example `""` would be interpreted as a literal `"`.
@@ -352,7 +360,7 @@ If enabled all those relationships will be found but at the cost of lower perfor
 performance, this value should not be greater than the number of available processors.
 |20
 
-|--trim-strings[=true\|false]^1^
+|--trim-strings[=true\|false]
 |Whether or not strings should be trimmed for whitespaces.
 |false
 
@@ -361,8 +369,6 @@ performance, this value should not be greater than the number of available proce
 |
 |===
 
-^1^ __Ignored by Parquet import label:beta[].__ +
-
 [NOTE]
 .Heap size for the import
 ====
@@ -676,7 +682,15 @@ The additional parameter `--input-type [csv|parquet]` is introduced to explicitl
 Its value defaults to CSV if it is not defined.
 
 Most of the parameters that can be used to configure the import are also valid for the Parquet format.
-There are indicators in the <<incremental-import-options-table, `neo4j-admin database import incremental` options>> table to point out which parameters are not supported.
+The following parameters are not supported (see <<incremental-import-options-table, `neo4j-admin database import incremental` options>> table for more details):
+
+- `--auto-skip-subsequent-headers`
+- `--delimiter`
+- `--ignore-extra-columns`
+- `--input-encoding`
+- `--multiline-fields`
+- `--quote`
+- `--trim-strings`
 
 The xref:tools/neo4j-admin/neo4j-admin-import.adoc#import-tool-examples[examples] for CSV can also be used with Parquet.
 ====
@@ -708,15 +722,15 @@ For horizontal tabulation (HT), use `\t` or the Unicode character ID `\9`.
 Unicode character ID can be used if prepended by `\`.
 |;
 
-| --auto-skip-subsequent-headers[=true\|false]^1^
+| --auto-skip-subsequent-headers[=true\|false]
 |Automatically skip accidental header lines in subsequent files in file groups with more than one file.
 |false
 
 |--bad-tolerance=<num>
 |Number of bad entries before the import is aborted. The import process is optimized for error-free data. Therefore, cleaning the data before importing it is highly recommended. If you encounter any bad entries during the import process, you can set the number of bad entries to a specific value that suits your needs. However, setting a high value may affect the performance of the tool.
 |1000
 
-|--delimiter=<char>^1^
+|--delimiter=<char>
 |Delimiter character between values in CSV data. Also accepts `TAB` and e.g. `U+20AC` for specifying a character using Unicode.
 
 ====
@@ -763,11 +777,11 @@ Possible values are:
 |Whether or not empty string fields, i.e. "" from input source are ignored, i.e. treated as null.
 |false
 
-|--ignore-extra-columns[=true\|false]^1^
+|--ignore-extra-columns[=true\|false]
 |If unspecified columns should be ignored during the import.
 |false
 
-|--input-encoding=<character-set>^1^
+|--input-encoding=<character-set>
 |Character set that input data is encoded in.
 |UTF-8
 
@@ -787,7 +801,7 @@ Values can be plain numbers, such as `10000000`, or `20G` for 20 gigabytes.
 It can also be specified as a percentage of the available memory, for example `70%`.
 |90%
 
-|--multiline-fields[=true\|false]^1^
+|--multiline-fields[=true\|false]
 |Whether or not fields from an input source can span multiple lines, i.e. contain newline characters.
 
 Setting `--multiline-fields=true` can severely degrade the performance of the importer.
@@ -811,7 +825,7 @@ For an example, see <<import-tool-multiple-input-files-regex-example>>.
 |When `true`, non-array property values are converted to their equivalent Cypher types. For example, all integer values will be converted to 64-bit long integers.
 | true
 
-|--quote=<char>^1^
+|--quote=<char>
 |Character to treat as quotation character for values in CSV data.
 
 Quotes can be escaped as per link:{rfc-4180}[RFC 4180] by doubling them, for example `""` would be interpreted as a literal `"`.
@@ -895,7 +909,7 @@ If enabled all those relationships will be found but at the cost of lower perfor
 performance, this value should not be greater than the number of available processors.
 |20
 
-|--trim-strings[=true\|false]^1^
+|--trim-strings[=true\|false]
 |Whether or not strings should be trimmed for whitespaces.
 |false
 
@@ -904,8 +918,6 @@ performance, this value should not be greater than the number of available proce
 |
 |===
 
-^1^ __Ignored by Parquet import label:beta[].__ +
-
 [NOTE]
 .Using both a multi-value option and a positional parameter