docs/source/user-guide/latest/configs.md
Lines changed: 1 addition & 0 deletions
@@ -107,6 +107,7 @@ These settings can be used to determine which parts of the plan are accelerated
|`spark.comet.exec.shuffle.compression.codec`| The codec of Comet native shuffle used to compress shuffle data. lz4, zstd, and snappy are supported. Compression can be disabled by setting spark.shuffle.compress=false. | lz4 |
|`spark.comet.exec.shuffle.compression.zstd.level`| The compression level to use when compressing shuffle files with zstd. | 1 |
|`spark.comet.exec.shuffle.enabled`| Whether to enable Comet native shuffle. Note that this requires setting `spark.shuffle.manager` to `org.apache.spark.sql.comet.execution.shuffle.CometShuffleManager`. `spark.shuffle.manager` must be set before starting the Spark application and cannot be changed during the application. | true |
+ |`spark.comet.exec.shuffle.writeBufferSize`| Size of the write buffer in bytes used by the native shuffle writer when writing shuffle data to disk. Larger values may improve write performance by reducing the number of system calls, but will use more memory. The default is 1MB which provides a good balance between performance and memory usage. | 1048576b |
|`spark.comet.native.shuffle.partitioning.hash.enabled`| Whether to enable hash partitioning for Comet native shuffle. | true |
|`spark.comet.native.shuffle.partitioning.range.enabled`| Whether to enable range partitioning for Comet native shuffle. | true |
|`spark.comet.shuffle.preferDictionary.ratio`| The ratio of total values to distinct values in a string column to decide whether to prefer dictionary encoding when shuffling the column. If the ratio is higher than this config, dictionary encoding will be used on shuffling string column. This config is effective if it is higher than 1.0. Note that this config is only used when `spark.comet.exec.shuffle.mode` is `jvm`. | 10.0 |
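
As a rough illustration of how the new `spark.comet.exec.shuffle.writeBufferSize` setting might be combined with the neighbouring shuffle options, the sketch below assembles `--conf` arguments for `spark-submit`. The helper names `comet_shuffle_conf` and `to_submit_args` are hypothetical and are not part of Comet or Spark. The `<bytes>b` value format simply mirrors the `1048576b` default shown in the table; whether other size suffixes are accepted is not stated here, so treat that as an assumption.

```python
# Hypothetical helpers for illustration only; neither function exists in Comet or Spark.

# Shuffle manager class named in the spark.comet.exec.shuffle.enabled row of the table.
SHUFFLE_MANAGER = "org.apache.spark.sql.comet.execution.shuffle.CometShuffleManager"

# Codecs listed as supported in the spark.comet.exec.shuffle.compression.codec row.
SUPPORTED_CODECS = ("lz4", "zstd", "snappy")


def comet_shuffle_conf(write_buffer_bytes=1048576, codec="lz4"):
    """Return Spark config key/value pairs for Comet native shuffle."""
    if write_buffer_bytes <= 0:
        raise ValueError("write_buffer_bytes must be positive")
    if codec not in SUPPORTED_CODECS:
        raise ValueError(f"unsupported codec: {codec}")
    return {
        # Must be set before the application starts, per the table.
        "spark.shuffle.manager": SHUFFLE_MANAGER,
        "spark.comet.exec.shuffle.enabled": "true",
        "spark.comet.exec.shuffle.compression.codec": codec,
        # Assumption: a byte count with a "b" suffix, mirroring the documented default.
        "spark.comet.exec.shuffle.writeBufferSize": f"{write_buffer_bytes}b",
    }


def to_submit_args(conf):
    """Flatten a config dict into a list of spark-submit --conf arguments."""
    args = []
    for key, value in conf.items():
        args += ["--conf", f"{key}={value}"]
    return args


if __name__ == "__main__":
    # Example: a 4 MiB write buffer, trading memory for fewer system calls.
    conf = comet_shuffle_conf(write_buffer_bytes=4 * 1024 * 1024)
    print(" ".join(to_submit_args(conf)))
```

A larger buffer only helps if write system calls are a bottleneck, so it is probably worth measuring shuffle write time before and after changing the value rather than assuming a gain.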