Skip to content

Commit 0d6a4d5

Browse files
authored
docs: Improve Hive storage format and compression codec documentation (#27216)
Enhanced documentation for hive.storage-format and hive.compression-codec properties to match the detail level of Iceberg connector docs. Changes: Listed all available storage formats (ORC, PARQUET, AVRO, RCBINARY, RCTEXT, SEQUENCEFILE, JSON, TEXTFILE) Listed all available compression codecs (NONE, SNAPPY, GZIP, LZ4, ZSTD) Added format-specific codec compatibility notes (LZ4 for ORC only, ZSTD for ORC/PARQUET) Fixes #26384 == NO RELEASE NOTE ==
1 parent 1fc8fad commit 0d6a4d5

File tree

1 file changed

+12
-3
lines changed
  • presto-docs/src/main/sphinx/connector

1 file changed

+12
-3
lines changed

presto-docs/src/main/sphinx/connector/hive.rst

Lines changed: 12 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -164,9 +164,18 @@ Property Name Description
164164
absolutely necessary to access HDFS.
165165
Example: ``/etc/hdfs-site.xml``
166166

167-
``hive.storage-format`` The default file format used when creating new tables. ``ORC``
168-
169-
``hive.compression-codec`` The compression codec to use when writing files. ``GZIP``
167+
``hive.storage-format`` The default file format used when creating new tables. The ``ORC``
168+
available values are ``ORC``, ``PARQUET``, ``AVRO``,
169+
``RCBINARY``, ``RCTEXT``, ``SEQUENCEFILE``, ``JSON``,
170+
and ``TEXTFILE``.
171+
172+
``hive.compression-codec`` The compression codec to use when writing files. The ``GZIP``
173+
available values are ``NONE``, ``SNAPPY``, ``GZIP``,
174+
``LZ4``, and ``ZSTD``.
175+
176+
Note: ``LZ4`` is only available when
177+
``hive.storage-format=ORC``. ``ZSTD`` is available
178+
for both ``ORC`` and ``PARQUET`` formats.
170179

171180
``hive.force-local-scheduling`` Force splits to be scheduled on the same node as the Hadoop ``false``
172181
DataNode process serving the split data. This is useful for

0 commit comments

Comments
 (0)