You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
`Iceberg` table engine and table function support metadata cache storing the information of manifest files, manifest list and metadata json. The cache is stored in memory. This feature is controlled by setting `use_iceberg_metadata_files_cache`, which is enabled by default.
298
298
299
+
## Altinity Antalya branch
300
+
301
+
### Specify storage type in arguments
302
+
303
+
Only in Altinity Antalya branch table engine `Iceberg` can work with all storages.
304
+
In this case storage may be specified with named argument `storage_type`.
305
+
Possible values are `s3`, `azure`, `hdfs`, `local`:
Only in Altinity Antalya branch alternative syntax for `Iceberg` table engine can be executed on cluster with non-empty setting `object_storage_cluster` with cluster name.
Table functions `s3Cluster`, `azureBlobStorageCluster`, `hdsfCluster`, `icebergCluster`, etc., as well as table engines `S3`, `Azure`, `HDFS`, `Iceberg` with setting `object_storage_cluster` distribute tasks between all cluster nodes, or only between `object_storage_max_nodes` cluster nodes. Setting `object_storage_max_nodes` limits number of nodes to process distributed query. In this case random nodes are choosed for each query.
4
+
5
+
Single task is a single source file.
6
+
7
+
For each file one of cluster nodes selected as primary node. Primary node selected with consistence Rendezvous Hashing algorythm, this algorythm guarantees that for each file sthe same node selected as primary when cluster not changed, and when cluster is changed only files from deleted nodes or with new nodes as primary are affected. This incerasy cache efficiency.
8
+
Each node starts to process files for which this node is primary. When node processes all those files it can take some files from other nodes. Node gets file immediately or only when primary node does not ask for new files in `lock_object_storage_task_distribution_ms` milliseconds. Settiing `lock_object_storage_task_distribution_ms` has default value in 500 milliseconds, and can be used to achieve balance between caching and processing files when some nodes are overloaded.
9
+
10
+
If node must be shutdowned in some time, command `SYSTEM STOP SWARM MODE` can be used to stop getting new tasks for `*Cluster`-family queries. In this case node stop getting new files, but processes alrready started files. When it processes all files, node can be shutdowned withou any error on initiator.
11
+
12
+
Getting new tasks can be reenabled with `SYSTEM START SWARM MODE` command.
Copy file name to clipboardExpand all lines: docs/en/sql-reference/table-functions/azureBlobStorageCluster.md
+14Lines changed: 14 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -53,6 +53,20 @@ SELECT count(*) FROM azureBlobStorageCluster(
53
53
54
54
See [azureBlobStorage](/sql-reference/table-functions/azureBlobStorage#using-shared-access-signatures-sas-sas-tokens) for examples.
55
55
56
+
## Altinity Antalya branch
57
+
58
+
### `object_storage_cluster` setting.
59
+
60
+
Only in Altinity Antalya branch alternative syntax for `azureBlobStorageCluster` table function is `azureBlobStorage` function with non-empty setting `object_storage_cluster` with cluster name.
Copy file name to clipboardExpand all lines: docs/en/sql-reference/table-functions/deltalakeCluster.md
+11Lines changed: 11 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -36,6 +36,17 @@ A table with the specified structure for reading data from cluster in the specif
36
36
-`_time` — Last modified time of the file. Type: `Nullable(DateTime)`. If the time is unknown, the value is `NULL`.
37
37
-`_etag` — The etag of the file. Type: `LowCardinality(String)`. If the etag is unknown, the value is `NULL`.
38
38
39
+
## Altinity Antalya branch
40
+
41
+
### `object_storage_cluster` setting.
42
+
43
+
Only in Altinity Antalya branch alternative syntax for `deltaLakeCluster` table function is `deltaLake` function with non-empty setting `object_storage_cluster` with cluster name.
44
+
45
+
```sql
46
+
SELECTcount(*) FROM deltaLake(url [,aws_access_key_id, aws_secret_access_key] [,format] [,structure] [,compression])
Copy file name to clipboardExpand all lines: docs/en/sql-reference/table-functions/hdfsCluster.md
+12Lines changed: 12 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -59,6 +59,18 @@ FROM hdfsCluster('cluster_simple', 'hdfs://hdfs1:9000/{some,another}_dir/*', 'TS
59
59
If your listing of files contains number ranges with leading zeros, use the construction with braces for each digit separately or use `?`.
60
60
:::
61
61
62
+
## Altinity Antalya branch
63
+
64
+
### `object_storage_cluster` setting.
65
+
66
+
Only in Altinity Antalya branch alternative syntax for `hdfsCluster` table function is `hdfs` function with non-empty setting `object_storage_cluster` with cluster name.
67
+
68
+
```sql
69
+
SELECTcount(*)
70
+
FROM hdfs('hdfs://hdfs1:9000/{some,another}_dir/*', 'TSV', 'name String, value UInt32')
Copy file name to clipboardExpand all lines: docs/en/sql-reference/table-functions/hudiCluster.md
+12Lines changed: 12 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -42,6 +42,18 @@ A table with the specified structure for reading data from cluster in the specif
42
42
-`_time` — Last modified time of the file. Type: `Nullable(DateTime)`. If the time is unknown, the value is `NULL`.
43
43
-`_etag` — The etag of the file. Type: `LowCardinality(String)`. If the etag is unknown, the value is `NULL`.
44
44
45
+
## Altinity Antalya branch
46
+
47
+
### `object_storage_cluster` setting.
48
+
49
+
Only in Altinity Antalya branch alternative syntax for `hudiCluster` table function is `hudi` function with non-empty setting `object_storage_cluster` with cluster name.
50
+
51
+
```sql
52
+
SELECT*
53
+
FROM hudi(url [,aws_access_key_id, aws_secret_access_key] [,format] [,structure] [,compression])
Copy file name to clipboardExpand all lines: docs/en/sql-reference/table-functions/icebergCluster.md
+78Lines changed: 78 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -49,6 +49,84 @@ SELECT * FROM icebergS3Cluster('cluster_simple', 'http://test.s3.amazonaws.com/c
49
49
-`_time` — Last modified time of the file. Type: `Nullable(DateTime)`. If the time is unknown, the value is `NULL`.
50
50
-`_etag` — The etag of the file. Type: `LowCardinality(String)`. If the etag is unknown, the value is `NULL`.
51
51
52
+
## Altinity Antalya branch
53
+
54
+
### `icebergLocalCluster` table function
55
+
56
+
Only in Altinity Antalya branch `icebergLocalCluster` is a function to make distributed cluster request when iceberg data is storend on shared network storage, mounted with local path.
Only in Altinity Antalya branch alternative syntax for `icebergCluster` table function is `iceberg` function with non-empty setting `object_storage_cluster` with cluster name.
Copy file name to clipboardExpand all lines: docs/en/sql-reference/table-functions/s3Cluster.md
+17Lines changed: 17 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -90,6 +90,23 @@ Users can use the same approaches as document for the s3 function [here](/sql-re
90
90
91
91
For details on optimizing the performance of the s3 function see [our detailed guide](/integrations/s3/performance).
92
92
93
+
## Altinity Antalya branch
94
+
95
+
### `object_storage_cluster` setting.
96
+
97
+
Only in Altinity Antalya branch alternative syntax for `s3Cluster` table function is `s3` function with non-empty setting `object_storage_cluster` with cluster name.
0 commit comments