You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/usage-guide/data-storage.adoc
+13-6Lines changed: 13 additions & 6 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,11 +1,18 @@
1
1
= Data storage backends
2
2
:description: Hive supports metadata storage on S3 and HDFS. Configure S3 with S3Connection and HDFS with configMap in clusterConfig.
3
3
4
-
Hive does not store data, only metadata. It can store metadata about data stored in various places. The Stackable Operator currently supports S3 and HFS.
4
+
You can operate the Hive metastore service (HMS) without S3 or HDFS.
5
+
Its whole purpose is to store metadata such as "Table foo has columns a, b and c and is stored as parquet in local://tmp/hive/foo".
5
6
6
-
== [[s3]]S3 support
7
+
However, as soon as you start storing metadata in the HMS that refers to a `s3a://` or `hdfs://` locations, HMS will actually do some operations on the filesystem. This can be e.g. checking if the table location exists, creating it in case it is missing.
7
8
8
-
Hive supports creating tables in S3 compatible object stores.
9
+
So if you are storing tables in S3 (or HDFS for that matter), you need to give the HMS access to that filesystem as well.
10
+
The Stackable Operator currently supports S3 and HFS.
11
+
12
+
[s3]
13
+
== S3 support
14
+
15
+
HMS supports creating tables in S3 compatible object stores.
9
16
To use this feature you need to provide connection details for the object store using the xref:concepts:s3.adoc[S3Connection] in the top level `clusterConfig`.
10
17
11
18
An example usage can look like this:
@@ -22,10 +29,10 @@ clusterConfig:
22
29
secretClass: simple-hive-s3-secret-class
23
30
----
24
31
32
+
[hdfs]
33
+
== Apache HDFS support
25
34
26
-
== [[hdfs]]Apache HDFS support
27
-
28
-
As well as S3, Hive also supports creating tables in HDFS.
35
+
As well as S3, HMS also supports creating tables in HDFS.
29
36
You can add the HDFS connection in the top level `clusterConfig` as follows:
0 commit comments