You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/getting_started/first_steps.adoc
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,8 +1,8 @@
1
1
= First steps
2
-
:description: Deploy and verify a Hive metastore cluster with PostgreSQL and MinIO. Follow our setup guide and ensure all pods are ready for operation.
2
+
:description: Deploy and verify a Hive metastore cluster with PostgreSQL and MinIO. Follow the setup guide and ensure all pods are ready for operation.
3
3
4
-
After going through the xref:getting_started/installation.adoc[] section and having installed all the operators, you will now deploy a Hive metastore cluster and it's dependencies.
5
-
Afterwards you can <<_verify_that_it_works, verify that it works>>.
4
+
After going through the xref:getting_started/installation.adoc[] section and having installed all the operators, deploy a Hive metastore cluster and it's dependencies.
5
+
Afterward you can <<_verify_that_it_works, verify that it works>>.
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/getting_started/index.adoc
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,12 +1,12 @@
1
1
= Getting started
2
2
:description: Learn to set up Apache Hive with the Stackable Operator. Includes installation, dependencies, and creating a Hive metastore on Kubernetes.
3
3
4
-
This guide will get you started with Apache Hive using the Stackable Operator.
5
-
It will guide you through the installation of the operator, its dependencies and setting up your first Hive metastore instance.
4
+
This guide gets you started with Apache Hive using the Stackable Operator.
5
+
It guides you through the installation of the operator, its dependencies and setting up your first Hive metastore instance.
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/getting_started/installation.adoc
+6-7Lines changed: 6 additions & 7 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,7 +1,7 @@
1
1
= Installation
2
-
:description: Install Stackable Operator for Apache Hive with MinIO and PostgreSQL using stackablectl or Helm. Follow our guide for easy setup and configuration.
2
+
:description: Install Stackable Operator for Apache Hive with MinIO and PostgreSQL using stackablectl or Helm. Follow the guide for easy setup and configuration.
3
3
4
-
On this page you will install the Stackable Operator for Apache Hive and all required dependencies.
4
+
On this page you install the Stackable operator for Apache Hive and all required dependencies.
5
5
For the installation of the dependencies and operators you can use Helm or `stackablectl`.
6
6
7
7
The `stackablectl` command line tool is the recommended way to interact with operators and dependencies.
@@ -10,7 +10,7 @@ Follow the xref:management:stackablectl:installation.adoc[installation steps] fo
10
10
== Dependencies
11
11
12
12
First you need to install MinIO and PostgreSQL instances for the Hive metastore.
13
-
PostgreSQL is required as a database for Hive's metadata, and MinIO will be used as a data store, which the Hive metastore also needs access to.
13
+
PostgreSQL is required as a database for Hive's metadata, and MinIO is used as a data store, which the Hive metastore also needs access to.
14
14
15
15
There are two ways to install the dependencies:
16
16
@@ -66,7 +66,7 @@ Now call `stackablectl` and reference those two files:
Helm will deploy the operators in a Kubernetes Deployment and apply the CRDs for the Apache Hive service (as well as the CRDs for the required operators).
136
-
You are now ready to deploy the Apache Hive metastore in Kubernetes.
135
+
Helm deploys the operators in a Kubernetes Deployment and apply the CRDs for the Apache Hive service (as well as the CRDs for the required operators).
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/index.adoc
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -19,7 +19,7 @@ This operator does not support deploying Hive itself, but xref:trino:index.adoc[
19
19
20
20
== Getting started
21
21
22
-
Follow the xref:getting_started/index.adoc[Getting started guide] which will guide you through installing the Stackable Hive operator and its dependencies.
22
+
Follow the xref:getting_started/index.adoc[Getting started guide] which guides you through the installation of the Stackable Hive operator and its dependencies.
23
23
It walks you through setting up a Hive metastore and connecting it to a demo Postgres database and a Minio instance to store data in.
24
24
25
25
Afterwards you can consult the xref:usage-guide/index.adoc[] to learn more about tailoring your Hive metastore configuration to your needs, or have a look at the <<demos, demos>> for some example setups with either xref:trino:index.adoc[Trino] or xref:spark-k8s:index.adoc[Spark].
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/usage-guide/configuration-environment-overrides.adoc
+15-5Lines changed: 15 additions & 5 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -40,15 +40,19 @@ metastore:
40
40
replicas: 1
41
41
----
42
42
43
-
All override property values must be strings. The properties will be formatted and escaped correctly into the XML file.
43
+
All override property values must be strings.
44
+
The properties are formatted and escaped correctly into the XML file.
44
45
45
46
For a full list of configuration options we refer to the Hive https://cwiki.apache.org/confluence/display/hive/configuration+properties[Configuration Reference].
46
47
47
48
== The security.properties file
48
49
49
-
The `security.properties` file is used to configure JVM security properties. It is very seldom that users need to tweak any of these, but there is one use-case that stands out, and that users need to be aware of: the JVM DNS cache.
50
+
The `security.properties` file is used to configure JVM security properties.
51
+
It is very seldom that users need to tweak any of these, but there is one use-case that stands out, and that users need to be aware of: the JVM DNS cache.
50
52
51
-
The JVM manages its own cache of successfully resolved host names as well as a cache of host names that cannot be resolved. Some products of the Stackable platform are very sensible to the contents of these caches and their performance is heavily affected by them. As of version 3.1.3 Apache Hive performs poorly if the positive cache is disabled. To cache resolved host names, you can configure the TTL of entries in the positive cache like this:
53
+
The JVM manages its own cache of successfully resolved host names as well as a cache of host names that cannot be resolved. Some products of the Stackable platform are very sensible to the contents of these caches and their performance is heavily affected by them.
54
+
As of version 3.1.3 Apache Hive performs poorly if the positive cache is disabled.
55
+
To cache resolved host names, you can configure the TTL of entries in the positive cache like this:
52
56
53
57
[source,yaml]
54
58
----
@@ -64,9 +68,10 @@ NOTE: The operator configures DNS caching by default as shown in the example abo
64
68
For details on the JVM security see https://docs.oracle.com/en/java/javase/11/security/java-security-overview1.html
65
69
66
70
67
-
== Environment Variables
71
+
== Environment variables
68
72
69
-
In a similar fashion, environment variables can be (over)written. For example per role group:
73
+
In a similar fashion, environment variables can be (over)written.
74
+
For example per role group:
70
75
71
76
[source,yaml]
72
77
----
@@ -91,3 +96,8 @@ metastore:
91
96
config: {}
92
97
replicas: 1
93
98
----
99
+
100
+
== Pod overrides
101
+
102
+
The Hive operator also supports Pod overrides, allowing you to override any property that you can set on a Kubernetes Pod.
103
+
Read the xref:concepts:overrides.adoc#pod-overrides[Pod overrides documentation] to learn more about this feature.
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/usage-guide/database-driver.adoc
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -88,7 +88,7 @@ spec:
88
88
mountPath: /stackable/externals
89
89
----
90
90
91
-
This will make the driver available at `/stackable/external-drivers/mysql-connector-j-8.0.31.jar` when the volume `external-drivers` is mounted at `/stackable/external-drivers`.
91
+
This makes the driver available at `/stackable/external-drivers/mysql-connector-j-8.0.31.jar` when the volume `external-drivers` is mounted at `/stackable/external-drivers`.
92
92
93
93
Once the above has completed successfully, you can confirm that the driver is in the expected location by running another job:
Copy file name to clipboardExpand all lines: docs/modules/hive/pages/usage-guide/derby-example.adoc
+3-3Lines changed: 3 additions & 3 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,9 +1,9 @@
1
1
= Derby example
2
2
:description: Deploy a single-node Apache Hive Metastore with Derby or PostgreSQL. Includes setup for S3 integration and tips for database configuration.
3
3
4
-
Please note that the version you need to specify is not only the version of Apache Hive which you want to roll out, but has to be amended with a Stackable version as shown.
4
+
The version you need to specify is not only the version of Apache Hive which you want to roll out, but has to be amended with a Stackable version as shown.
5
5
This Stackable version is the version of the underlying container image which is used to execute the processes.
6
-
For a list of available versions please check our https://repo.stackable.tech/#browse/browse:docker:v2%2Fstackable%2Fhive%2Ftags[image registry].
6
+
For a list of available versions check the https://repo.stackable.tech/#browse/browse:docker:v2%2Fstackable%2Fhive%2Ftags[image registry].
7
7
It should generally be safe to simply use the latest image version that is available.
8
8
9
9
.Create a single node Apache Hive Metastore cluster using Derby:
@@ -123,7 +123,7 @@ This is called `scram-sha-256` and has been the default as of PostgreSQL 14.
123
123
Unfortunately, Hive up until the latest 3.3.x version ships with JDBC drivers that do https://wiki.postgresql.org/wiki/List_of_drivers[_not_ support] this method.
124
124
You might see an error message like this:
125
125
`The authentication type 10 is not supported.`
126
-
If this is the case please either use an older PostgreSQL version or change its https://www.postgresql.org/docs/current/runtime-config-connection.html#GUC-PASSWORD-ENCRYPTION[`password_encryption`] setting to `md5`.
126
+
If this is the case, either use an older PostgreSQL version or change its https://www.postgresql.org/docs/current/runtime-config-connection.html#GUC-PASSWORD-ENCRYPTION[`password_encryption`] setting to `md5`.
127
127
128
128
This installs PostgreSQL in version 10 to work around the issue mentioned above:
0 commit comments