Skip to content

Commit aa465e1

Browse files
authored
Merge pull request #104878 from yanancai/master
update jdbc driver doc
2 parents ef046c7 + 34253d7 commit aa465e1

File tree

2 files changed

+21
-0
lines changed

2 files changed

+21
-0
lines changed

articles/hdinsight/hadoop/apache-hadoop-connect-hive-jdbc-driver.md

Lines changed: 21 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -32,6 +32,18 @@ JDBC connections to an HDInsight cluster on Azure are made over port 443, and th
3232

3333
Replace `CLUSTERNAME` with the name of your HDInsight cluster.
3434

35+
Or you can get the connection through **Ambari UI > Hive > Configs > Advanced**.
36+
37+
![Get JDBC connection string through Ambari](./media/apache-hadoop-connect-hive-jdbc-driver/hdinsight-get-connection-string-through-ambari.png)
38+
39+
### Host name in connection string
40+
41+
Host name 'CLUSTERNAME.azurehdinsight.net' in the connection string is the same as your cluster URL. You can get it through Azure portal.
42+
43+
### Port in connection string
44+
45+
You can only use **port 443** to connect to the cluster from some places outside of the Azure virtual network. HDInsight is a managed service, which means that all connections to the cluster are managed via a secure Gateway. You cannot connect to HiveServer 2 directly on ports 10001 or 10000 because these ports are not exposed to the outside.
46+
3547
## Authentication
3648

3749
When establishing the connection, you must use the HDInsight cluster admin name and password to authenticate to the cluster gateway. When connecting from JDBC clients such as SQuirreL SQL, you must enter the admin name and password in client settings.
@@ -134,6 +146,15 @@ at java.util.concurrent.FutureTask.get(FutureTask.java:206)
134146

135147
1. Restart SQuirreL. The error should no longer occur when connecting to Hive on HDInsight.
136148

149+
### Connection disconnected by HDInsight
150+
151+
**Symptoms**: When trying to download huge amount of data (say several GBs) through JDBC/ODBC, the connection is disconnected by HDInsight unexpectedly while downloading.
152+
153+
**Cause**: This error is caused by the limitation on Gateway nodes. When getting data from JDBC/ODBC, all data needs to pass through the Gateway node. However, a gateway is not designed to download a huge amount of data, so the connection might be closed by the Gateway if it cannot handle the traffic.
154+
155+
**Resolution**: Avoid using JDBC/ODBC driver to download huge amounts of data. Copy data directly from blob storage instead.
156+
157+
137158
## Next steps
138159

139160
Now that you've learned how to use JDBC to work with Hive, use the following links to explore other ways to work with Azure HDInsight.
Loading

0 commit comments

Comments
 (0)