Skip to content

Commit 770bc2f

Browse files
authored
Merge pull request #103673 from dagiro/ts_hdfs4
ts_hdfs4
2 parents e26e8e9 + c85dd44 commit 770bc2f

File tree

2 files changed

+67
-0
lines changed

2 files changed

+67
-0
lines changed

articles/hdinsight/TOC.yml

Lines changed: 2 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -396,6 +396,8 @@
396396
href: ./spark/apache-spark-troubleshoot-event-log-requestbodytoolarge.md
397397
- name: Debug Apache Spark jobs
398398
href: ./spark/apache-spark-job-debugging.md
399+
- name: Debug WASB file operations
400+
href: ./spark/troubleshoot-debug-wasb.md
399401
- name: Use IntelliJ to debug Apache Spark job
400402
href: ./spark/apache-spark-intellij-tool-debug-remotely-through-ssh.md
401403
- name: Apache Spark troubleshooting
Lines changed: 65 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -0,0 +1,65 @@
1+
---
2+
title: Debug WASB file operations in Azure HDInsight
3+
description: Describes troubleshooting steps and possible resolutions for issues when interacting with Azure HDInsight clusters.
4+
author: hrasheed-msft
5+
ms.author: hrasheed
6+
ms.reviewer: jasonh
7+
ms.service: hdinsight
8+
ms.topic: troubleshooting
9+
ms.date: 02/07/2020
10+
---
11+
12+
# Debug WASB file operations in Azure HDInsight
13+
14+
There are times when you may want to understand what operations the WASB driver started with Azure Storage. For the client side, the WASB driver produces logs for each file system operation at **DEBUG** level. WASB driver uses log4j to control logging level and the default is **INFO** level. For Azure Storage server-side analytics logs, see [Azure Storage analytics logging](../../storage/common/storage-analytics-logging.md).
15+
16+
A produced log will look similar to:
17+
18+
```log
19+
18/05/13 04:15:55 DEBUG NativeAzureFileSystem: Moving wasb://[email protected]/user/livy/ulysses.txt/_temporary/0/_temporary/attempt_20180513041552_0000_m_000000_0/part-00000 to wasb://[email protected]/user/livy/ulysses.txt/part-00000
20+
```
21+
22+
## Turn on WASB debug log for file operations
23+
24+
1. From a web browser, navigate to `https://CLUSTERNAME.azurehdinsight.net`, where `CLUSTERNAME` is the name of your Spark cluster.
25+
26+
1. Navigate to **Spark2** > **Configs** > **advanced spark2-log4j-properties**.
27+
28+
1. Modify `log4j.appender.console.Threshold=INFO` to `log4j.appender.console.Threshold=DEBUG`.
29+
30+
1. Navigate to **Advanced livy2-log4j-properties**.
31+
32+
1. Add the following property:
33+
34+
```
35+
log4j.logger.org.apache.hadoop.fs.azure.NativeAzureFileSystem=DEBUG
36+
```
37+
38+
1. Save changes.
39+
40+
## Additional logging
41+
42+
The above logs should provide high-level understanding of the file system operations. If the above logs are still not providing useful information, or if you want to investigate blob storage api calls, add `fs.azure.storage.client.logging=true` to the `core-site`. This setting will enable the java sdk logs for wasb storage driver and will print each call to blob storage server. Remove the setting after investigations because it could fill up the disk quickly and could slow down the process.
43+
44+
If the backend is Azure Data Lake based, then use the following log4j setting for the component(for example, spark/tez/hdfs):
45+
46+
```
47+
log4j.logger.com.microsoft.azure.datalake.store=ALL,adlsFile
48+
log4j.additivity.com.microsoft.azure.datalake.store=true
49+
log4j.appender.adlsFile=org.apache.log4j.FileAppender
50+
log4j.appender.adlsFile.File=/var/log/adl/adl.log
51+
log4j.appender.adlsFile.layout=org.apache.log4j.PatternLayout
52+
log4j.appender.adlsFile.layout.ConversionPattern=%p\t%d{ISO8601}\t%r\t%c\t[%t]\t%m%n
53+
```
54+
55+
Look for the logs in `/var/log/adl/adl.log` for the logs.
56+
57+
## Next steps
58+
59+
If you didn't see your problem or are unable to solve your issue, visit one of the following channels for more support:
60+
61+
* Get answers from Azure experts through [Azure Community Support](https://azure.microsoft.com/support/community/).
62+
63+
* Connect with [@AzureSupport](https://twitter.com/azuresupport) - the official Microsoft Azure account for improving customer experience. Connecting the Azure community to the right resources: answers, support, and experts.
64+
65+
* If you need more help, you can submit a support request from the [Azure portal](https://portal.azure.com/?#blade/Microsoft_Azure_Support/HelpAndSupportBlade/). Select **Support** from the menu bar or open the **Help + support** hub. For more detailed information, review [How to create an Azure support request](https://docs.microsoft.com/azure/azure-supportability/how-to-create-azure-support-request). Access to Subscription Management and billing support is included with your Microsoft Azure subscription, and Technical Support is provided through one of the [Azure Support Plans](https://azure.microsoft.com/support/plans/).

0 commit comments

Comments
 (0)