
Commit 2707118

Merge pull request #261203 from v-akarnase/patch-21
Update apache-spark-zeppelin-notebook.md
2 parents 483bb3c + 932ce21 commit 2707118


1 file changed

+26
-3
lines changed


articles/hdinsight/spark/apache-spark-zeppelin-notebook.md

Lines changed: 26 additions & 3 deletions
@@ -4,7 +4,7 @@ description: Step-by-step instructions on how to use Zeppelin notebooks with Apa
44
ms.service: hdinsight
55
ms.topic: how-to
66
ms.custom: hdinsightactive,seoapr2020
7-
ms.date: 05/26/2023
7+
ms.date: 12/14/2023
88
---
99

1010
# Use Apache Zeppelin notebooks with Apache Spark cluster on Azure HDInsight
@@ -81,7 +81,7 @@ HDInsight Spark clusters include [Apache Zeppelin](https://zeppelin.apache.org/)
8181

8282
The **%sql** statement at the beginning tells the notebook to use the Livy Scala interpreter.
8383

84-
6. Select the **Bar Chart** icon to change the display. **settings**, appear after you have selected **Bar Chart**, allows you to choose **Keys**, and **Values**. The following screenshot shows the output.
84+
6. Select the **Bar Chart** icon to change the display. **Settings**, which appears after you select **Bar Chart**, lets you choose **Keys** and **Values**. The following screenshot shows the output.
8585

8686
:::image type="content" source="./media/apache-spark-zeppelin-notebook/hdinsight-zeppelin-spark-query-1.png " alt-text="Run a Spark SQL statement using the notebook1" border="true":::
8787

@@ -145,6 +145,29 @@ The Zeppelin notebooks are saved to the cluster headnodes. So, if you delete the
145145

146146
This action saves the notebook as a JSON file in your download location.
147147

148+
> [!NOTE]
149+
> * In HDI 4.0, the Zeppelin notebook directory path is `/usr/hdp/<version>/zeppelin/notebook/<notebook_session_id>/`
150+
>
151+
> For example, `/usr/hdp/4.1.17.10/zeppelin/2JMC9BZ8X/`
152+
>
153+
> Whereas in HDI 5.0 and above, the path is `/usr/hdp/<version>/zeppelin/notebook/<Kernel_name>/`
154+
>
155+
> For example, `/usr/hdp/5.1.4.5/zeppelin/notebook/Scala/`
156+
> * The name of the stored file is different in HDI 5.0.
157+
> It is stored as `<notebook_name>_<sessionid>.zpln`
158+
>
159+
> For example, `testzeppelin_2JJK53XQA.zpln`
160+
>
161+
> In HDI 4.0, the file is named `note.json` and is stored under the session ID directory.
162+
>
163+
> For example, `/2JMC9BZ8X/note.json`
164+
>
165+
> * HDI Zeppelin always saves the notebook under `/usr/hdp/<version>/zeppelin/notebook/` on the local disk of hn0.
166+
>
167+
> If you want the notebook to remain available after the cluster is deleted, you can use Azure file storage (over the SMB protocol) and link it to the local path. For more information, see [Mount SMB Azure file share on Linux](/azure/storage/files/storage-how-to-use-files-linux).
168+
>
169+
> After mounting the share, set the Zeppelin configuration property `zeppelin.notebook.dir` to the mounted path in the Ambari UI.
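
The path and file-name conventions in the note above can be summarized in a small helper. This is only an illustrative sketch, not part of HDInsight or Zeppelin: the function name `notebook_file_path` and its parameters are hypothetical, and it assumes the path patterns hold exactly as the note states them (including the `notebook/` segment for HDI 4.0).

```python
from pathlib import PurePosixPath
from typing import Optional


def notebook_file_path(
    hdp_version: str,
    hdi_major: int,
    session_id: str,
    notebook_name: Optional[str] = None,
    kernel_name: Optional[str] = None,
) -> str:
    """Build the expected on-disk location of a Zeppelin notebook.

    Hypothetical helper: it only encodes the patterns described in the note.
    """
    # Common root: /usr/hdp/<version>/zeppelin/notebook/
    base = PurePosixPath("/usr/hdp") / hdp_version / "zeppelin" / "notebook"

    if hdi_major >= 5:
        # HDI 5.0 and above: <Kernel_name>/<notebook_name>_<sessionid>.zpln
        if not notebook_name or not kernel_name:
            raise ValueError("HDI 5.0+ needs notebook_name and kernel_name")
        return str(base / kernel_name / f"{notebook_name}_{session_id}.zpln")

    # HDI 4.0: <notebook_session_id>/note.json
    return str(base / session_id / "note.json")


if __name__ == "__main__":
    print(notebook_file_path("4.1.17.10", 4, "2JMC9BZ8X"))
    print(notebook_file_path("5.1.4.5", 5, "2JJK53XQA",
                             notebook_name="testzeppelin", kernel_name="Scala"))
```

A helper like this could be useful when scripting a backup of notebooks from hn0 before deleting a cluster, because the file to copy depends on the HDI version.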
170+
148171
## Use `Shiro` to Configure Access to Zeppelin Interpreters in Enterprise Security Package (ESP) Clusters
149172

150173
As noted above, the `%sh` interpreter isn't supported from HDInsight 4.0 onwards. Furthermore, since the `%sh` interpreter introduces potential security issues, such as accessing keytabs using shell commands, it has been removed from HDInsight 3.6 ESP clusters as well. This means the `%sh` interpreter isn't available when clicking **Create new note** or in the Interpreter UI by default.
@@ -216,7 +239,7 @@ To validate the service from a command line, SSH to the head node. Switch user t
216239
|---|---|
217240
|zeppelin-server|/usr/hdp/current/zeppelin-server/|
218241
|Server Logs|/var/log/zeppelin|
219-
|Configuration Interpreter, `Shiro`, site.xml, log4j|/usr/hdp/current/zeppelin-server/conf or /etc/zeppelin/conf|
242+
|Configuration Interpreter, `Shiro`, site.xml, `log4j`|/usr/hdp/current/zeppelin-server/conf or /etc/zeppelin/conf|
220243
|PID directory|/var/run/zeppelin|
221244
222245
### Enable debug logging
