
Commit 95d7ada

Link Spark docs to Spark JupyterLab Space (#1403)
* link to spark jupyterlab space
* Update datasets-spark.md
1 parent 1517a74 commit 95d7ada

1 file changed (+9 -1 lines)


docs/hub/datasets-spark.md

Lines changed: 9 additions & 1 deletion
@@ -211,7 +211,7 @@ To filter the dataset and only keep dialogues in Chinese:
 ```python
 >>> criteria = [("langdetect", "=", "zh-cn")]
 >>> df_chinese_only = read_parquet("hf://datasets/BAAI/Infinity-Instruct/7M/*.parquet", filters=criteria)
->>> df_chinese_only
+>>> df_chinese_only.show()
 +---+----------------------------+-----+----------+----------+
 | id|               conversations|label|langdetect|    source|
 +---+----------------------------+-----+----------+----------+
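
For context on the change above: evaluating a bare DataFrame expression only prints its schema repr, while `df.show()` actually runs the query and renders rows, which matches the ASCII table output in the docs. A minimal sketch of the same filter-and-show pattern in plain PySpark; the local Parquet path below is illustrative, while the docs themselves go through their `read_parquet` helper with an `hf://` URL:

```python
from pyspark.sql import SparkSession

# Start (or reuse) a local Spark session.
spark = SparkSession.builder.appName("filter-example").getOrCreate()

# Illustrative local path; the docs read hf://datasets/... via their read_parquet helper.
df = spark.read.parquet("/tmp/infinity_instruct")

# Keep only the rows whose detected language is Chinese.
df_chinese_only = df.filter(df.langdetect == "zh-cn")

# A bare `df_chinese_only` only prints the schema repr; .show() executes the
# query and prints the first 20 rows as an ASCII table.
df_chinese_only.show()
```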
@@ -357,3 +357,11 @@ tmpmj97ab30.parquet: 100%|██████████| 71.3M/71.3M [00:02<00:
 <img class="block dark:hidden" src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/datasets-spark-infinity-instruct-chinese-only-min.png"/>
 <img class="hidden dark:block" src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/datasets-spark-infinity-instruct-chinese-only-dark-min.png"/>
 </div>
+
+## Run in JupyterLab on Hugging Face Spaces
+
+You can duplicate the [Spark on HF JupyterLab](https://huggingface.co/spaces/lhoestq/Spark-on-HF-JupyterLab) Space to get a Notebook with PySpark and those helper functions pre-installed.
+
+Click on "Duplicate Space", choose a name for your Space, select your hardware and you are ready:
+
+<img src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/spark-on-hf-jupyterlab-screenshot-min.png">
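
Beyond the "Duplicate Space" button described in the added section, the duplication can also be scripted. A hedged sketch, assuming a recent `huggingface_hub` release that exposes `duplicate_space` and a write token already configured via `huggingface-cli login`; the target Space name and hardware tier below are illustrative choices, not part of the commit:

```python
from huggingface_hub import duplicate_space

# Duplicate the Spark JupyterLab Space into your own namespace.
new_space = duplicate_space(
    from_id="lhoestq/Spark-on-HF-JupyterLab",
    to_id="my-username/spark-jupyterlab",  # hypothetical target name
    hardware="cpu-upgrade",                # pick whatever hardware tier you need
    private=True,
)
print(new_space)  # URL of the newly created Space
```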

0 commit comments