From d26f477c5b7a4ff8582a67cd8bf98aae454c3e17 Mon Sep 17 00:00:00 2001 From: Albert Thomas Date: Tue, 13 May 2025 13:16:11 +0200 Subject: [PATCH 1/3] add cli instructions to download datasets --- docs/hub/datasets-downloading.md | 7 ++++++- 1 file changed, 6 insertions(+), 1 deletion(-) diff --git a/docs/hub/datasets-downloading.md b/docs/hub/datasets-downloading.md index 97b29064e..dbd6e67e8 100644 --- a/docs/hub/datasets-downloading.md +++ b/docs/hub/datasets-downloading.md @@ -16,8 +16,13 @@ If a dataset on the Hub is tied to a [supported library](./datasets-libraries), ## Using the Hugging Face Client Library -You can use the [`huggingface_hub`](/docs/huggingface_hub) library to create, delete, update and retrieve information from repos. You can also download files from repos or integrate them into your library! For example, you can quickly load a CSV dataset with a few lines using Pandas. +You can use the [`huggingface_hub`](/docs/huggingface_hub) library to create, delete, update and retrieve information from repos. For example, to download the `HuggingFaceH4/ultrachat_200k` dataset from the command line, run +```bash +huggingface-cli download HuggingFaceH4/ultrachat_200k --repo-type dataset +``` +See the [huggingface-cli download documentation](https://huggingface.co/docs/huggingface_hub/en/guides/cli#download-a-dataset-or-a-space) for more information. +You can also integrate this into your own library! For example, you can quickly load a CSV dataset with a few lines using Pandas. ```py from huggingface_hub import hf_hub_download import pandas as pd From 34fa6618177d7b932da6ba660d644e28f14921c5 Mon Sep 17 00:00:00 2001 From: Lucain Date: Tue, 13 May 2025 18:21:31 +0200 Subject: [PATCH 2/3] Update docs/hub/datasets-downloading.md --- docs/hub/datasets-downloading.md | 3 +-- 1 file changed, 1 insertion(+), 2 deletions(-) diff --git a/docs/hub/datasets-downloading.md b/docs/hub/datasets-downloading.md index dbd6e67e8..202c87424 100644 --- a/docs/hub/datasets-downloading.md +++ b/docs/hub/datasets-downloading.md @@ -17,10 +17,9 @@ If a dataset on the Hub is tied to a [supported library](./datasets-libraries), ## Using the Hugging Face Client Library You can use the [`huggingface_hub`](/docs/huggingface_hub) library to create, delete, update and retrieve information from repos. For example, to download the `HuggingFaceH4/ultrachat_200k` dataset from the command line, run + ```bash huggingface-cli download HuggingFaceH4/ultrachat_200k --repo-type dataset -``` -See the [huggingface-cli download documentation](https://huggingface.co/docs/huggingface_hub/en/guides/cli#download-a-dataset-or-a-space) for more information. You can also integrate this into your own library! For example, you can quickly load a CSV dataset with a few lines using Pandas. ```py From 0b38d54f57f0f3458b33aee703bf695d786da78e Mon Sep 17 00:00:00 2001 From: Lucain Date: Tue, 13 May 2025 18:23:12 +0200 Subject: [PATCH 3/3] Sorry about that! --- docs/hub/datasets-downloading.md | 3 +++ 1 file changed, 3 insertions(+) diff --git a/docs/hub/datasets-downloading.md b/docs/hub/datasets-downloading.md index 202c87424..3e6d52caf 100644 --- a/docs/hub/datasets-downloading.md +++ b/docs/hub/datasets-downloading.md @@ -20,6 +20,9 @@ You can use the [`huggingface_hub`](/docs/huggingface_hub) library to create, de ```bash huggingface-cli download HuggingFaceH4/ultrachat_200k --repo-type dataset +``` + +See the [huggingface-cli download documentation](https://huggingface.co/docs/huggingface_hub/en/guides/cli#download-a-dataset-or-a-space) for more information. You can also integrate this into your own library! For example, you can quickly load a CSV dataset with a few lines using Pandas. ```py