You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: docs/hub/datasets-dask.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -93,7 +93,7 @@ the `meta` argument to know the type of the new column in the meantime.
93
93
94
94
When reading Parquet data from Hugging Face, Dask automatically leverages the metadata in Parquet files to skip entire files or row groups if they are not needed. For example if you apply a filter (predicate) on a Hugging Face Dataset in Parquet format or if you select a subset of the columns (projection), Dask will read the metadata of the Parquet files to discard the parts that are not needed without downloading them.
95
95
96
-
This is possible thanks to a [reimplmentation of the Dask DataFrame API](https://docs.coiled.io/blog/dask-dataframe-is-fast.html?utm_source=hf-docs) to support query optimization, which makes Dask faster and more robust.
96
+
This is possible thanks to a [reimplementation of the Dask DataFrame API](https://docs.coiled.io/blog/dask-dataframe-is-fast.html?utm_source=hf-docs) to support query optimization, which makes Dask faster and more robust.
97
97
98
98
For example this subset of FineWeb-Edu contains many Parquet files. If you can filter the dataset to keep the text from recent CC dumps, Dask will skip most of the files and only download the data that match the filter:
Copy file name to clipboardExpand all lines: docs/hub/security-protectai.md
+1-1Lines changed: 1 addition & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -11,7 +11,7 @@ Interested in joining our security partnership / providing scanning information
11
11
12
12
We partnered with Protect AI to provide scanning in order to make the Hub safer. The same way files are scanned by our internal scanning system, public repositories' files are scanned by Guardian.
13
13
14
-
Our frontend has been redesigned specifically for this purpose, in order to accomodate for new scanners:
14
+
Our frontend has been redesigned specifically for this purpose, in order to accommodate for new scanners:
This markdown file has been generated from a script. Please do not edit it directly.
5
+
6
+
### Template
7
+
8
+
If you want to update the content related to featherless-ai's description, please edit the template file under `https://github.com/huggingface/hub-docs/tree/main/scripts/inference-providers/templates/providers/featherless-ai.handlebars`.
9
+
10
+
### Logos
11
+
12
+
If you want to update featherless-ai's logo, upload a file by opening a PR on https://huggingface.co/datasets/huggingface/documentation-images/tree/main/inference-providers/logos. Ping @wauplin and @celinah on the PR to let them know you uploaded a new logo.
13
+
Logos must be in .png format and be named `featherless-ai-light.png` and `featherless-ai-dark.png`. Visit https://huggingface.co/settings/theme to switch between light and dark mode and check that the logos are displayed correctly.
14
+
15
+
### Generation script
16
+
17
+
For more details, check out the `generate.ts` script: https://github.com/huggingface/hub-docs/blob/main/scripts/inference-providers/scripts/generate.ts.
[Featherless AI](https://featherless.ai) is a serverless AI inference platform that offers access to thousands of open-source models.
37
+
38
+
Our goal is to make all AI models available for serverless inference. We provide inference via API to a continually expanding library of open-weight models.
39
+
40
+
## Supported tasks
41
+
42
+
43
+
### Chat Completion (LLM)
44
+
45
+
Find out more about Chat Completion (LLM) [here](../tasks/chat-completion).
0 commit comments