huggingface · rajatarya · Sep 25, 2025 · Sep 25, 2025 · Sep 26, 2025 · Sep 26, 2025
diff --git a/.github/workflows/xet_build_documentation.yml b/.github/workflows/xet_build_documentation.yml
@@ -0,0 +1,20 @@
+name: Build Xet documentation
+
+on:
+  push:
+    paths:
+      - "docs/xet/**"
+    branches:
+      - main
+
+jobs:
+   build:
+    uses: huggingface/doc-builder/.github/workflows/build_main_documentation.yml@main
+    with:
+      commit_sha: ${{ github.sha }}
+      package: hub-docs
+      package_name: xet
+      path_to_docs: hub-docs/docs/xet/
+      additional_args: --not_python_module
+    secrets:
+      hf_token: ${{ secrets.HF_DOC_BUILD_PUSH }}
diff --git a/.github/workflows/xet_build_pr_documentation.yml b/.github/workflows/xet_build_pr_documentation.yml
@@ -0,0 +1,21 @@
+name: Build Xet PR Documentation
+
+on:
+  pull_request:
+    paths:
+      - "docs/xet/**"
+
+concurrency:
+  group: ${{ github.workflow }}-${{ github.head_ref || github.run_id }}
+  cancel-in-progress: true
+
+jobs:
+  build:
+    uses: huggingface/doc-builder/.github/workflows/build_pr_documentation.yml@main
+    with:
+      commit_sha: ${{ github.event.pull_request.head.sha }}
+      pr_number: ${{ github.event.number }}
+      package: hub-docs
+      package_name: xet
+      path_to_docs: hub-docs/docs/xet/
+      additional_args: --not_python_module
diff --git a/.github/workflows/xet_upload_pr_documentation.yml b/.github/workflows/xet_upload_pr_documentation.yml
@@ -0,0 +1,16 @@
+name: Upload Xet PR Documentation
+
+on:
+  workflow_run:
+    workflows: ["Build Xet PR Documentation"]
+    types:
+      - completed
+
+jobs:
+  build:
+    uses: huggingface/doc-builder/.github/workflows/upload_pr_documentation.yml@main
+    with:
+      package_name: xet
+    secrets:
+      hf_token: ${{ secrets.HF_DOC_BUILD_PUSH }}
+      comment_bot_token: ${{ secrets.COMMENT_BOT_TOKEN }}
diff --git a/docs/hub/_redirects.yml b/docs/hub/_redirects.yml
@@ -18,4 +18,4 @@ api-webhook: webhooks
 adapter-transformers: adapters
 security-two-fa: security-2fa
 repositories-recommendations: storage-limits
-xet: storage-backends#xet
+xet: xet/index
diff --git a/docs/hub/_toctree.yml b/docs/hub/_toctree.yml
@@ -47,8 +47,19 @@
     title: Repository Settings
   - local: storage-limits
     title: Storage Limits
-  - local: storage-backends
-    title: Storage Backends
+  - local: xet/index
+    title: Storage Backend (Xet)
+    sections:
+    - local: xet/overview
+      title: Xet History & Overview
+    - local: xet/using-xet-storage
+      title: Using Xet Storage
+    - local: xet/deduplication
+      title: Deduplication
+    - local: xet/legacy-git-lfs
+      title: Backwards Compatibility & Legacy
+    - local: xet/security
+      title: Security
   - local: repositories-pull-requests-discussions
     title: Pull Requests & Discussions
   - local: notifications

diff --git a/docs/hub/gguf.md b/docs/hub/gguf.md
@@ -82,8 +82,8 @@ Find more information [here](https://github.com/huggingface/huggingface.js/tree/
 | Q4_0 | [GH](https://github.com/huggingface/huggingface.js/pull/615#discussion_r1557654249) | 4-bit round-to-nearest quantization (`q`). Each block has 32 weights. Weight formula: `w = q * block_scale`. Legacy quantization method (not used widely as of today). |
 | Q4_1 | [GH](https://github.com/huggingface/huggingface.js/pull/615#discussion_r1557682290) | 4-bit round-to-nearest quantization (`q`). Each block has 32 weights. Weight formula: `w = q * block_scale + block_minimum`. Legacy quantization method (not used widely as of today). |
 | Q4_K | [GH](https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305) | 4-bit quantization (`q`). Super-blocks with 8 blocks, each block has 32 weights. Weight formula: `w = q * block_scale(6-bit) + block_min(6-bit)`, resulting in 4.5 bits-per-weight. |
-| Q3_K | [GH](https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305) | 3-bit quantization (`q`). Super-blocks with 16 blocks, each block has 16 weights. Weight formula: `w = q * block_scale(6-bit)`, resulting. 3.4375 bits-per-weight. |
-| Q2_K | [GH](https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305) | 2-bit quantization (`q`). Super-blocks with 16 blocks, each block has 16 weight. Weight formula: `w = q * block_scale(4-bit) + block_min(4-bit)`, resulting in 2.625 bits-per-weight. |
+| Q3_K | [GH](https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305) | 3-bit quantization (`q`). Super-blocks with 16 blocks, each block has 16 weights. Weight formula: `w = q * block_scale(6-bit)`, resulting in 3.4375 bits-per-weight. |
+| Q2_K | [GH](https://github.com/ggerganov/llama.cpp/pull/1684#issue-1739619305) | 2-bit quantization (`q`). Super-blocks with 16 blocks, each block has 16 weights. Weight formula: `w = q * block_scale(4-bit) + block_min(4-bit)`, resulting in 2.625 bits-per-weight. |
 | IQ4_NL | [GH](https://github.com/ggerganov/llama.cpp/pull/5590) | 4-bit quantization (`q`). Super-blocks with 256 weights. Weight `w` is obtained using `super_block_scale` & `importance matrix`. |
 | IQ4_XS | [HF](https://huggingface.co/CISCai/OpenCodeInterpreter-DS-6.7B-SOTA-GGUF/blob/main/README.md?code=true#L59-L70) | 4-bit quantization (`q`). Super-blocks with 256 weights. Weight `w` is obtained using `super_block_scale` & `importance matrix`, resulting in 4.25 bits-per-weight. |
 | IQ3_S | [HF](https://huggingface.co/CISCai/OpenCodeInterpreter-DS-6.7B-SOTA-GGUF/blob/main/README.md?code=true#L59-L70) | 3-bit quantization (`q`). Super-blocks with 256 weights. Weight `w` is obtained using `super_block_scale` & `importance matrix`, resulting in 3.44 bits-per-weight. |

diff --git a/docs/hub/index.md b/docs/hub/index.md
@@ -27,7 +27,7 @@ The Hugging Face Hub is a platform with over 1.7M models, 400k datasets, and 600
 <a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./repositories-getting-started">Getting Started</a>
 <a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./repositories-settings">Repository Settings</a>
 <a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./storage-limits">Storage Limits</a>
-<a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./storage-backends">Storage Backends</a>
+<a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./xet/index">Storage Backend (Xet)</a>
 <a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./repositories-pull-requests-discussions">Pull requests and Discussions</a>
 <a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./notifications">Notifications</a>
 <a class="transform no-underline! transition-colors hover:translate-x-px hover:text-gray-700" href="./collections">Collections</a>
@@ -105,7 +105,7 @@ The Hugging Face Hub is a platform with over 1.7M models, 400k datasets, and 600
 
 We are helping the community work together towards the goal of advancing Machine Learning 🔥.
 
-The Hugging Face Hub is a platform with over 1.7M models, 400k datasets, and 600k demos in which people can easily collaborate in their ML workflows. The Hub works as a central place where anyone can share, explore, discover, and experiment with open-source Machine Learning.
+The Hugging Face Hub is a platform with over 2M models, 500k datasets, and 600k demos in which people can easily collaborate in their ML workflows. The Hub works as a central place where anyone can share, explore, discover, and experiment with open-source Machine Learning.
 
 No single company, including the Tech Titans, will be able to “solve AI” by themselves – the only way we'll achieve this is by sharing knowledge and resources in a community-centric approach. We are building the largest open-source collection of models, datasets, and demos on the Hugging Face Hub to democratize and advance ML for everyone 🚀.
 
@@ -122,7 +122,7 @@ On it, you'll be able to upload and discover...
 - Spaces: _interactive apps for demonstrating ML models directly in your browser_
 
 The Hub offers **versioning, commit history, diffs, branches, and over a dozen library integrations**! 
-All repositories build on [Xet](https://huggingface.co/join/xet), a new technology to efficiently store Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads.
+All repositories build on [Xet](./xet/index), a new technology to efficiently store Large Files inside Git, intelligently splitting files into unique chunks and accelerating uploads and downloads.
 
 You can learn more about the features that all repositories share in the [**Repositories documentation**](./repositories).
 

diff --git a/docs/hub/rate-limits.md b/docs/hub/rate-limits.md
@@ -64,9 +64,9 @@ Here are the current rate limits (in September '25) based on your plan:
 | Anonymous user (per IP address)                                           | 500 \*   | 3,000 \*  | 100 \* |
 | Free user                                                                 | 1,000 \* | 5,000 \*  | 200 \* |
 | PRO user                                                                  | 2,500    | 12,000    | 400    |
-| Team organization                                                         | 3,000    | 15,000    | 400    |
-| Enterprise organization                                                   | 6,000    | 30,000    | 600    |
-| Enterprise Plus organization                                              | 10,000   | 50,000    | 1,000  |
+| Team organization                                                         | 3,000    | 20,000    | 400    |
+| Enterprise organization                                                   | 6,000    | 50,000    | 600    |
+| Enterprise Plus organization                                              | 10,000   | 100,000   | 1,000  |
 | Enterprise Plus organization <br> When Organization IP Ranges are defined | 100,000  | 500,000   | 10,000 |
 | Academia Hub organization                                                 | 2,500    | 12,000    | 400    |
 

diff --git a/docs/hub/repositories.md b/docs/hub/repositories.md
@@ -3,7 +3,7 @@
 Models, Spaces, and Datasets are hosted on the Hugging Face Hub as [Git repositories](https://git-scm.com/about), which means that version control and collaboration are core elements of the Hub. In a nutshell, a repository (also known as a **repo**) is a place where code and assets can be stored to back up your work, share it with the community, and work in a team. 
 
 Unlike other collaboration platforms, our Git repositories are optimized for Machine Learning and AI files – large binary files, usually in specific file formats like Parquet and Safetensors, and up to [Terabyte-scale sizes](https://huggingface.co/blog/from-files-to-chunks)!
-To achieve this, we built [Xet](./storage-backends), a modern custom storage system built specifically for AI/ML development, enabling chunk-level deduplication, smaller uploads, and faster downloads.
+To achieve this, we built [Xet](./xet/index), a modern custom storage system built specifically for AI/ML development, enabling chunk-level deduplication, smaller uploads, and faster downloads.
 
 <div class="flex justify-center">
 <img class="block dark:hidden" src="https://huggingface.co/datasets/huggingface/documentation-images/resolve/main/hub/xet-speed.gif"/>
@@ -17,7 +17,7 @@ In these pages, you will go over the basics of getting started with Git and Xet
 - [Getting Started with Repositories](./repositories-getting-started)
 - [Settings](./repositories-settings)
 - [Storage Limits](./storage-limits)
-- [Storage Backends](./storage-backends)
+- [Storage Backend (Xet)](./xet/index)
 - [Pull Requests & Discussions](./repositories-pull-requests-discussions)
   - [Pull Requests advanced usage](./repositories-pull-requests-discussions#pull-requests-advanced-usage)
 - [Collections](./collections)