Commit 397ad04

Merge branch 'update-inference-providers-docs-automated-pr' of github.com:huggingface/hub-docs into update-inference-providers-docs-automated-pr
2 parents: c1085cf + d7b8cd4

33 files changed: +445 -29 lines

docs/hub/_toctree.yml

Lines changed: 2 additions & 0 deletions
@@ -398,6 +398,8 @@
      title: "Protect AI"
    - local: security-jfrog
      title: "JFrog"
+   - local: agents
+     title: Agents on Hub
    - local: moderation
      title: Moderation
    - local: paper-pages

docs/hub/agents.md

Lines changed: 113 additions & 0 deletions
@@ -0,0 +1,113 @@
# Agents on the Hub

This page compiles all the libraries and tools Hugging Face offers for agentic workflows: huggingface.js mcp-client, Gradio MCP Server and smolagents.

## smolagents

[smolagents](https://github.com/huggingface/smolagents) is a lightweight library that covers all agentic use cases, from code-writing agents to computer use, in a few lines of code. It is model-agnostic, supporting local models served with Hugging Face Transformers as well as models available through [Inference Providers](../inference-providers/index.md) and proprietary model providers.

It offers a unique kind of agent: `CodeAgent`, an agent that writes its actions in Python code.
It also supports `ToolCallingAgent`, the standard kind of agent that writes its actions as JSON blobs, as most other agentic frameworks do.
To learn more about writing actions in code vs. JSON, check out our [new short course on DeepLearning.AI](https://www.deeplearning.ai/short-courses/building-code-agents-with-hugging-face-smolagents/).
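
For example, a minimal `CodeAgent` defined directly in Python might look like this (a sketch; the model id and the `add_base_tools` flag are illustrative choices, not requirements):

```python
from smolagents import CodeAgent, InferenceClientModel

# InferenceClientModel serves the model through Inference Providers; any smolagents-compatible model works.
model = InferenceClientModel(model_id="Qwen/Qwen2.5-Coder-32B-Instruct")

# add_base_tools=True equips the agent with the default toolbox (web search, etc.).
agent = CodeAgent(tools=[], model=model, add_base_tools=True)

agent.run("Plan a trip to Tokyo, Kyoto and Osaka between Mar 28 and Apr 7.")
```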

If you want to avoid defining agents yourself, the easiest way to start an agent is through the CLI, using the `smolagent` command.

```bash
smolagent "Plan a trip to Tokyo, Kyoto and Osaka between Mar 28 and Apr 7." \
--model-type "InferenceClientModel" \
--model-id "Qwen/Qwen2.5-Coder-32B-Instruct" \
--imports "pandas numpy" \
--tools "web_search"
```

Agents can be pushed to the Hugging Face Hub as Spaces. Check out all the cool agents people have built [here](https://huggingface.co/spaces?filter=smolagents&sort=likes).
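
For instance, sharing an agent you have defined is a short call (a sketch; the repo id is a placeholder, and `agent.push_to_hub` assumes the agent object from the sketch above and a recent smolagents version):

```python
# Push the agent's configuration and tools to a repo on the Hub (placeholder repo id).
agent.push_to_hub("username/my-agent")
```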

smolagents also supports MCP servers as tools, as follows:

```python
# pip install --upgrade smolagents mcp
from smolagents import MCPClient, CodeAgent, InferenceClientModel
from mcp import StdioServerParameters
import os

# Any smolagents model works here; InferenceClientModel uses Inference Providers by default.
model = InferenceClientModel()

server_parameters = StdioServerParameters(
    command="uvx",  # Using uvx ensures dependencies are available
    args=["--quiet", "pubmedmcp@0.1.3"],
    env={"UV_PYTHON": "3.12", **os.environ},
)

with MCPClient(server_parameters) as tools:
    agent = CodeAgent(tools=tools, model=model, add_base_tools=True)
    agent.run("Please find the latest research on COVID-19 treatment.")
```

Learn more [in the documentation](https://huggingface.co/docs/smolagents/tutorials/tools#use-mcp-tools-with-mcpclient-directly).

## huggingface.js mcp-client

Huggingface.js offers an MCP client that can be served with [Inference Providers](https://huggingface.co/docs/inference-providers/en/index) or local LLMs. Getting started with it is as simple as running `pnpm agent`. You can plug and play different models and providers by setting the `PROVIDER` and `MODEL_ID` environment variables.

```bash
export HF_TOKEN="hf_..."
export MODEL_ID="Qwen/Qwen2.5-72B-Instruct"
export PROVIDER="nebius"
npx @huggingface/mcp-client
```

Alternatively, you can use any local LLM (for example, one served via LM Studio):

```bash
ENDPOINT_URL=http://localhost:1234/v1 \
MODEL_ID=lmstudio-community/Qwen3-14B-GGUF \
npx @huggingface/mcp-client
```

You can get more information about the mcp-client [here](https://huggingface.co/docs/huggingface.js/en/mcp-client/README).

## Gradio MCP Server / Tools

You can build an MCP server in just a few lines of Python with Gradio. If you have an existing Gradio app or Space you'd like to use as an MCP server / tool, it's just a single-line change.

To make a Gradio application an MCP server, simply pass in `mcp_server=True` when launching your demo, as follows.

```python
# pip install gradio

import gradio as gr

def generate_image(prompt: str):
    """
    Generate an image based on a text prompt

    Args:
        prompt: a text string describing the image to generate
    """
    pass

demo = gr.Interface(
    fn=generate_image,
    inputs="text",
    outputs="image",
    title="Image Generator"
)

demo.launch(mcp_server=True)
```

The MCP server will be available at `http://your-space-id.hf.space/gradio_api/mcp/sse`, where your application is served. It will have a tool corresponding to each function in your Gradio app, with the tool description automatically generated from the docstrings of your functions.

Lastly, add this to the settings of the MCP client of your choice (e.g. Cursor).

```json
{
  "mcpServers": {
    "gradio": {
      "url": "http://your-server:port/gradio_api/mcp/sse"
    }
  }
}
```

This is very powerful because it lets the LLM use any Gradio application as a tool. You can find thousands of them on [Spaces](https://huggingface.co/spaces). Learn more [here](https://www.gradio.app/guides/building-mcp-server-with-gradio).

docs/hub/datasets-adding.md

Lines changed: 1 addition & 0 deletions
@@ -85,6 +85,7 @@ The Hub natively supports multiple file formats:
- Text (.txt)
- Images (.png, .jpg, etc.)
- Audio (.wav, .mp3, etc.)
+ - PDF (.pdf)
- [WebDataset](https://github.com/webdataset/webdataset) (.tar)

It supports files compressed using ZIP (.zip), GZIP (.gz), ZSTD (.zst), BZ2 (.bz2), LZ4 (.lz4) and LZMA (.xz).

docs/hub/datasets-downloading.md

Lines changed: 8 additions & 1 deletion
@@ -16,8 +16,15 @@ If a dataset on the Hub is tied to a [supported library](./datasets-libraries),

## Using the Hugging Face Client Library

- You can use the [`huggingface_hub`](/docs/huggingface_hub) library to create, delete, update and retrieve information from repos. You can also download files from repos or integrate them into your library! For example, you can quickly load a CSV dataset with a few lines using Pandas.
+ You can use the [`huggingface_hub`](/docs/huggingface_hub) library to create, delete, update and retrieve information from repos. For example, to download the `HuggingFaceH4/ultrachat_200k` dataset from the command line, run
+
+ ```bash
+ huggingface-cli download HuggingFaceH4/ultrachat_200k --repo-type dataset
+ ```
+
+ See the [huggingface-cli download documentation](https://huggingface.co/docs/huggingface_hub/en/guides/cli#download-a-dataset-or-a-space) for more information.
+
+ You can also integrate this into your own library! For example, you can quickly load a CSV dataset with a few lines using Pandas.

```py
from huggingface_hub import hf_hub_download
import pandas as pd
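# Illustrative continuation, not shown in this truncated hunk; repo_id and filename are placeholders:
df = pd.read_csv(
    hf_hub_download(repo_id="username/my-dataset", filename="data.csv", repo_type="dataset")
)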

docs/hub/model-release-checklist.md

Lines changed: 6 additions & 1 deletion
@@ -63,13 +63,16 @@ We wrote an extensive guide on uploading best practices [here](https://huggingfa

Bonus: a recognised library also allows you to track downloads of your model over time.

- 2. **Pipeline Tag Selection**: Choose the correct [pipeline tag](https://huggingface.co/docs/hub/model-cards#specifying-a-task--pipelinetag-) that accurately reflects your model's primary task. This tag determines how your model appears in search results and which widgets are displayed on your model page.
+ 2. **Correct Metadata**:
+ - **Pipeline Tag:** Choose the correct [pipeline tag](https://huggingface.co/docs/hub/model-cards#specifying-a-task--pipelinetag-) that accurately reflects your model's primary task. This tag determines how your model appears in search results and which widgets are displayed on your model page.

Examples of common pipeline tags:
- `text-generation` - For language models that generate text
- `text-to-image` - For text-to-image generation models
- `image-text-to-text` - For vision-language models (VLMs) that generate text
- `text-to-speech` - For models that generate audio from text
+
+ - **License:** License information is crucial for users to understand how they can use the model.

3. **Research Papers**: If your model has associated research papers, you can cite them in your model card and they will be [linked automatically](https://huggingface.co/docs/hub/model-cards#linking-a-paper). This provides academic context, allows users to dive deeper into the theoretical foundations of your work, and increases citations.

@@ -88,6 +91,8 @@ Bonus: a recognised library also allows you to track downloads of your model ove

Try this model directly in your browser: [Space Demo](https://huggingface.co/spaces/username/model-demo)
```
+
+ When you create a demo, please download the model from its repository on the Hub (instead of using external sources like Google Drive); this cross-links the model artefacts and the demo, and creates more paths to visibility.

6. **Quantized Versions**: Consider uploading quantized versions of your model (e.g., in GGUF or DDUF formats) to improve accessibility for users with limited computational resources. Link these versions using the [`base_model` metadata field](https://huggingface.co/docs/hub/model-cards#specifying-a-base-model) on the quantized model cards. You can also clearly document performance differences between the original and quantized versions.

docs/hub/spaces-zerogpu.md

Lines changed: 1 addition & 1 deletion
@@ -2,7 +2,7 @@

<img src="https://cdn-uploads.huggingface.co/production/uploads/5f17f0a0925b9863e28ad517/naVZI-v41zNxmGlhEhGDJ.gif" style="max-width: 440px; width: 100%" alt="ZeroGPU schema" />

- ZeroGPU is a shared infrastructure that optimizes GPU usage for AI models and demos on Hugging Face Spaces. It dynamically allocates and releases NVIDIA A100 GPUs as needed, offering:
+ ZeroGPU is a shared infrastructure that optimizes GPU usage for AI models and demos on Hugging Face Spaces. It dynamically allocates and releases NVIDIA H200 GPUs as needed, offering:

1. **Free GPU Access**: Enables cost-effective GPU usage for Spaces.
2. **Multi-GPU Support**: Allows Spaces to leverage multiple GPUs concurrently on a single application.
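
For context, a rough sketch (not part of this diff) of how a Space typically requests ZeroGPU hardware with the `spaces` package; the model id is only an example:

```python
import gradio as gr
import spaces  # available in ZeroGPU Spaces
from diffusers import DiffusionPipeline

pipe = DiffusionPipeline.from_pretrained("stabilityai/stable-diffusion-xl-base-1.0")
pipe.to("cuda")

@spaces.GPU  # a GPU is attached only while this function runs
def generate(prompt: str):
    return pipe(prompt).images[0]

gr.Interface(fn=generate, inputs="text", outputs="image").launch()
```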

docs/inference-providers/_toctree.yml

Lines changed: 2 additions & 0 deletions
@@ -29,6 +29,8 @@
    title: Nebius
  - local: providers/novita
    title: Novita
+ - local: providers/nscale
+   title: Nscale
  - local: providers/replicate
    title: Replicate
  - local: providers/sambanova

docs/inference-providers/index.md

Lines changed: 5 additions & 0 deletions
@@ -23,6 +23,7 @@ Here is the complete list of partners integrated with Inference Providers, and t
| [Hyperbolic](./providers/hyperbolic) ||| | | |
| [Nebius](./providers/nebius) ||| || |
| [Novita](./providers/novita) ||| | ||
+ | [Nscale](./providers/nscale) ||| || |
| [Replicate](./providers/replicate) | | | |||
| [SambaNova](./providers/sambanova) || || | |
| [Together](./providers/together) ||| || |
@@ -59,6 +60,10 @@ You can use Inference Providers with your preferred tools, such as Python, JavaS

In this section, we will demonstrate a simple example using [deepseek-ai/DeepSeek-V3-0324](https://huggingface.co/deepseek-ai/DeepSeek-V3-0324), a conversational Large Language Model. For the example, we will use [Novita AI](https://novita.ai/) as Inference Provider.

+ > [!TIP]
+ > You can also automatically select a provider for a model using `provider="auto"` — it will pick the first available provider for your model based on your preferred order set in https://hf.co/settings/inference-providers.
+ > This is the default if you don't specify a provider in our Python or JavaScript SDK.

### Authentication

Inference Providers requires passing a user token in the request headers. You can generate a token by signing up on the Hugging Face website and going to the [settings page](https://huggingface.co/settings/tokens/new?ownUserPermissions=inference.serverless.write&tokenType=fineGrained). We recommend creating a `fine-grained` token with the scope to `Make calls to Inference Providers`.
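
As a rough illustration of that tip (a sketch, assuming the `huggingface_hub` Python SDK and an `HF_TOKEN` available in your environment):

```python
from huggingface_hub import InferenceClient

# provider="auto" picks the first available provider according to your order at
# https://hf.co/settings/inference-providers; it is also the default when provider is omitted.
client = InferenceClient(provider="auto")

completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V3-0324",
    messages=[{"role": "user", "content": "Hello!"}],
)
print(completion.choices[0].message.content)
```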

docs/inference-providers/providers/cohere.md

Lines changed: 1 addition & 1 deletion
@@ -56,6 +56,6 @@ Find out more about Chat Completion (VLM) [here](../tasks/chat-completion).

<InferenceSnippet
    pipeline=image-text-to-text
-   providersMapping={ {"cohere":{"modelId":"CohereLabs/aya-vision-8b","providerModelId":"c4ai-aya-vision-8b"} } }
+   providersMapping={ {"cohere":{"modelId":"CohereLabs/aya-vision-32b","providerModelId":"c4ai-aya-vision-32b"} } }
    conversational />

docs/inference-providers/providers/fal-ai.md

Lines changed: 1 addition & 1 deletion
@@ -64,6 +64,6 @@ Find out more about Text To Video [here](../tasks/text_to_video).

<InferenceSnippet
    pipeline=text-to-video
-   providersMapping={ {"fal-ai":{"modelId":"Wan-AI/Wan2.1-T2V-14B","providerModelId":"fal-ai/wan-t2v"} } }
+   providersMapping={ {"fal-ai":{"modelId":"Lightricks/LTX-Video","providerModelId":"fal-ai/ltx-video"} } }
    />
