Commit a365b7d
Add Fireworks AI provider + instructions for new provider (#2848)
* first draft of dynamic mapping
* fix imports and typing
* add back recommended models fetching for hf-inference
* fix
* avoid circular imports
* small clean up
* add default supported model list
* remove unnecessary arg
* nit
* rename function
* another nit
* fix
* fix conversational
* fix hf-inference
* add warning when status=staging
* update warning and use model_info
* update import
* fix ExpandModelProperty_T
* refactor
* fix Python 3.8
* fix test
* remove newlines
* Base class for inference providers
* revert
* refactor hf-inference and fal-ai tests
* replicate tests
* sambanova and together tests
* reorder
* unfinished business
* some docstrings
* fix some tests
* fix HfInference does not require token
* fix inference client tests
* fix hf-inference _prepare_api_key
* fal-ai get_response tests
* test get_response for together + replicate
* fix prepare_url
* Add Fireworks AI provider + instructions for new provider
* Update src/huggingface_hub/inference/_providers/new_provider.md

  Co-authored-by: Célina <[email protected]>

* add Fireworks AI to supported providers table

---------

Co-authored-by: Celina Hanouti <[email protected]>
1 parent 0467c1c · commit a365b7d

10 files changed: +446 -40 lines changed

docs/source/en/guides/inference.md

Lines changed: 30 additions & 30 deletions

@@ -248,36 +248,36 @@

[`InferenceClient`]'s goal is to provide the easiest interface to run inference on Hugging Face models, on any provider. It has a simple API that supports the most common tasks. Here is a table showing which providers support which tasks:

The provider support table (covering every task from [`~InferenceClient.audio_classification`] through [`~InferenceClient.tabular_regression`]) gains a **Fireworks AI** column between fal-ai and Sambanova; the existing provider columns are carried over unchanged. The updated header row:

| Domain | Task | HF Inference | Replicate | fal-ai | Fireworks AI | Sambanova | Together |
| ------------------- | --------------------------------------------------- | ------------ | --------- | ------ | ------------ | --------- | -------- |

Fireworks AI is marked as supported only for the conversational task ([`~InferenceClient.chat_completion`]), matching the single task registered for it in `_providers/__init__.py` below.

src/huggingface_hub/inference/_client.py

Lines changed: 1 addition & 1 deletion

@@ -133,7 +133,7 @@ class InferenceClient:

In the `provider` parameter docstring, the list of accepted values is updated:

Before: Name of the provider to use for inference. Can be `"replicate"`, `"together"`, `"fal-ai"`, `"sambanova"` or `"hf-inference"`.
After: Name of the provider to use for inference. Can be `"fal-ai"`, `"fireworks-ai"`, `"replicate"`, `"sambanova"`, `"together"`, or `"hf-inference"`.

The surrounding lines are unchanged: the value defaults to `"hf-inference"` (Hugging Face Serverless Inference API), and if `model` is a URL or `base_url` is passed, `provider` is not used.

src/huggingface_hub/inference/_generated/_async_client.py

Lines changed: 1 addition & 1 deletion

@@ -121,7 +121,7 @@ class AsyncInferenceClient:

The same one-line docstring update is applied to the async client:

Before: Name of the provider to use for inference. Can be `"replicate"`, `"together"`, `"fal-ai"`, `"sambanova"` or `"hf-inference"`.
After: Name of the provider to use for inference. Can be `"fal-ai"`, `"fireworks-ai"`, `"replicate"`, `"sambanova"`, `"together"`, or `"hf-inference"`.
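For context, a minimal usage sketch of the new provider value (the API key and model id are placeholders; per this commit, `chat_completion` is the only task the Fireworks AI provider supports):

```py
from huggingface_hub import InferenceClient

# Route a chat completion request through Fireworks AI.
client = InferenceClient(provider="fireworks-ai", api_key="fw_...")  # placeholder key
response = client.chat_completion(
    messages=[{"role": "user", "content": "Hello!"}],
    model="meta-llama/Llama-3.1-8B-Instruct",  # placeholder HF model id
)
print(response.choices[0].message.content)
```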

src/huggingface_hub/inference/_providers/__init__.py

Lines changed: 5 additions & 0 deletions

@@ -7,6 +7,7 @@
     FalAITextToSpeechTask,
     FalAITextToVideoTask,
 )
+from .fireworks_ai import FireworksAIConversationalTask
 from .hf_inference import HFInferenceBinaryInputTask, HFInferenceConversational, HFInferenceTask
 from .replicate import ReplicateTask, ReplicateTextToSpeechTask
 from .sambanova import SambanovaConversationalTask

@@ -15,6 +16,7 @@
 PROVIDER_T = Literal[
     "fal-ai",
+    "fireworks-ai",
     "hf-inference",
     "replicate",
     "sambanova",

@@ -28,6 +30,9 @@
         "text-to-speech": FalAITextToSpeechTask(),
         "text-to-video": FalAITextToVideoTask(),
     },
+    "fireworks-ai": {
+        "conversational": FireworksAIConversationalTask(),
+    },
     "hf-inference": {
         "text-to-image": HFInferenceTask("text-to-image"),
         "conversational": HFInferenceConversational(),
src/huggingface_hub/inference/_providers/fireworks_ai.py (new file)

Lines changed: 14 additions & 0 deletions

```py
from typing import Any, Dict, Optional

from ._common import TaskProviderHelper, filter_none


class FireworksAIConversationalTask(TaskProviderHelper):
    def __init__(self):
        super().__init__(provider="fireworks-ai", base_url="https://api.fireworks.ai/inference", task="conversational")

    def _prepare_route(self, mapped_model: str) -> str:
        # Chat completions are served on an OpenAI-style route.
        return "/v1/chat/completions"

    def _prepare_payload(self, inputs: Any, parameters: Dict, mapped_model: str) -> Optional[Dict]:
        # Drop unset (None) parameters and send the messages plus the provider-side model id.
        return {"messages": inputs, **filter_none(parameters), "model": mapped_model}
```
src/huggingface_hub/inference/_providers/new_provider.md (new file)

Lines changed: 79 additions & 0 deletions

## How to add a new provider?

Before adding a new provider to the `huggingface_hub` library, make sure it has already been added to `huggingface.js` and is working on the Hub. Support in the Python library comes as a second step; this guide assumes that first step is complete.

### 1. Implement the provider helper

Create a new file under `src/huggingface_hub/inference/_providers/{provider_name}.py` and copy-paste the following snippet.

Implement only the methods that require custom handling; check the base implementation for the default behavior. If you don't need to override a method, remove it. At least one of `_prepare_payload` or `_prepare_body` must be overridden.

If the provider supports multiple tasks that require different implementations, create dedicated subclasses for each task, following the pattern shown in `fal_ai.py`.

```py
from typing import Any, Dict, Optional, Union

from ._common import TaskProviderHelper


class MyNewProviderTaskProviderHelper(TaskProviderHelper):
    def __init__(self):
        """Define high-level parameters."""
        super().__init__(provider=..., base_url=..., task=...)

    def get_response(self, response: Union[bytes, Dict]) -> Any:
        """Return the response in the expected format.

        Override this method in subclasses for customized response handling."""
        return super().get_response(response)

    def _prepare_headers(self, headers: Dict, api_key: str) -> Dict:
        """Return the headers to use for the request.

        Override this method in subclasses for customized headers.
        """
        return super()._prepare_headers(headers, api_key)

    def _prepare_route(self, mapped_model: str) -> str:
        """Return the route to use for the request.

        Override this method in subclasses for customized routes.
        """
        return super()._prepare_route(mapped_model)

    def _prepare_payload(self, inputs: Any, parameters: Dict, mapped_model: str) -> Optional[Dict]:
        """Return the payload to use for the request, as a dict.

        Override this method in subclasses for customized payloads.
        Only one of `_prepare_payload` and `_prepare_body` should return a value.
        """
        return super()._prepare_payload(inputs, parameters, mapped_model)

    def _prepare_body(
        self, inputs: Any, parameters: Dict, mapped_model: str, extra_payload: Optional[Dict]
    ) -> Optional[bytes]:
        """Return the body to use for the request, as bytes.

        Override this method in subclasses for customized body data.
        Only one of `_prepare_payload` and `_prepare_body` should return a value.
        """
        return super()._prepare_body(inputs, parameters, mapped_model, extra_payload)
```
### 2. Register the provider helper in `__init__.py`

Go to `src/huggingface_hub/inference/_providers/__init__.py` and add your provider to `PROVIDER_T` and `PROVIDERS`. Please respect alphabetical order; see the sketch below.
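A hypothetical registration could look like this; `my-new-provider`, its module, and its helper class are placeholders, and the annotation on `PROVIDERS` is assumed:

```py
from typing import Dict, Literal

from ._common import TaskProviderHelper
from .my_new_provider import MyNewProviderTaskProviderHelper  # hypothetical module

PROVIDER_T = Literal[
    "fal-ai",
    "fireworks-ai",
    "hf-inference",
    "my-new-provider",  # inserted in alphabetical order
    "replicate",
    "sambanova",
    "together",
]

PROVIDERS: Dict[str, Dict[str, TaskProviderHelper]] = {
    # ... existing providers kept as-is ...
    "my-new-provider": {
        "conversational": MyNewProviderTaskProviderHelper(),
    },
}
```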
### 3. Update the docstring in `InferenceClient.__init__` to document your provider
### 4. Add static tests in `tests/test_inference_providers.py`

You only have to add tests for the methods you override.
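For example, a static test can instantiate the helper and assert an overridden method directly; the expected route below is hypothetical:

```py
# tests/test_inference_providers.py (illustrative)
from huggingface_hub.inference._providers.my_new_provider import MyNewProviderTaskProviderHelper


def test_prepare_route():
    # Assumes the provider overrides _prepare_route with a fixed chat route.
    helper = MyNewProviderTaskProviderHelper()
    assert helper._prepare_route(mapped_model="some-model") == "/v1/chat/completions"
```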
### 5. Add VCR tests in `tests/test_inference_client.py`

- Add an entry in `_RECOMMENDED_MODELS_FOR_VCR` at the top of the test module. It maps each task to a test model; the model id must be the HF model id (see the sketch after this list).
- Add an entry in `API_KEY_ENV_VARIABLES` to define which environment variable should be used.
- Run the tests locally with `pytest tests/test_inference_client.py -k <provider>` and commit the VCR cassettes.
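A sketch of the two entries; the structure is assumed from the descriptions above, and the provider name, model id, and environment variable are placeholders:

```py
# tests/test_inference_client.py (illustrative)
_RECOMMENDED_MODELS_FOR_VCR = {
    "my-new-provider": {
        "conversational": "meta-llama/Llama-3.1-8B-Instruct",  # must be an HF model id
    },
}

API_KEY_ENV_VARIABLES = {
    "my-new-provider": "MY_NEW_PROVIDER_API_KEY",
}
```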
