docs/api-inference/hub-api.md (+5 -5)
@@ -1,6 +1,6 @@
 # Hub API
 
-The Hub provides a few API to deal with Inference Providers. Here is a list of them.
+The Hub provides a few APIs to interact with Inference Providers. Here is a list of them:
 
 ## List models
 
@@ -16,7 +16,7 @@ To list models powered by a provider, use the `inference_provider` query parameter
 ...
 ```
 
-It can be combined with other filters to e.g. select only text-to-image models:
+It can be combined with other filters to e.g. select only `text-to-image` models:
 
 ```sh
 # List text-to-image models served by Fal AI
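The curl command in this block is collapsed in the diff view. As an illustration of the query it documents, here is a minimal Python sketch; it assumes the public `https://huggingface.co/api/models` endpoint, the documented `inference_provider` parameter, and the standard `pipeline_tag` task filter (the `fal-ai` slug is inferred from the comment above):

```python
import requests

# Hub API endpoint (assumed): list models filtered by provider and task.
# `inference_provider` is the parameter documented in this section.
response = requests.get(
    "https://huggingface.co/api/models",
    params={"inference_provider": "fal-ai", "pipeline_tag": "text-to-image"},
)
response.raise_for_status()
for model in response.json():
    print(model["id"])
```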
@@ -28,7 +28,7 @@ It can be combined with other filters to e.g. select only text-to-image models:
 ...
 ```
 
-Pass a comma-separated list to select from multiple providers:
+Pass a comma-separated list of providers to select multiple:
 
 ```sh
 # List image-text-to-text models served by Novita or Sambanova
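Similarly, a minimal sketch of the multi-provider variants: the comma-separated list above, and the "all models served by at least one inference provider" case referenced in the next hunk. The provider slugs and the `all` value are assumptions based on the surrounding text:

```python
import requests

# Comma-separated providers: image-text-to-text models served by Novita or Sambanova
multi = requests.get(
    "https://huggingface.co/api/models",
    params={
        "inference_provider": "novita,sambanova",
        "pipeline_tag": "image-text-to-text",
    },
)

# `all`: models served by at least one inference provider
any_provider = requests.get(
    "https://huggingface.co/api/models",
    params={"inference_provider": "all"},
)
print(len(multi.json()), len(any_provider.json()))
```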
@@ -54,7 +54,7 @@ Finally, you can select all models served by at least one inference provider:
 
 ## Get model status
 
-If you are interested by a specific model and want to check if at least 1 provider serves it, you can request the `inference` attribute in the model info endpoint:
+To find an inference provider for a specific model, request the `inference` attribute in the model info endpoint:
 
 <inferencesnippet>
@@ -170,4 +170,4 @@ In the `huggingface_hub`, use `model_info` with the expand parameter:
 </inferencesnippet>
 
 
-For each provider, you get the status (`staging` or `live`), the related task (here, `conversational`) and the providerId. In practice, this information is mostly relevant for the JS and Python clients. The relevant part is to know that the listed providers are the ones serving the model.
+Each provider serving the model shows a status (`staging` or `live`), the related task (here, `conversational`), and the `providerId`. In practice, this information is relevant for the JS and Python clients.
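A minimal sketch of the lookup this section describes, assuming `huggingface_hub.model_info` accepts `"inference"` in its `expand` parameter (as the snippet above suggests) and exposes the expanded value as an attribute of the returned object; the model id is an example:

```python
from huggingface_hub import model_info

# Fetch only the `inference` attribute for one model
info = model_info("deepseek-ai/DeepSeek-V3-0324", expand=["inference"])
print(info.inference)
```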
docs/api-inference/hub-integration.md (+4 -4)
@@ -1,17 +1,17 @@
 # Hub Integration
 
-The Inference Providers is tightly integrated with the Hugging Face Hub. No matter in which service you use it, the usage and billing will be centralized on your Hugging Face account.
+Inference Providers is tightly integrated with the Hugging Face Hub. No matter which provider you use, the usage and billing will be centralized in your Hugging Face account.
 
 ## Model search
 
-When listing models on the Hub, you can filter to select models deployed on the inference provider for your choice. For example, to list all models deployed on Fireworks AI infra: https://huggingface.co/models?inference_provider=fireworks-ai.
+When listing models on the Hub, you can filter to select models deployed on the inference provider of your choice. For example, to list all models deployed on Fireworks AI infra: https://huggingface.co/models?inference_provider=fireworks-ai.
-It is also possible to select multiple providers or even all of them to filter all models that are available on at least 1 provider: https://huggingface.co/models?inference_provider=all.
+It is also possible to select all or multiple providers and filter their available models: https://huggingface.co/models?inference_provider=all.
@@ -20,7 +20,7 @@ It is also possible to select multiple providers or even all of them to filter a
 
 ## Features using Inference Providers
 
-Several Hugging Face features utilize the Inference Providers and count towards your monthly credits. The included monthly credits for PRO and Enterprise should cover moderate usage of these features for most users.
+Several Hugging Face features utilize Inference Providers and count towards your monthly credits. The included monthly credits for PRO and Enterprise should cover moderate usage of these features for most users.
 
 - [Inference Widgets](https://huggingface.co/deepseek-ai/DeepSeek-V3-0324): Interactive widgets available on model pages. This is the entry point to quickly test a model on the Hub.
docs/api-inference/index.md (+3 -3)
@@ -35,7 +35,7 @@ To get started quickly with [Chat Completion models](http://huggingface.co/model
 
 You can call the Inference Providers with your preferred tools, such as Python, JavaScript, or cURL. To simplify integration, we offer both a Python SDK (`huggingface_hub`) and a JavaScript SDK (`huggingface.js`).
 
-In this section, we will demonstrate a simple example using [deepseek-ai/DeepSeek-V3-0324](https://huggingface.co/deepseek-ai/DeepSeek-V3-0324), a conversational Large Language Model. For the example, we will use [Novita AI](https://novita.ai/) as Inference Provider with routed requests. You will learn what that means in the next chapters.
+In this section, we will demonstrate a simple example using [deepseek-ai/DeepSeek-V3-0324](https://huggingface.co/deepseek-ai/DeepSeek-V3-0324), a conversational Large Language Model. For the example, we will use [Novita AI](https://novita.ai/) as Inference Provider.
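The actual snippets follow this paragraph in the source file but are not part of this diff. As an illustrative sketch of the Python route described here (the `provider` argument and the OpenAI-style `chat.completions.create` call follow the `huggingface_hub` client pattern; the prompt is a placeholder):

```python
import os
from huggingface_hub import InferenceClient

# Routed request: authenticate with your HF token, let the Hub route to Novita AI
client = InferenceClient(provider="novita", api_key=os.environ["HF_TOKEN"])

completion = client.chat.completions.create(
    model="deepseek-ai/DeepSeek-V3-0324",
    messages=[{"role": "user", "content": "What is the capital of France?"}],
)
print(completion.choices[0].message.content)
```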
docs/api-inference/pricing.md (+5 -5)
@@ -1,6 +1,6 @@
 # Pricing and Billing
 
-The Inference Providers is a production-ready service involving external partners and is therefore a paid-product. However, as an Hugging Face user you get monthly credits to run experiments. The amount of credits you get depends on your type of account:
+Inference Providers is a production-ready service involving external partners and is therefore a paid product. However, as a Hugging Face user you get monthly credits to run experiments. The amount of credits you get depends on your account type:
@@ -11,7 +11,7 @@ The Inference Providers is a production-ready service involving external partner
 
 **PRO and Enterprise Hub users** can continue using the API once their monthly included credits are exhausted. This billing model, known as "Pay-as-you-Go" (PAYG), is charged on top of the monthly subscription. PAYG is only available for providers that are integrated with our billing system. We're actively working to integrate all providers, but in the meantime, any providers that are not yet integrated will be blocked once the free-tier limit is reached.
 
-If you haven't used up your included credits yet, we estimate costs for providers that aren’t fully integrated with our billing system. These estimates are usually higher than the actual cost to prevent abuse, which is why PAYG is currently disabled for those providers.
+If you have remaining credits, we estimate costs for providers that aren’t fully integrated with our billing system. These estimates are usually higher than the actual cost to prevent abuse, which is why PAYG is currently disabled for those providers.
 
 You can track your spending on your [billing page](https://huggingface.co/settings/billing).
 
@@ -25,7 +25,7 @@ Hugging Face charges you the same rates as the provider, with no additional fees
 
 The documentation above assumes you are making routed requests to external providers. In practice, there are 3 different ways to run inference, each with unique billing implications:
 
-- **Routed Request**: This is the default method for using the Inference Providers. Simply use the JavaScript or Python `InferenceClient`, or make raw HTTP requests with your Hugging Face User Access Token. Your request is automatically routed through Hugging Face to the provider's platform. No separate provider account is required, and billing is managed directly by Hugging Face. This approach lets you seamlessly switch between providers without additional setup.
+- **Routed Request**: This is the default method for using Inference Providers. Simply use the JavaScript or Python `InferenceClient`, or make raw HTTP requests with your Hugging Face User Access Token. Your request is automatically routed through Hugging Face to the provider's platform. No separate provider account is required, and billing is managed directly by Hugging Face. This approach lets you seamlessly switch between providers without additional setup.
 
 - **Routed Request with Custom Key**: In your [settings page](https://huggingface.co/settings/inference-providers) on the Hub, you can configure a custom key for each provider. To use this option, you'll need to create an account on the provider's platform, and billing will be handled directly by that provider. Hugging Face won't charge you for the call. This method gives you more control over billing when experimenting with models on the Hub. When making a routed request with a custom key, your code remains unchanged—you'll still pass your Hugging Face User Access Token. Hugging Face will automatically swap the authentication when routing the request.
 
@@ -41,15 +41,15 @@ Here is a table that sums up what we've seen so far:
 
 ## HF-Inference cost
 
-As you may have noticed, you can select to work with `"hf-inference"` provider. This is what used to be the "Inference API (serverless)" prior to the Inference Providers integration. From a user point of view, working with HF Inference is the same as with any other providers. Past the free-tier credits, you get charged for every inference request based on the compute time x price of the underlying hardware.
+As you may have noticed, you can choose to work with the `"hf-inference"` provider. This service used to be the "Inference API (serverless)" prior to Inference Providers. From a user point of view, working with HF Inference is the same as with any other provider. Past the free-tier credits, you get charged for every inference request based on the compute time × price of the underlying hardware.
 
 For instance, a request to [black-forest-labs/FLUX.1-dev](https://huggingface.co/black-forest-labs/FLUX.1-dev) that takes 10 seconds to complete on a GPU machine that costs $0.00012 per second to run, will be billed $0.0012.
 
 The `"hf-inference"` provider is currently the default provider when working with the JavaScript and Python SDKs. Note that this default might change in the future.
 
 ## Organization billing
 
-For Enterprise Hub organizations, it is possible to centralize billing for all your users. Each user still use their own User Access Token but the requests are billed to your organization. This can be done by passing `"X-HF-Bill-To: my-org-name"` as header in your HTTP requests.
+For Enterprise Hub organizations, it is possible to centralize billing for all your users. Each user still uses their own User Access Token but the requests are billed to your organization. This can be done by passing `"X-HF-Bill-To: my-org-name"` as a header in your HTTP requests.
 
 If you are using the JavaScript `InferenceClient`, you can set the `billTo` attribute at a client level:
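The JavaScript snippet itself is truncated in this view. As an illustration of the same idea in Python, here is a minimal sketch using the documented `X-HF-Bill-To` header; passing custom headers through the Python `InferenceClient` is an assumption here, and `my-org-name` is a placeholder:

```python
from huggingface_hub import InferenceClient

# Assumption: all routed calls from this client carry the billing header,
# so usage is charged to the organization instead of the individual user.
client = InferenceClient(
    provider="novita",
    headers={"X-HF-Bill-To": "my-org-name"},
)
```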