Commit fec5ef2

Update deploy-models-tsuzumi.md
Remove "More inference examples" section per PM instructions. Update PM alias in metadata
1 parent 7a2f039 commit fec5ef2

File tree

1 file changed: +2 −16 lines

articles/ai-studio/how-to/deploy-models-tsuzumi.md

Lines changed: 2 additions & 16 deletions
```diff
@@ -6,8 +6,8 @@ ms.service: azure-ai-studio
 manager: scottpolly
 ms.topic: how-to
 ms.date: 10/24/2024
-ms.reviewer: ssalgado
-reviewer: ssalgadodev
+ms.reviewer: haelhamm
+reviewer: hazemelh
 ms.author: ssalgado
 author: ssalgadodev
 ms.custom: references_regions, generated
@@ -1322,20 +1322,6 @@ The following example shows how to handle events when the model detects harmful
 
 ::: zone-end
 
-## More inference examples
-
-For more examples of how to use Tsuzumi models, see the following examples and tutorials:
-
-| Description                               | Language   | Sample                                                             |
-|-------------------------------------------|------------|--------------------------------------------------------------------|
-| CURL request                              | Bash       | [Link](https://aka.ms/meta-llama-3.1-405B-instruct-webrequests)    |
-| Azure AI Inference package for JavaScript | JavaScript | [Link](https://aka.ms/azsdk/azure-ai-inference/javascript/samples) |
-| Azure AI Inference package for Python     | Python     | [Link](https://aka.ms/azsdk/azure-ai-inference/python/samples)     |
-| Python web requests                       | Python     | [Link](https://aka.ms/meta-llama-3.1-405B-instruct-webrequests)    |
-| OpenAI SDK (experimental)                 | Python     | [Link](https://aka.ms/meta-llama-3.1-405B-instruct-openai)         |
-| LangChain                                 | Python     | [Link](https://aka.ms/meta-llama-3.1-405B-instruct-langchain)      |
-| LiteLLM                                   | Python     | [Link](https://aka.ms/meta-llama-3.1-405B-instruct-litellm)        |
-
 ## Cost and quota considerations for tsuzumi models deployed as serverless API endpoints
 
 Quota is managed per deployment. Each deployment has a rate limit of 200,000 tokens per minute and 1,000 API requests per minute. However, we currently limit one deployment per model per project. Contact Microsoft Azure Support if the current rate limits aren't sufficient for your scenarios.
```
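The quota context above notes per-deployment limits of 200,000 tokens per minute and 1,000 API requests per minute. A client that hits those limits typically receives a throttling response and retries with exponential backoff. The sketch below illustrates that pattern only; the `RateLimitError` type, the `with_backoff` helper, and the delay schedule are assumptions for illustration, not part of the documented Tsuzumi API.

```python
import time


class RateLimitError(Exception):
    """Stand-in for an HTTP 429 (throttled) response from a serverless endpoint."""


def with_backoff(fn, max_retries=4, base_delay=1.0, sleep=time.sleep):
    """Call fn(), retrying on RateLimitError with exponential backoff.

    Delays grow as base_delay * 2**attempt (1s, 2s, 4s, 8s by default).
    The final failure is re-raised to the caller.
    """
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except RateLimitError:
            if attempt == max_retries:
                raise
            sleep(base_delay * 2 ** attempt)


# Hypothetical usage: a call that is throttled twice before succeeding.
calls = {"n": 0}

def flaky_call():
    calls["n"] += 1
    if calls["n"] <= 2:
        raise RateLimitError()
    return "ok"

result = with_backoff(flaky_call, sleep=lambda s: None)  # skip real sleeping in this demo
```

In a real client, `fn` would wrap the HTTP request to the deployment and the backoff keeps the request rate under the per-minute quota instead of failing immediately.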

0 commit comments

Comments
 (0)