
Commit dc329fe

image update
1 parent 8c49b64 commit dc329fe

File tree

2 files changed: +2 -2 lines changed


cloud-infrastructure/ai-infra-gpu/ai-infrastructure/litellm/README.md

Lines changed: 2 additions & 2 deletions
```diff
@@ -1,8 +1,8 @@
 # Calling multiple vLLM inference servers using LiteLLM
 
-In this tutorial we explain how to use a LiteLLM Proxy Server to call multiple LLM inference endpoints from a single interface. LiteLLM interacts will 100+ LLMs such as OpenAI, Coheren, NVIDIA Triton and NIM, etc. Here we will use two vLLM inference servers.
+In this tutorial we explain how to use a LiteLLM Proxy Server to call multiple LLM inference endpoints from a single interface. LiteLLM interacts will 100+ LLMs such as OpenAI, Cohere, NVIDIA Triton and NIM, etc. Here we will use two vLLM inference servers.
 
-![Hybrid shards](assets/images/litellm.avif "LiteLLM")
+![Hybrid shards](assets/images/litellm.png "LiteLLM")
 
 ## Introduction
```

51.4 KB

0 commit comments
