Commit b9d5367

Updated with AOAI embedding model best practices
1 parent 2501186 commit b9d5367

1 file changed: +10 -1 lines changed

articles/search/cognitive-search-skill-azure-openai-embedding.md

Lines changed: 10 additions & 1 deletion
@@ -8,7 +8,7 @@ ms.service: cognitive-search
 ms.custom:
   - ignite-2023
 ms.topic: reference
-ms.date: 02/21/2024
+ms.date: 03/28/2024
 ---
 
 # Azure OpenAI Embedding skill
@@ -113,6 +113,15 @@ The output resides in memory. To send this output to a field in the search index
 ]
 ```
 
+## Best practices
+
+Consider the following best practices when you use this skill:
+- If you reach your Azure OpenAI TPM (tokens per minute) limit, review the [quota limits advisory](../ai-services/openai/quotas-limits.md) so that you can respond accordingly. For more information about the performance of your Azure OpenAI instance, see the [Azure OpenAI monitoring](../ai-services/openai/how-to/monitoring.md) documentation.
+- Ideally, use a separate Azure OpenAI embedding model deployment for this skill, distinct from the deployments that serve other use cases, including the [query vectorizer](vector-search-how-to-configure-vectorizer.md). Separate deployments let you tailor each one to its use case, which helps performance and makes it easier to identify traffic from the indexer and the index embedding calls.
+- Host your Azure OpenAI instance in the same region as your AI Search service, or at least in a geographically close region. Proximity reduces latency and speeds up data transfer between the services.
+- If your Azure OpenAI TPM (tokens per minute) limit is higher than the default published in the [quotas and limits](../ai-services/openai/quotas-limits.md) documentation, open a [support case](../azure-portal/supportability/how-to-create-azure-support-request.md) with the Azure AI Search team so that the limit can be adjusted accordingly. The adjustment keeps the documented default TPM limit from unnecessarily slowing down your indexing process.
+
+
 ## Errors and warnings
 
 | Condition | Result |
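
The TPM guidance in the added best practices can be made concrete with a rough capacity estimate. The sketch below is illustrative only: the function name `estimate_indexing_minutes`, the document count, the average tokens per document, and both TPM values are hypothetical assumptions, not figures from the Azure documentation. It computes a lower bound on how long an embedding workload would take if the TPM limit were the only constraint.

```python
import math


def estimate_indexing_minutes(total_tokens: int, tpm_limit: int) -> int:
    """Return a lower bound, in whole minutes, on the time needed to embed
    total_tokens when throughput is capped at tpm_limit tokens per minute.

    This ignores request-rate limits, retries, and network latency, so real
    indexing runs can take longer.
    """
    if tpm_limit <= 0:
        raise ValueError("tpm_limit must be positive")
    if total_tokens < 0:
        raise ValueError("total_tokens must not be negative")
    return math.ceil(total_tokens / tpm_limit)


# Hypothetical workload: every number here is an assumption for illustration.
document_count = 50_000
average_tokens_per_document = 600
total_tokens = document_count * average_tokens_per_document  # 30,000,000

# Compare an assumed lower limit with an assumed raised limit.
minutes_at_lower_limit = estimate_indexing_minutes(total_tokens, 240_000)
minutes_at_raised_limit = estimate_indexing_minutes(total_tokens, 1_000_000)

print(minutes_at_lower_limit)   # 125
print(minutes_at_raised_limit)  # 30
```

Under these assumed numbers, the same workload drops from about 125 minutes to about 30 minutes, which shows why it matters that the indexer is not held to a default limit lower than the quota you actually have.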
