
Commit 8095258

Merge pull request #270458 from aahill/ingestion-2
comma
2 parents 1febf05 + 284f376 commit 8095258

File tree

2 files changed: +3 -2 lines changed


articles/ai-services/openai/concepts/use-your-data.md

Lines changed: 1 addition & 1 deletion
@@ -328,7 +328,7 @@ Azure OpenAI On Your Data processes your documents by splitting them into chunks
 
 #### Setting chunk size for your use case
 
-The default chunk size is 1024 tokens. However, given the uniqueness of your data, you might find a different chunk size (such as 256, 512, or 1,536 tokens) more effective.
+The default chunk size is 1,024 tokens. However, given the uniqueness of your data, you might find a different chunk size (such as 256, 512, or 1,536 tokens) more effective.
 
 Adjusting the chunk size can enhance your chatbot's performance. While finding the optimal chunk size requires some trial and error, start by considering the nature of your dataset. A smaller chunk size is generally better for datasets with direct facts and less context, while a larger chunk size might be beneficial for more contextual information, though it could affect retrieval performance.

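The chunk-size tradeoff described in this file can be sketched with a toy splitter. This is an illustrative approximation, not the Azure OpenAI ingestion pipeline: whitespace splitting stands in for a real tokenizer, and `chunk_document` is a name invented here for the example.

```python
def chunk_document(text: str, chunk_size: int = 1024) -> list[str]:
    """Split text into pieces of at most `chunk_size` tokens.

    Whitespace words stand in for tokens; a real ingestion pipeline
    would use a model tokenizer instead.
    """
    tokens = text.split()
    return [
        " ".join(tokens[i:i + chunk_size])
        for i in range(0, len(tokens), chunk_size)
    ]

doc = "alpha beta gamma delta epsilon zeta"
# A smaller chunk size yields more, narrower chunks of the same document.
print(chunk_document(doc, chunk_size=2))
# ['alpha beta', 'gamma delta', 'epsilon zeta']
```

As the article's diff notes, narrower chunks tend to suit fact-lookup datasets, while wider chunks keep more surrounding context together per retrieved unit.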
articles/ai-services/openai/whats-new.md

Lines changed: 2 additions & 1 deletion
@@ -24,9 +24,10 @@ Azure OpenAI Studio now provides a Risks & Safety dashboard for each of your dep
 
 [Use the Risks & Safety monitor](./how-to/risks-safety-monitor.md)
 
-### Elasticsearch database support for Azure OpenAI On Your Data
+### Azure OpenAI On Your Data updates
 
 - You can now connect to an Elasticsearch vector database to be used with [Azure OpenAI On Your Data](./concepts/use-your-data.md?tabs=elasticsearch#supported-data-sources).
+- You can use the [chunk size parameter](./concepts/use-your-data.md#chunk-size-preview) during data ingestion to set the maximum number of tokens of any given chunk of data in your index.
 
 ### 2024-02-01 general availability (GA) API released
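Before picking a chunk size for ingestion, it can help to gauge how many chunks a typical document would produce. The sketch below uses the rough rule of thumb of about four characters per token for English text; both helper names and the heuristic itself are assumptions for illustration, not part of the Azure OpenAI API.

```python
def estimate_tokens(text: str) -> int:
    # Assumption: ~4 characters per token, a common rough rule
    # of thumb for English text with OpenAI-style tokenizers.
    return max(1, len(text) // 4)

def chunks_needed(text: str, chunk_size: int) -> int:
    # Ceiling division: number of chunks at the given chunk size.
    return -(-estimate_tokens(text) // chunk_size)

doc = "x" * 8000  # ~2,000 estimated tokens
print(chunks_needed(doc, 1024))  # 2 chunks at the default size
print(chunks_needed(doc, 256))   # 8 chunks at a smaller size
```

A document that fits in one or two chunks at the default 1,024 tokens may fragment into many small pieces at 256, which changes what each retrieval hit contains.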
