Merge pull request #127 from MicrosoftDocs/main

Taojunshen · web-flow · commit 3d9b2ce3b80d · 2024-09-05T06:35:01.000+08:00
9/4/2024 PM Publish
diff --git a/articles/ai-services/openai/concepts/models.md b/articles/ai-services/openai/concepts/models.md
@@ -66,7 +66,7 @@ See [model versions](../concepts/model-versions.md) to learn about how Azure Ope
 
 |  Model ID  | Description | Max Request (tokens) | Training Data (up to)  |
 |  --- |  :--- |:--- |:---: |
-|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000  <br> Output: 4,096| Oct 2023 |
+|`gpt-4o` (2024-08-06) <br> **GPT-4o (Omni)** | **Latest large GA model** <br> - Structured outputs<br> - Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000  <br> Output: 16,384 | Oct 2023 |
 |`gpt-4o-mini` (2024-07-18) <br> **GPT-4o mini** | **Latest small GA model** <br> - Fast, inexpensive, capable model ideal for replacing GPT-3.5 Turbo series models. <br> - Text, image processing <br>- JSON Mode <br> - parallel function calling | Input: 128,000 <br> Output: 16,384  | Oct 2023 |
 |`gpt-4o` (2024-05-13) <br> **GPT-4o (Omni)** | Text, image processing <br> - JSON Mode <br> - parallel function calling <br> - Enhanced accuracy and responsiveness <br> - Parity with English text and coding tasks compared to GPT-4 Turbo with Vision <br> - Superior performance in non-English languages and in vision tasks |Input: 128,000  <br> Output: 4,096| Oct 2023 |
 | `gpt-4` (turbo-2024-04-09) <br>**GPT-4 Turbo with Vision** | **New GA model** <br> - Replacement for all previous GPT-4 preview models (`vision-preview`, `1106-Preview`, `0125-Preview`). <br> - [**Feature availability**](#gpt-4o-and-gpt-4-turbo) is currently different depending on method of input, and deployment type. | Input: 128,000  <br> Output: 4,096  | Dec 2023 |
diff --git a/articles/search/index.yml b/articles/search/index.yml
@@ -50,10 +50,10 @@ landingContent:
             url: retrieval-augmented-generation-overview.md
       - linkListType: quickstart
         links:
-          - text: Create a vector index
-            url: search-get-started-vector.md
           - text: Retrieve data using an LLM
             url: search-get-started-rag.md
+          - text: Create a vector index
+            url: search-get-started-vector.md
           - text: Query a vector index
             url: vector-search-how-to-query.md
       - linkListType: sample
diff --git a/articles/search/search-data-sources-gallery.md b/articles/search/search-data-sources-gallery.md
@@ -9,8 +9,7 @@ ms.service: cognitive-search
 ms.custom:
   - ignite-2023
 ms.topic: conceptual
-layout: LandingPage
-ms.date: 06/18/2024
+ms.date: 09/04/2024
 ---
 
 # Data sources gallery
@@ -153,7 +152,7 @@ By [Azure AI Search](search-how-to-index-onelake-files.md)
 
 Connect to a OneLake lakehouse to extract supported files content from a hierarchy of directories and nested subdirectories.
 
-[More details](search-howto-index-cosmosdb.md)
+[More details](search-how-to-index-onelake-files.md)
 
 :::image type="icon" source="media/search-data-sources-gallery/fabric_onelake_logo.png":::
 
diff --git a/articles/search/search-limits-quotas-capacity.md b/articles/search/search-limits-quotas-capacity.md
@@ -8,7 +8,7 @@ author: HeidiSteen
 ms.author: heidist
 ms.service: cognitive-search
 ms.topic: conceptual
-ms.date: 06/13/2024
+ms.date: 09/04/2024
 ms.custom:
   - references_regions
   - build-2024
@@ -63,11 +63,14 @@ You might find some variation in maximum limits if your service happens to be pr
 
 ## Document limits 
 
-You can have approximately 24 billion documents per index on Basic, S1, S2, S3, L1, and L2 search services. For S3 HD, the limit is 2 billion documents per index. Each instance of a complex collection counts as a separate document in terms of these limits.
+Maximum number of documents per index are:
 
-### Document size limits per API call
++ 24 billion on Basic, S1, S2, S3, L1, and L2 search services.
++ 2 billion on S3 HD.
 
-The maximum document size when calling an Index API is approximately 16 megabytes.
+Each instance of a complex collection counts as a separate document in terms of these limits.
+
+Maximum document size when calling an Index API is approximately 16 megabytes.
 
 Document size is actually a limit on the size of the Index API request body. Since you can pass a batch of multiple documents to the Index API at once, the size limit realistically depends on how many documents are in the batch. For a batch with a single document, the maximum document size is 16 MB of JSON.
 
@@ -79,73 +82,49 @@ When you index documents with vector fields, Azure AI Search constructs internal
 
 The service enforces a vector index size quota **for every partition** in your search service. Each extra partition increases the available vector index size quota. This quota is a hard limit to ensure your service remains healthy, which means that further indexing attempts once the limit is exceeded results in failure. You can resume indexing once you free up available quota by either deleting some vector documents or by scaling up in partitions.
 
-The table describes the vector index size quota per partition across the service tiers. For context, it includes:
-
-+ [Partition storage limits](#service-limits) for each tier, repeated here for context.
-+ Amount of each partition (in GB) available for vector indexes (created when you add vector fields to an index).
-+ Approximate number of embeddings (floating point values) per partition.
+Vector limits vary by service creation date and tier.
 
-Use the [GET Service Statistics](/rest/api/searchservice/get-service-statistics) to retrieve your vector index size quota or review the **Indexes** page or **Usage** tab in the Azure portal.
++ To check the age of your search service or learn more about vector indexes, see [Vector index size and staying under limits](vector-search-index-size.md).
 
-Vector limits vary by service creation date and tier. To check the age of your search service and learn more about vector indexes, see [Vector index size and staying under limits](vector-search-index-size.md).
++ To view the vector quota in effect for your search service, use [GET Service Statistics](/rest/api/searchservice/get-service-statistics), or check the **Properties** and **Usage** tabs for your search service in the Azure portal.
 
-### Vector limits on services created after May 17, 2024
+#### Storage quota (GB)
 
-The highest vector limits are available on search services created after May 17, 2024 in a [supported region](#service-limits).
+This table repeats [partition storage limits](#service-limits) for context. The table shows the progression of storage quota increases in GB over time. Vector quota is per partition, so the increase in vector quota is bound to the increase in per-partition storage for each tier. Higher capacity partitions were brought online starting in April 2024.
 
-| Tier   | Storage quota (GB) | Vector quota per partition (GB) | 
-|--------|--------------------|---------------------------------|
-| Basic | 15                  | 5      |
-| S1    | 160                 | 35     |
-| S2    | 512                 | 150    |
-| S3    | 1,024               | 300    |
-| L1    | 2,048               | 150    |
-| L2    | 4,096               | 300    |
+| Service creation date |Basic | S1| S2 | S3 | L1 | L2 |
+|-----------------------|------|---|----|----|----|----|
+|**Before July 1, 2023** <sup>1</sup> | 2  | 25 | 100 | 200 | 1,000 | 2,000 |
+| **July 1, 2023 through April 3, 2024** <sup>2</sup>| 2  | 25 | 100 | 200 | 1,000 | 2,000 |
+|**April 3, 2024 through May 17, 2024** <sup>3</sup> | 15  | 160 | 350 | 700 | 1,000 | 2,000 |
+|**After May 17, 2024** <sup>4</sup> | 15  | 160 | 512 | 1,024 | 2,048 | 4,096 |
 
-### Vector limits on services created between April 3, 2024 and May 17, 2024
+<sup>1</sup> Partition sizes during early preview.
 
-The following vector limits are available on search services created after April 3, 2024 in a [supported region](#service-limits).
+<sup>2</sup> No change during the later preview period.
 
-| Tier   | Storage quota (GB) | Vector quota per partition (GB) |
-|--------|--------------------|---------|
-| Basic | 15                  | 5       |
-| S1    | 160                 | 35      |
-| S2    | 350                 | 100     |
-| S3    | 700                 | 200     |
-| L1    | 1,000               | 12      |
-| L2    | 2,000               | 36      |
+<sup>3</sup> Higher capacity storage for Basic, S1, S2, S3 in the following regions. **Americas**: Brazil South​, Canada Central​, Canada East​​, East US​, East US 2, ​Central US​, North Central US​, South Central US​, West US​, West US 2​, West US 3​, West Central US. **Europe**: France Central​. Italy North​​, North Europe​​, Norway East, Poland Central​​, Switzerland North​, Sweden Central​, UK South​, UK West​. **Middle East**:  UAE North. **Africa**: South Africa North. **Asia Pacific**: Australia East​, Australia Southeast​​, Central India, Jio India West​, East Asia, Southeast Asia​, Japan East, Japan West​, Korea Central, Korea South​.
 
-Notice that L1 and L2 limits are unchanged in the April 3 rollout.
+<sup>4</sup> Higher capacity storage for more tiers and more regions. **Europe**: Germany North​, Germany West Central, Switzerland West​. **Azure Government**: Texas, Arizona, Virginia. **Africa**: South Africa North​. **Asia Pacific**: China North 3, China East 3.
 
-### Vector limits on services created between July 1, 2023 and April 3, 2024
+#### Vector quota per partition (GB)
 
-The following limits applied to new services created between July 1 and April 3, 2024, except for the following regions, which have the original limits from before July 1, 2023:
+This table shows the progression of vector quota increases in GB over time. The quota is per partition, so if you scale a new Standard (S1) service to 6 partitions, total vector quota is 35 multiplied by 6.
 
-+ Germany West Central
-+ West India
-+ Qatar Central
+| Service creation date |Basic | S1| S2 | S3 | L1 | L2 |
+|-----------------------|------|---|----|----|----|----|
+|**Before July 1, 2023** <sup>1</sup> | 0.5 | 1 | 6 | 12 | 12 | 36 |
+| **July 1, 2023 through April 3, 2024** <sup>2</sup>| 1  | 3 | 12 | 36 | 12 | 36 |
+|**April 3, 2024 through May 17, 2024** <sup>3</sup> | 5  | 35 | 100 | 200 | 12 | 36 |
+|**After May 17, 2024** <sup>4</sup> | 5  | 35 | 150 | 300 | 150 | 300 |
 
-All other regions have these limits:
+<sup>1</sup> Initial vector limits during early preview.
 
-| Tier   | Storage quota (GB) | Vector quota per partition (GB) |
-|--------|--------------------|---------------|
-| Basic | 2                  | 1              |
-| S1    | 25                 | 3              |
-| S2    | 100                | 12             |
-| S3    | 200                | 36             |
-| L1    | 1,000              | 12             |
-| L2    | 2,000              | 36             |
+<sup>2</sup> Vector limits during the later preview period. Three regions didn't have the higher limits: Germany West Central, West India, Qatar Central.
 
-### Vector limits on services created before July 1, 2023
+<sup>3</sup> Higher vector quota based on the larger partitions for supported tiers and regions.
 
-| Tier   | Storage quota (GB) | Vector quota per partition (GB) |
-|--------|--------------------|--------------|
-| Basic | 2                  | 0.5           |
-| S1    | 25                 | 1             |
-| S2    | 100                | 6             |
-| S3    | 200                | 12            |
-| L1    | 1,000              | 12            |
-| L2    | 2,000              | 36            |
+<sup>4</sup> Higher vector quota for more tiers and regions based on partition size updates.
 
 ## Indexer limits
 
diff --git a/articles/search/search-region-support.md b/articles/search/search-region-support.md
@@ -21,7 +21,7 @@ This article identifies the cloud regions in which Azure AI Search is available.
 
 | Feature | Availability |
 |---------|--------------|
-| [Extra capacity](search-limits-quotas-capacity.md#service-limits) | Higher capacity partitions became available in selected regions starting in April 2024 with a second wave following in May 2024. Currently, there are just a few regions that *don't* offer higher capacity partitions. If you're using an older search service, create a new search service to benefit from more capacity at the same billing rate. To check existing capacity, [find your search service](https://portal.azure.com/#blade/HubsExtension/BrowseResourceBlade/resourceType/Microsoft.Search%2FsearchServices) and select the **Properties** tab in the middle of the Overview page. To check search service age, follow [these instructions](vector-search-index-size.md#how-to-check-service-creation-date). Regional support for extra capacity is noted in the footnotes of this article.|
+| [Extra capacity](search-limits-quotas-capacity.md#service-limits) | Higher capacity partitions became available in selected regions starting in April 2024 with a second wave following in May 2024. If you're using an older search service, create a new search service to benefit from more capacity at the same billing rate. To check existing capacity, [find your search service](https://portal.azure.com/#blade/HubsExtension/BrowseResourceBlade/resourceType/Microsoft.Search%2FsearchServices) and select the **Properties** tab in the middle of the Overview page. To check search service age, follow [these instructions](vector-search-index-size.md#how-to-check-service-creation-date).  Currently, there are just a few regions that *don't* offer higher capacity partitions. Regional support for extra capacity is noted in the footnotes of this article.|
 | [Availability zones](search-reliability.md#availability-zone-support) | Divides a region's data centers into distinct physical location groups, providing high-availability within the same geo. Regional support is noted in this article. |
 | [Azure AI enrichment](cognitive-search-concept-intro.md) | Refers to skills that make internal calls to Azure AI for enrichment and transformation during indexing. Integration requires that Azure AI Search coexists with an [Azure AI multi-service account](/azure/ai-services/multi-service-resource) in the same physical region. Regional support is noted in this article. |
 | [Azure OpenAI integration](vector-search-integrated-vectorization.md)  | Refers to skills and vectorizers that make internal calls to deployed embedding and chat models on Azure OpenAI. Check [Azure OpenAI model region availability](/azure/ai-services/openai/concepts/models#model-summary-table-and-region-availability) for the most current list of regions for each embedding and chat model. Specific Azure OpenAI models are in fewer regions, so be sure to check for joint regional availability before installing.|
diff --git a/articles/search/search-reliability.md b/articles/search/search-reliability.md
@@ -6,7 +6,7 @@ author: mattgotteiner
 ms.author: magottei
 ms.service: cognitive-search
 ms.topic: conceptual
-ms.date: 01/02/2024
+ms.date: 09/04/2024
 ms.custom:
   - subject-reliability
   - references_regions
@@ -55,10 +55,9 @@ Availability zones are used when you add two or more replicas to your search ser
 
 ### Supported regions
 
-Support for availability zones depends on infrastructure and storage. Currently, two zones that were announced in October 2023 have insufficient storage and don't provide an availability zone for Azure AI Search:
+Support for availability zones depends on infrastructure and storage. Currently, the following zone has insufficient storage and doesn't provide an availability zone for Azure AI Search:
 
-+ Israel Central
-+ Italy North
++ Japan West
 
 Otherwise, availability zones for Azure AI Search are supported in the following regions:
 
@@ -75,6 +74,8 @@ Otherwise, availability zones for Azure AI Search are supported in the following
 | East US 2 | January 30, 2021 or later |
 | France Central| October 23, 2020 or later |
 | Germany West Central |  May 3, 2021, or later |
+| Israel Central | April 1, 2024, or later |
+| Italy North | April 1, 2024, or later |
 | Japan East | January 30, 2021 or later |
 | Korea Central | January 20, 2022 or later |
 | North Europe | January 28, 2021 or later |
diff --git a/articles/search/search-what-is-azure-search.md b/articles/search/search-what-is-azure-search.md
@@ -41,13 +41,13 @@ On the search service itself, the two primary workloads are *indexing* and *quer
 
 + [**Indexing**](search-what-is-an-index.md) is an intake process that loads content into your search service and makes it searchable. Internally, inbound text is processed into tokens and stored in inverted indexes, and inbound vectors are stored in vector indexes. The document format that Azure AI Search can index is JSON. You can upload JSON documents that you've assembled, or use an indexer to retrieve and serialize your data into JSON. 
 
-  [Applied AI](cognitive-search-concept-intro.md) through a [skillset](cognitive-search-working-with-skillsets.md) extends indexing with image and language models. If you have images or large unstructured text in source document, you can attach skills that perform OCR, analyze and escribe images, infer structure, translate text and more. 
+  [Applied AI](cognitive-search-concept-intro.md) through a [skillset](cognitive-search-working-with-skillsets.md) extends indexing with image and language models. If you have images or large unstructured text in source document, you can attach skills that perform OCR, analyze and describe images, infer structure, translate text and more. Output is text that can be serialized into JSON and ingested into a search index.
 
-  Skillsets can also perform [data chunking and vectorization during indexing](vector-search-integrated-vectorization.md). Skills that attach to Azure OpenAI, the model catalog in Azure AI Studio, or custom skills that attach to any external chunking and embedding model can be used during indexing to create vector data.
+  Skillsets can also perform [data chunking and vectorization during indexing](vector-search-integrated-vectorization.md). Skills that attach to Azure OpenAI, the model catalog in Azure AI Studio, or custom skills that attach to any external chunking and embedding model can be used during indexing to create vector data. Output is chunked vector content that can be ingested into a search index.
 
 + [**Querying**](search-query-overview.md) can happen once an index is populated with searchable content, when your client app sends query requests to a search service and handles responses. All query execution is over a search index that you control.
 
-  [Semantic ranking](semantic-search-overview.md) is an extension of query execution. It adds secondary ranking, using language understanding to reevalute a result set, promoting the most semantically relevant results to the top.
+  [Semantic ranking](semantic-search-overview.md) is an extension of query execution. It adds secondary ranking, using language understanding to reevaluate a result set, promoting the most semantically relevant results to the top.
 
   [Integrated vectorization](vector-search-integrated-vectorization.md) is also an extension of query execution. If you have vector fields in your search index, you can submit raw vector queries or text that's vectorized at query time.
 
@@ -99,7 +99,7 @@ Or, try solution accelerators:
 
 + [**Chat with your data** solution accelerator](https://github.com/Azure-Samples/chat-with-your-data-solution-accelerator) helps you create a custom RAG solution over your content.
 
-+ [**Conversational Knowledge Mining** solution accelerator](https://github.com/microsoft/Customer-Service-Conversational-Insights-with-Azure-OpenAI-Services), helps you create an interactive solution to extract actionable insights from post-contact center transcripts.
++ [**Conversational Knowledge Mining** solution accelerator](https://github.com/microsoft/Customer-Service-Conversational-Insights-with-Azure-OpenAI-Services) helps you create an interactive solution to extract actionable insights from post-contact center transcripts.
 
 + [**Build your own copilot** solution accelerator](https://github.com/microsoft/Build-your-own-copilot-Solution-Accelerator), leverages Azure OpenAI Service, Azure AI Search and Microsoft Fabric, to create custom copilot solutions.
 
diff --git a/articles/search/vector-search-index-size.md b/articles/search/vector-search-index-size.md
diff --git a/articles/search/whats-new.md b/articles/search/whats-new.md