Skip to content

Commit 05a2e1f

Browse files
committed
Merge branch 'main' of https://github.com/MicrosoftDocs/azure-docs-pr into apimllm
2 parents e7b403a + 41780f9 commit 05a2e1f

File tree

184 files changed

+2142
-1159
lines changed

Some content is hidden

Large Commits have some content hidden by default. Use the searchbox below for content that may be hidden.

184 files changed

+2142
-1159
lines changed

articles/ai-services/document-intelligence/concept-read.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -122,7 +122,7 @@ The searchable PDF capability enables you to convert an analog PDF, such as scan
122122
> [!IMPORTANT]
123123
>
124124
> * Currently, the searchable PDF capability is only supported by Read OCR model `prebuilt-read`. When using this feature, please specify the `modelId` as `prebuilt-read`, as other model types will return error for this preview version.
125-
> * Searchable PDF is included with the 2024-07-31-preview `prebuilt-read` model with no usage cost for general PDF consumption.
125+
> * Searchable PDF is included with the 2024-07-31-preview `prebuilt-read` model with no additional cost for generating a searchable PDF output.
126126
127127
### Use searchable PDF
128128

@@ -135,7 +135,7 @@ POST /documentModels/prebuilt-read:analyze?output=pdf
135135
202
136136
```
137137

138-
Once the `Analyze` operation is complete, make a `GET` request to retrieve the `Analyze` operation results.
138+
Poll for completion of the `Analyze` operation. Once the operation is complete, issue a `GET` request to retrieve the PDF format of the `Analyze` operation results .
139139

140140
Upon successful completion, the PDF can be retrieved and downloaded as `application/pdf`. This operation allows direct downloading of the embedded text form of PDF instead of Base64-encoded JSON.
141141

articles/ai-services/document-intelligence/faq.yml

Lines changed: 17 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -7,7 +7,7 @@ metadata:
77
ms.service: azure-ai-document-intelligence
88
ms.custom: references_regions
99
ms.topic: faq
10-
ms.date: 07/09/2024
10+
ms.date: 08/08/2024
1111
ms.author: lajanuar
1212
title: Frequently asked questions
1313
summary: |
@@ -52,22 +52,23 @@ sections:
5252
answer: |
5353
**Yes.**
5454
55-
You can use a document generative AI solution to chat with your documents, generate captivating content from those documents, and access Azure OpenAI Service models on your data.
55+
Document Intelligence now includes [custom generative](concept-custom.md) a new type of extraction model that uses Generative AI and large language models (LLMs) to extract fields from documents. In the past you've had to use a RAG (retrieval augmented generation) pattern to extract fields. The new model provides high quality results with a single API call.
56+
You can also use a document generative AI solution to chat with your documents (RAG), generate captivating content from those documents, and access Azure OpenAI Service models on your data.
5657
57-
- With Azure AI Document Intelligence and Azure OpenAI combined, you can build an enterprise application to seamlessly interact with your documents by using natural languages. You can easily find answers, gain valuable insights, and generate new and engaging content from existing documents.
58+
- With Azure AI Document Intelligence and Azure OpenAI combined, you can build an enterprise application to seamlessly interact with your documents using natural language. You can easily find answers, gain valuable insights, and generate new and engaging content from existing documents.
5859
59-
- You can find more details in the [technical community blog](https://techcommunity.microsoft.com/t5/azure-ai-services-blog/document-generative-ai-the-power-of-azure-ai-document/ba-p/3875015).
60+
- You can find more details on the [retrieval augmented generation pattern here](concept-retrieval-augmented-generation.md).
6061
6162
- question: |
6263
Can Document Intelligence help with semantic chunking within documents for retrieval-augmented generation?
6364
answer: |
6465
**Yes.**
6566
66-
Document Intelligence can provide the building blocks to enable semantic chunking. Semantic chunking is a key step in retrieval-augmented generation (RAG) to ensure its efficient storage and retrieval.
67+
Document Intelligence can provide the building blocks to enable semantic chunking. Semantic chunking is a key step in retrieval-augmented generation (RAG) to ensure context dense chunks and relevence improvement.
6768
68-
- Document Intelligence provides a layout model that segments documents into coherent units based on their semantic content.
69+
- Document Intelligence provides a layout model that provides an visual decomposition of the document into lines, paragraphs, sections, headers and footers.
6970
70-
- You can then export the obtained information to Markdown format, so that you can customize your semantic segmentation strategy based on the available building blocks.
71+
- You can then choose to retrieve the results in markdown format, to further chunk the document on section or paragraph boundaries.
7172
7273
For more information, see [overview of RAG in Document Intelligence](concept-retrieval-augmented-generation.md)
7374
@@ -288,11 +289,17 @@ sections:
288289
The copy operation is limited to copying models within the specific cloud environment where you trained the model. For instance, copying models from the public cloud to the Azure Government cloud isn't supported.
289290
290291
- question: |
291-
Am I charged when training a custom neural model?
292+
Am I charged when training a custom models?
292293
answer: |
293-
**Yes for the first 10 hours only.**
294+
**Yes**
294295
295-
If training hours exceed 10 hours, charges are incurred for further custom neural model training.
296+
Although training is free for all custom generative and custom template models, creating the training dataset for all models requires running the Layout model on the training documents. Customers are responsible for this cost.
297+
298+
Custom generative models also rely on the auto label feature to speed up the generation of the labeled dataset. There is a cost associated with this action. While the build operation for template and generative models is free, creating the labeled dataset can result in some minimal costs.
299+
300+
Custom neural models have a limit on the number of models/the amount of time that models can be trained for free. The first 10 hours of training are free. If training a single model for longer than 10 hours or training multiple models that exceed the 10 hour limit, you will need to enable paid training by setting a training budget. See [training a custom neural model](concept-custom-neural.md) for details.
301+
302+
For v3.0 or v3.1 models the paid training tier only applies to additional models, the training time per model is not configurable.
296303
297304
- name: Storage account
298305
questions:

articles/ai-services/document-intelligence/whats-new.md

Lines changed: 10 additions & 10 deletions
Original file line numberDiff line numberDiff line change
@@ -37,12 +37,10 @@ The Document Intelligence [**2024-07-31-preview**](/rest/api/aiservices/document
3737
* **West Europe**
3838
* **North Central US**
3939

40-
* [Read model](concept-read.md) now supports [PDF output](concept-read.md#searchable-pdf) to download PDFs with embedded text from extraction results, allowing for PDF to be utilized in scenarios such as search and large language model ingestion.
41-
* [Layout model](concept-layout.md) now supports improved [figure detection](concept-layout.md#figures) where figures from documents can now be downloaded as an image file to be used for further figure understanding.
42-
* [Custom extraction models](concept-custom.md#custom-extraction-models)
43-
* Custom extraction models now support updating the model in-place.
44-
* [🆕 Custom generative (Document field extraction) model](concept-custom-generative.md)
45-
* Document Intelligence now offers new custom generative model that utilizes generative AI to extract fields from unstructured documents or structured forms with a wide variety of visual templates.
40+
* [🆕 Document field extraction (custom generative) model](concept-custom-generative.md)
41+
* Use **Generative AI** to extract fields from documents and forms. Document Intelligence now offers a new document field extraction model that utilizes large language models (LLMs) to extract fields from unstructured documents or structured forms with a wide variety of visual templates. With grounded values and confidence scores, the new Generative AI based extraction fits into your existing processes.
42+
* [🆕 Model compose with custom classifiers](concept-composed-models.md)
43+
* Document Intelligence now adds support for composing model with an explicit custom classification model. [Learn more about the benefits](concept-composed-models.md) of using the new compose capability.
4644
* [Custom classification model](concept-custom.md#custom-classification-model)
4745
* Custom classification model now supports updating the model in-place as well.
4846
* Custom classification model adds support for model copy operation to enable backup and disaster recovery.
@@ -58,12 +56,14 @@ The Document Intelligence [**2024-07-31-preview**](/rest/api/aiservices/document
5856
* New prebuilt to extract account information including beginning and ending balances, transaction details from bank statements.​
5957
* [🆕 US Tax model](concept-tax-document.md)
6058
* New unified US tax model that can extract from forms such as W-2, 1098, 1099, and 1040.
59+
* 🆕 Searchable PDF. The [prebuilt read](concept-read.md) model now supports [PDF output](concept-read.md#searchable-pdf) to download PDFs with embedded text from extraction results, allowing for PDF to be utilized in scenarios such as search copy of contents.
60+
* [Layout model](concept-layout.md) now supports improved [figure detection](concept-layout.md#figures) where figures from documents can now be downloaded as an image file to be used for further figure understanding. The layout model also features improvements to the OCR model for scanned text targeting improvements for single characters, boxed text and dense text documents.
61+
* [🆕 Batch API](concept-batch-analysis.md)
62+
* Document Intelligence now adds support for batch analysis operation to support analyzing a set of documents to simplify developer experience and improve efficiency.
6163
* [Add-on capabilities](concept-add-on-capabilities.md)
6264
* [Query fields](concept-add-on-capabilities.md#query-fields) AI quality of extraction is improved with the latest model.
63-
* [🆕 Batch API](concept-batch-analysis.md)
64-
* Document Intelligence now adds support for batch analysis operation to support analyzing a set of documents to simplify developer experience and improve service efficiency.
65-
* [🆕 Model compose with custom classifiers](concept-composed-models.md)
66-
* Document Intelligence now adds support for composing model with an explicit custom classification model.
65+
66+
6767

6868
## May 2024
6969

articles/ai-services/openai/quotas-limits.md

Lines changed: 12 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -10,7 +10,7 @@ ms.custom:
1010
- ignite-2023
1111
- references_regions
1212
ms.topic: conceptual
13-
ms.date: 08/06/2024
13+
ms.date: 08/08/2024
1414
ms.author: mbullwin
1515
---
1616

@@ -91,13 +91,23 @@ Global Standard deployments use Azure's global infrastructure, dynamically routi
9191

9292
The Usage Limit determines the level of usage above which customers might see larger variability in response latency. A customer’s usage is defined per model and is the total tokens consumed across all deployments in all subscriptions in all regions for a given tenant.
9393

94+
> [!NOTE]
95+
> Usage tiers only apply to standard and global standard deployment types. Usage tiers do not apply to global batch deployments.
96+
9497
#### GPT-4o global standard & standard
9598

9699
|Model| Usage Tiers per month |
97100
|----|----|
98-
|`gpt-4o` |1.5 Billion tokens |
101+
|`gpt-4o` | 8 Billion tokens |
99102
|`gpt-4o-mini` | 45 Billion tokens |
100103

104+
#### GPT-4 standard
105+
106+
|Model| Usage Tiers per month|
107+
|---|---|
108+
| `gpt-4` + `gpt-4-32k` (all versions) | 4 Billion |
109+
110+
101111
## Other offer types
102112

103113
If your Azure subscription is linked to certain [offer types](https://azure.microsoft.com/support/legal/offer-details/) your max quota values are lower than the values indicated in the above tables.

articles/ai-services/speech-service/includes/release-notes/release-notes-containers.md

Lines changed: 18 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -2,10 +2,27 @@
22
author: eric-urban
33
ms.service: azure-ai-speech
44
ms.topic: include
5-
ms.date: 2/16/2024
5+
ms.date: 8/7/2024
66
ms.author: eur
77
---
88

9+
### 2024-August release
10+
11+
Add support for the latest model versions:
12+
- Speech language identification 1.14.0
13+
- Upgrade .Net 8.0
14+
- Mitigate Vulnerabilities
15+
- Neural text to speech 3.3.0
16+
- Upgrade .Net 8.0
17+
- Mitigate Vulnerabilities
18+
- Speech to text 4.18.0
19+
- Upgrade .Net 8.0
20+
- Mitigate Vulnerabilities
21+
- Upgrade Recognition Engine
22+
- Fix the issue where `PropertyId.Speech_SegmentationSilenceTimeoutMs` was being ignored.
23+
- Update Deprecated Models
24+
- Remove the `uk-UA` locale
25+
926
### 2024-February release
1027

1128
Add support for the latest model versions:

articles/ai-services/speech-service/speech-container-overview.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: how-to
9-
ms.date: 1/22/2024
9+
ms.date: 8/7/2024
1010
ms.author: eur
1111
keywords: on-premises, Docker, container
1212
---
@@ -21,9 +21,9 @@ The following table lists the Speech containers available in the Microsoft Conta
2121

2222
| Container | Features | Supported versions and locales |
2323
|--|--|--|
24-
| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results. | Latest: 4.7.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
25-
| [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.7.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
26-
| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.13.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
24+
| [Speech to text](speech-container-stt.md) | Transcribes continuous real-time speech or batch audio recordings with intermediate results. | Latest: 4.8.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list).|
25+
| [Custom speech to text](speech-container-cstt.md) | Using a custom model from the [custom speech portal](https://speech.microsoft.com/customspeech), transcribes continuous real-time speech or batch audio recordings into text with intermediate results. | Latest: 4.8.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/custom-speech-to-text/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/speech-to-text/tags/list). |
26+
| [Speech language identification](speech-container-lid.md)<sup>1, 2</sup> | Detects the language spoken in audio files. | Latest: 1.14.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/language-detection/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/language-detection/tags/list). |
2727
| [Neural text to speech](speech-container-ntts.md) | Converts text to natural-sounding speech by using deep neural network technology, which allows for more natural synthesized speech. | Latest: 3.3.0<br/><br/>For all supported versions and locales, see the [Microsoft Container Registry (MCR)](https://mcr.microsoft.com/product/azure-cognitive-services/speechservices/neural-text-to-speech/tags) and [JSON tags](https://mcr.microsoft.com/v2/azure-cognitive-services/speechservices/neural-text-to-speech/tags/list). |
2828

2929
<sup>1</sup> The container is available in public preview. Containers in preview are still under development and don't meet Microsoft's stability and support requirements.

articles/ai-studio/how-to/configure-managed-network.md

Lines changed: 13 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -27,6 +27,7 @@ You need to configure following network isolation configurations.
2727
- Create private endpoint outbound rules to your private Azure resources. Private Azure AI Search isn't supported yet.
2828
- If you use Visual Studio Code integration with allow only approved outbound mode, create FQDN outbound rules described in the [use Visual Studio Code](#scenario-use-visual-studio-code) section.
2929
- If you use HuggingFace models in Models with allow only approved outbound mode, create FQDN outbound rules described in the [use HuggingFace models](#scenario-use-huggingface-models) section.
30+
- If you use one of the open-source models with allow only approved outbound mode, create FQDN outbound rules described in the [curated by Azure AI](#scenario-curated-by-azure-ai) section.
3031

3132
## Network isolation architecture and isolation modes
3233

@@ -759,6 +760,18 @@ If you plan to use __HuggingFace models__ with the hub, add outbound _FQDN_ rule
759760
* cnd.auth0.com
760761
* cdn-lfs.huggingface.co
761762

763+
### Scenario: Curated by Azure AI
764+
765+
These models involve dynamic installation of dependencies at runtime, and reequire outbound _FQDN_ rules to allow traffic to the following hosts:
766+
767+
*.anaconda.org
768+
*.anaconda.com
769+
anaconda.com
770+
pypi.org
771+
*.pythonhosted.org
772+
*.pytorch.org
773+
pytorch.org
774+
762775
## Private endpoints
763776

764777
Private endpoints are currently supported for the following Azure services:

articles/ai-studio/how-to/deploy-models-cohere-rerank.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -92,7 +92,7 @@ To create a deployment:
9292
1. Alternatively, you can initiate a deployment by starting from your project in AI Studio.
9393

9494
1. From the left sidebar of your project, select **Components** > **Deployments**.
95-
1. Select **+ Create deployment**.
95+
1. Select **+ Deploy model**.
9696
1. Search for and select **Cohere-rerank-3-english**. to open the Model Details page.
9797
1. Select **Confirm** to open a serverless API deployment window for the model.
9898

articles/ai-studio/how-to/deploy-models-openai.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -48,7 +48,7 @@ Alternatively, you can initiate deployment by starting from your project in AI S
4848

4949
1. Go to your project in AI Studio.
5050
1. Select **Components** > **Deployments**.
51-
1. Select **+ Create deployment**.
51+
1. Select **+ Deploy model**.
5252
1. In the **Collections** filter, select **Azure OpenAI**.
5353
1. Select a model such as `gpt-4` from the Azure OpenAI collection.
5454
1. Select **Confirm** to open the deployment window.

articles/ai-studio/how-to/deploy-models-serverless-availability.md

Lines changed: 4 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -24,7 +24,10 @@ Certain models in the model catalog can be deployed as a serverless API with pay
2424

2525
## Region availability
2626

27-
[!INCLUDE [region-availabilitity-serverless-api](~/reusable-content/ce-skilling/azure/includes/ai-studio/includes/region-availabilitity-serverless-api.md)]
27+
Pay-as-you-go billing is available only to users whose Azure subscription belongs to a billing account in a country where the model provider has made the offer available (see "offer availability region" in the table in the next section). If the offer is available in the relevant region, the user then must have a Hub/Project in the Azure region where the model is available for deployment or fine-tuning, as applicable (see "Hub/Project Region" columns in the following tables).
28+
29+
[!INCLUDE [region-availability-maas](../includes/region-availability-maas.md)]
30+
2831

2932
## Alternatives to region availability
3033

0 commit comments

Comments
 (0)