Skip to content

Commit cf81e77

Browse files
Merge pull request #5954 from MicrosoftDocs/main
Auto Publish – main to live - 2025-07-10 05:06 UTC
2 parents d3417cf + b895ec2 commit cf81e77

File tree

10 files changed

+38
-25
lines changed

10 files changed

+38
-25
lines changed

articles/ai-foundry/agents/how-to/tools/model-context-protocol.md

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -43,7 +43,7 @@ You can bring multiple remote MCP servers to Foundry Agent service by adding the
4343
1. Find the remote MCP server you want to connect to, such as GitHub MCP Server. Create or update a Foundry Agent with a `mcp` tool with the following information:
4444
1. `server_url`: the url of the MCP server, for example, `https://api.githubcopilot.com/mcp/`
4545
2. `server_label`: a unique identifier of this MCP server to the agent, for example, `github`
46-
3. `allowed_tools`: optional, a list of tools you want to allow without approval
46+
3. `allowed_tools`: optional, specify the tools that this agent can have access to and use.
4747

4848
1. Create a run and pass additional information about the `mcp` tool in `tool_resources` with headers
4949
1. `tool_label`: use the identifier you provided during create/update agent

articles/ai-services/document-intelligence/concept/markdown-elements.md

Lines changed: 3 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -108,6 +108,9 @@ The Layout API preserves figure elements:
108108
* Preserves figure captions with the `<figcaption>` tag to provide important context
109109
* Preserves figure footnotes as separate paragraphs following the figure container
110110

111+
> [!IMPORTANT]
112+
> In cases where we detect certain document components like section heading as part of the figures, markdown output will not present figures in the output and use the information for document structure analysis. For these cases, enumerate the figures field in JSON to retrieve all the figures.
113+
111114
Here's an example:
112115

113116
``` md

articles/ai-services/document-intelligence/containers/configuration.md

Lines changed: 2 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ ms.author: lajanuar
1616

1717
:::moniker range="doc-intel-2.1.0"
1818

19-
Support for containers is currently available with Document Intelligence version `2022-08-31 (GA)` for all models, `2023-07-31 (GA)` for Read, Layout, Invoice, Receipt, and ID Document models, and `2024-11-30 (GA)` for Layout model:
19+
Support for containers is currently available with Document Intelligence version `2022-08-31 (GA)` for all models, `2023-07-31 (GA)` for Read, Layout, Invoice, Receipt, and ID Document models, and `2024-11-30 (GA)` for Read and Layout model:
2020

2121
* [REST API `2022-08-31 (GA)`](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-v3.0%20(2022-08-31)&preserve-view=true&tabs=HTTP)
2222
* [REST API `2023-07-31 (GA)`](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-v3.1%20(2023-07-31)&tabs=HTTP&preserve-view=true)
@@ -38,7 +38,7 @@ With Document Intelligence containers, you can build an application architecture
3838

3939
> [!IMPORTANT]
4040
>
41-
> Document Intelligence v4.0 container is currently available for Layout model only.
41+
> Document Intelligence v4.0 container is currently available for Read and Layout model only.
4242
4343
## Configuration settings
4444

articles/ai-services/document-intelligence/containers/image-tags.md

Lines changed: 3 additions & 2 deletions
Original file line numberDiff line numberDiff line change
@@ -16,7 +16,7 @@ ms.author: lajanuar
1616

1717
:::moniker range="doc-intel-2.1.0"
1818

19-
Support for containers is currently available with Document Intelligence version `2022-08-31 (GA)` for all models, `2023-07-31 (GA)` for Read, Layout, Invoice, Receipt, and ID Document models, and `2024-11-30 (GA)` for Layout model:
19+
Support for containers is currently available with Document Intelligence version `2022-08-31 (GA)` for all models, `2023-07-31 (GA)` for Read, Layout, Invoice, Receipt, and ID Document models, and `2024-11-30 (GA)` for Read and Layout model:
2020

2121
* [REST API `2022-08-31 (GA)`](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-v3.0%20(2022-08-31)&preserve-view=true&tabs=HTTP)
2222
* [REST API `2023-07-31 (GA)`](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-v3.1%20(2023-07-31)&tabs=HTTP&preserve-view=true)
@@ -81,11 +81,12 @@ The following containers support DocumentIntelligence v3.1 models and features:
8181

8282
Document Intelligence container images can be found within the [**Microsoft Artifact Registry** (also know as Microsoft Container Registry(MCR))](https://mcr.microsoft.com/catalog?search=document%20intelligence), the primary registry for all Microsoft published container images.
8383

84-
The following containers support DocumentIntelligence v3.1 models and features:
84+
The following containers support Document Intelligence v4.0 models and features:
8585

8686
| Container name |image |
8787
|---|---|
8888
| [**Layout 4.0**](https://mcr.microsoft.com/en-us/product/azure-cognitive-services/form-recognizer/layout-4.0/tags) |`mcr.microsoft.com/azure-cognitive-services/form-recognizer/layout-4.0:latest`|
89+
| [**Read 4.0**](https://mcr.microsoft.com/product/azure-cognitive-services/form-recognizer/read-4.0/tags) |`mcr.microsoft.com/azure-cognitive-services/form-recognizer/read-4.0:latest`|
8990
::: moniker-end
9091

9192

articles/ai-services/document-intelligence/containers/install-run.md

Lines changed: 3 additions & 3 deletions
Original file line numberDiff line numberDiff line change
@@ -23,15 +23,15 @@ Azure AI Document Intelligence is an Azure AI service that lets you build automa
2323

2424
In this article you can learn how to download, install, and run Document Intelligence containers. Containers enable you to run the Document Intelligence service in your own environment. Containers are great for specific security and data governance requirements.
2525

26-
* **Layout** model is supported by Document Intelligence v4.0 containers.
26+
* **Read**, **Layout** model is supported by Document Intelligence v4.0 containers.
2727

2828
* **Read**, **Layout**, **ID Document**, **Receipt**, and **Invoice** models are supported by Document Intelligence v3.1 containers.
2929

3030
* **Read**, **Layout**, **General Document**, **Business Card**, and **Custom** models are supported by Document Intelligence v3.0 containers.
3131

3232
## Version support
3333

34-
Support for containers is currently available with Document Intelligence version `v3.0: 2022-08-31 (GA)` for all models, `v3.1 2023-07-31 (GA)` for Read, Layout, ID Document, Receipt, and Invoice models, and `v4.0 2024-11-30 (GA)` for Layout:
34+
Support for containers is currently available with Document Intelligence version `v3.0: 2022-08-31 (GA)` for all models, `v3.1 2023-07-31 (GA)` for Read, Layout, ID Document, Receipt, and Invoice models, and `v4.0 2024-11-30 (GA)` for Read and Layout:
3535

3636
* [REST API `v3.0: 2022-08-31 (GA)`](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-v3.0%20(2022-08-31)&preserve-view=true&tabs=HTTP)
3737
* [REST API `v3.1: 2023-07-31 (GA)`](/rest/api/aiservices/document-models/analyze-document?view=rest-aiservices-v3.1%20(2023-07-31)&tabs=HTTP&preserve-view=true)
@@ -171,7 +171,7 @@ version: "3.9"
171171
services:
172172
azure-form-recognizer-read:
173173
container_name: azure-form-recognizer-read
174-
image: mcr.microsoft.com/azure-cognitive-services/form-recognizer/read-3.1
174+
image: mcr.microsoft.com/azure-cognitive-services/form-recognizer/read-4.0
175175
environment:
176176
- EULA=accept
177177
- billing={FORM_RECOGNIZER_ENDPOINT_URI}

articles/ai-services/document-intelligence/whats-new.md

Lines changed: 8 additions & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -27,8 +27,15 @@ Document Intelligence service is updated on an ongoing basis. Bookmark this page
2727
> [!IMPORTANT]
2828
> Preview API versions are retired once the GA API is released. The 2023-02-28-preview API version is retiring. If you're still using the preview API or the associated SDK versions, update your code to target the latest API version `2024-11-30 (GA)`. </br>
2929
30+
## June 2025
31+
**Document Intelligence v4.0 Read container is now available!**
32+
<br>
33+
This container image includes highly requested Read features like searchable PDF! For more information, *see:*
34+
* [Install and run containers](containers/install-run.md?view=doc-intel-4.0.0&preserve-view=true)
35+
* [Container image tags](containers/image-tags.md?view=doc-intel-4.0.0&preserve-view=true)
36+
3037
## April 2025
31-
**Document Intelligence v4.0 container is now available! Currently, Layout model is the only supported model for v4.0 release.**
38+
**Document Intelligence v4.0 Layout container is now available!**
3239
<br>
3340
For more information, *see:*
3441
* [Install and run containers](containers/install-run.md?view=doc-intel-4.0.0&preserve-view=true)

articles/ai-services/speech-service/speech-synthesis-markup-voice.md

Lines changed: 4 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -6,7 +6,7 @@ author: eric-urban
66
manager: nitinme
77
ms.service: azure-ai-speech
88
ms.topic: how-to
9-
ms.date: 3/10/2025
9+
ms.date: 7/9/2025
1010
ms.author: eur
1111
ms.custom: references_regions
1212
#Customer intent: As a developer, I want to learn how to use Speech Synthesis Markup Language (SSML) elements to customize what my Speech service voice sounds like.
@@ -559,9 +559,9 @@ This SSML snippet illustrates how to request blend shapes with your synthesized
559559

560560
## Voice conversion element
561561

562-
Voice conversion is the process of transforming the voice characteristics of a given audio to a target voice speaker. After voice conversion, the resulting audio reserves source audio's linguistic content and prosody while the voice timbre sounds like the target speaker. For more information, see [voice conversion](./voice-conversion.md).
562+
Voice conversion (preview) is the process of transforming the voice characteristics of a given audio to a target voice speaker. After voice conversion, the resulting audio reserves source audio's linguistic content and prosody while the voice timbre sounds like the target speaker. For more information, see [voice conversion](./voice-conversion.md).
563563

564-
Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md#voice-conversion).
564+
Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion).
565565

566566
The following table describes the usage of the `mstts:voiceconversion` element's attributes:
567567

@@ -573,7 +573,7 @@ Here's how the voice conversion works:
573573
- The source audio is a prerecorded audio file that contains the spoken words and prosody.
574574
- Text content: The final synthesized speech follows the spoken words in the source audio.
575575
- Prosody and rhythm: The speech maintains the timing and intonation from the source.
576-
- The `<voice>` tag specifies the target voice used for the output audio. For information about the supported target voices, see [supported voices for voice conversion](./language-support.md#voice-conversion).
576+
- The `<voice>` tag specifies the target voice used for the output audio. For information about the supported target voices, see [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion).
577577
- The output audio keeps the timbre (tone and voice quality) of the target voice, but follows the text and speaking style of the source audio.
578578

579579
> [!NOTE]

articles/ai-services/speech-service/toc.yml

Lines changed: 1 addition & 1 deletion
Original file line numberDiff line numberDiff line change
@@ -136,7 +136,7 @@ items:
136136
- name: High definition (HD) voices
137137
href: high-definition-voices.md
138138
displayName: hd voice
139-
- name: Voice conversion
139+
- name: Voice conversion (preview)
140140
href: voice-conversion.md
141141
displayName: voice conversion
142142
- name: Custom voice

articles/ai-services/speech-service/voice-conversion.md

Lines changed: 6 additions & 4 deletions
Original file line numberDiff line numberDiff line change
@@ -8,11 +8,13 @@ ms.reviewer: eur
88
manager: nitinme
99
ms.service: azure-ai-speech
1010
ms.topic: overview
11-
ms.date: 7/1/2025
11+
ms.date: 7/9/2025
1212
ms.custom: references_regions
1313
---
1414

15-
# What is voice conversion?
15+
# What is voice conversion? (Preview)
16+
17+
[!INCLUDE [Feature preview](./includes/previews/preview-generic.md)]
1618

1719
Voice conversion is the process of transforming the voice characteristics of a given audio to a target voice speaker. After voice conversion, the resulting audio reserves source audio's linguistic content and prosody while the voice timbre sounds like the target speaker.
1820

@@ -31,13 +33,13 @@ Voice conversion (or voice changer or speech to speech conversion) is built on s
3133
| **High speaker similarity** | Captures the timbre and vocal identity of the target speaker.<br>Generates audio that accurately matches the target voice. |
3234
| **Prosody preservation** | Maintains rhythm, stress, and intonation of source audio.<br>Preserves expressive and emotional qualities. |
3335
| **High audio fidelity** | Generates realistic, natural-sounding audio.<br>Minimizes artifacts. |
34-
| **Multilingual support** | Enables multilingual voice conversion.<br>Supports 91 locales (same as standard text to speech locale support).<br>See [supported voices for voice conversion](./language-support.md#voice-conversion) for the complete list. |
36+
| **Multilingual support** | Enables multilingual voice conversion.<br>Supports 91 locales (same as standard text to speech locale support).<br>See [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion) for the complete list. |
3537

3638
## Use voice conversion
3739

3840
You can use Azure AI Speech voice conversion with either the Speech SDK or text to speech REST APIs.
3941

40-
Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md#voice-conversion).
42+
Use the `<mstts:voiceconversion>` tag via Speech Synthesis Markup Language (SSML) to specify the source audio URL and the target voice for the conversion. For a complete list of supported target voices, see [supported voices for voice conversion](./language-support.md?tabs=tts#voice-conversion).
4143

4244
### Example SSML
4345

articles/search/cognitive-search-skill-document-intelligence-layout.md

Lines changed: 7 additions & 7 deletions
Original file line numberDiff line numberDiff line change
@@ -11,14 +11,14 @@ ms.custom:
1111
- references_regions
1212
- ignite-2024
1313
ms.topic: reference
14-
ms.date: 06/10/2025
14+
ms.date: 07/10/2025
1515
---
1616

1717
# Document Layout skill
1818

1919
[!INCLUDE [Feature preview](./includes/previews/preview-generic.md)]
2020

21-
The **Document Layout** skill analyzes a document to detect structure and characteristics, and produces a syntactical representation of the document in Markdown or Text format. You can use it to extract text and images, where image extraction includes location metadata that preserves image position within the document. Image proximity to related content adds value to Retrieval Augmented Generation (RAG) workloads and [multimodal search](multimodal-search-overview.md).
21+
The **Document Layout** skill analyzes a document to detect structure and characteristics, and produces a syntactical representation of the document in Markdown or Text format. You can use it to extract text and images, where image extraction includes location metadata that preserves image position within the document. Image proximity to related content is beneficial in Retrieval Augmented Generation (RAG) workloads and [multimodal search](multimodal-search-overview.md) scenarios.
2222

2323
This article is the reference documentation for the Document Layout skill. For usage information, see [How to chunk and vectorize by document layout](search-how-to-semantic-chunking.md).
2424

@@ -46,11 +46,11 @@ The Document Layout skill calls the [Document Intelligence Public preview versio
4646

4747
Supported regions vary by modality and how the skill connects to the Document Intelligence layout model.
4848

49-
| Approach | Regions | Requirement |
50-
|----------|---------|-------------|
51-
| [Import and vectorize data wizard](search-import-data-portal.md) | **East US**, **West Europe**, **North Central US** | Create an Azure AI multi-service resource in one of these regions to get the portal experience. |
52-
| Programmatic, using a [keyless connection (preview)](cognitive-search-attach-cognitive-services.md#bill-through-a-keyless-connection) for billing | Varies by resource | Create Azure AI Search in one of these regions: **East US**, **West Europe**, **North Central US**, **West US 2**. <br>Access Document Intelligence through an Azure AI multi-service resource in any region listed in the [Product availability by region](https://azure.microsoft.com/explore/global-infrastructure/products-by-region/table) table.|
53-
| Programmatic, using a [multi-service resource API key](cognitive-search-attach-cognitive-services.md#bill-through-a-keyless-connection) for billing | **East US**, **West Europe**, **North Central US**, **West US 2** | Create your Azure AI Search service and AI multi-service resource in the same region. |
49+
| Approach | Requirement |
50+
|----------|-------------|
51+
| [Import and vectorize data wizard](search-import-data-portal.md) | Create an Azure AI multi-service resource in one of these regions to get the portal experience: **East US**, **West Europe**, **North Central US**. |
52+
| Programmatic, using [Microsoft Entra ID authentication (preview)](cognitive-search-attach-cognitive-services.md#bill-through-a-keyless-connection) for billing | Create Azure AI Search in one of these regions: **East US**, **West Europe**, **North Central US**, **West US 2**. <br>Create the Azure AI multi-service resource in any region listed in the [Product availability by region](https://azure.microsoft.com/explore/global-infrastructure/products-by-region/table) table.|
53+
| Programmatic, using a [multi-service resource API key](cognitive-search-attach-cognitive-services.md#bill-through-a-keyless-connection) for billing | Create your Azure AI Search service and AI multi-service resource in the same region: **East US**, **West Europe**, **North Central US**, **West US 2**. |
5454

5555
## Supported file formats
5656

0 commit comments

Comments
 (0)