Commit e9928e6

update read
1 parent bfe7f38 commit e9928e6

1 file changed

+43
-43
lines changed

articles/ai-services/document-intelligence/prebuilt/read.md

Lines changed: 43 additions & 43 deletions
Original file line numberDiff line numberDiff line change
@@ -26,7 +26,7 @@ ms.author: lajanuar
2626

2727
> [!NOTE]
2828
>
29-
> For extracting text from external images like labels, street signs, and posters, use the [Azure AI Image Analysis v4.0 Read](../../Computer-vision/concept-ocr.md) feature optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in real-time user experience scenarios.
29+
> To extract text from external images like labels, street signs, and posters, use the [Azure AI Image Analysis v4.0 Read](../../Computer-vision/concept-ocr.md) feature optimized for general (not document) images with a performance-enhanced synchronous API. This capability makes it easier to embed OCR in real-time user experience scenarios.
3030
>
3131
3232
Document Intelligence Read Optical Character Recognition (OCR) model runs at a higher resolution than Azure AI Vision Read and extracts print and handwritten text from PDF documents and scanned images. It also includes support for extracting text from Microsoft Word, Excel, PowerPoint, and HTML documents. It detects paragraphs, text lines, words, locations, and languages. The Read model is the underlying OCR engine for other Document Intelligence prebuilt models like Layout, General Document, Invoice, Receipt, Identity (ID) document, Health insurance card, W2 in addition to custom models.
@@ -83,11 +83,11 @@ See our [Language Support—document analysis models](../language-support/ocr.md
8383
## Data extraction (v4)
8484

8585
> [!NOTE]
86-
> Microsoft Word and HTML file are supported in v4.0. Compared with PDF and images, below features are not supported:
86+
> Microsoft Word and HTML files are supported in v4.0. The following capabilities are currently not supported:
8787
>
88-
> * There are no angle, width/height and unit with each page object.
89-
> * For each object detected, there is no bounding polygon or bounding region.
90-
> * Page range (`pages`) is not supported as a parameter.
88+
> * No angle, width/height, and unit returned with each page object.
89+
> * No bounding polygon or bounding region for each object detected.
90+
> * No support for page range (`pages`) as a parameter.
9191
> * No `lines` object.
9292
9393
## Searchable PDFs
@@ -96,8 +96,8 @@ The searchable PDF capability enables you to convert an analog PDF, such as scan
9696

9797
> [!IMPORTANT]
9898
>
99-
> * Currently, the searchable PDF capability is only supported by Read OCR model `prebuilt-read`. When using this feature, please specify the `modelId` as `prebuilt-read`, as other model types will return error for this preview version.
100-
> * Searchable PDF is included with the 2024-11-30 GA `prebuilt-read` model with no additional cost for generating a searchable PDF output.
99+
> * Currently, only the Read OCR model `prebuilt-read` supports the searchable PDF capability. When using this feature, specify the `modelId` as `prebuilt-read`. Other model types return an error for this preview version.
100+
> * Searchable PDF is included with the `2024-11-30` GA `prebuilt-read` model with no added cost for generating a searchable PDF output.
101101
102102
### Use searchable PDFs
103103

@@ -124,41 +124,41 @@ Upon successful completion, the PDF can be retrieved and downloaded as `applicat
124124
// Upon successful completion, retrieve the PDF as application/pdf.
125125
GET {endpoint}/documentintelligence/documentModels/prebuilt-read/analyzeResults/{resultId}/pdf?api-version=2024-11-30
126126
URI Parameters
127-
Name In Required Type Description
128-
endpoint path True
127+
Name In Required Type Description
128+
endpoint path True
129129
string
130130

131-
uri
131+
uri
132132
The Document Intelligence service endpoint.
133133

134-
modelId path True
134+
modelId path True
135135
string
136136

137137
Unique document model name.
138138

139139
Regex pattern: ^[a-zA-Z0-9][a-zA-Z0-9._~-]{1,63}$
140140

141-
resultId path True
141+
resultId path True
142142
string
143143

144-
uuid
144+
uuid
145145
Analyze operation result ID.
146146

147-
api-version query True
147+
api-version query True
148148
string
149149

150150
The API version to use for this operation.
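
The URI parameters above can be combined into a request URL along the following lines. This is a minimal sketch, not part of the documented API surface: the helper name `build_pdf_result_url` is hypothetical, the path template and the `2024-11-30` version are taken from the `GET` line in this file, and the regular expression is copied from the `modelId` row.

```python
import re
from urllib.parse import quote, urlencode

# Pattern copied from the modelId row of the URI parameter table.
MODEL_ID_PATTERN = re.compile(r"^[a-zA-Z0-9][a-zA-Z0-9._~-]{1,63}$")


def build_pdf_result_url(
    endpoint: str,
    result_id: str,
    model_id: str = "prebuilt-read",
    api_version: str = "2024-11-30",
) -> str:
    """Hypothetical helper: assemble the GET URL for a searchable PDF result."""
    # fullmatch rejects strings with a trailing newline, which "$" alone would allow.
    if not MODEL_ID_PATTERN.fullmatch(model_id):
        raise ValueError(f"invalid modelId: {model_id!r}")
    path = (
        f"/documentintelligence/documentModels/{quote(model_id, safe='')}"
        f"/analyzeResults/{quote(result_id, safe='')}/pdf"
    )
    query = urlencode({"api-version": api_version})
    return f"{endpoint.rstrip('/')}{path}?{query}"
```

The default `model_id` reflects the note that only `prebuilt-read` currently supports searchable PDF output. Authentication headers are intentionally left out of the sketch.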
151151

152152
Responses
153-
Name Type Description
154-
200 OK
153+
Name Type Description
154+
200 OK
155155
file
156156

157157
The request has succeeded.
158158

159159
Media Types: "application/pdf", "application/json"
160160

161-
Other Status Codes
161+
Other Status Codes
162162
DocumentIntelligenceErrorResponse
163163

164164
An unexpected error response.
@@ -177,8 +177,8 @@ Authorization URL: https://login.microsoftonline.com/common/oauth2/authorize
177177
Token URL: https://login.microsoftonline.com/common/oauth2/token
178178

179179
Scopes
180-
Name Description
181-
https://cognitiveservices.azure.com/.default
180+
Name Description
181+
https://cognitiveservices.azure.com/.default
182182
Examples
183183
Get Analyze Document Result PDF
184184
Sample request
@@ -195,69 +195,69 @@ JSON
195195
Copy
196196
"{pdfBinary}"
197197
Definitions
198-
Name Description
199-
DocumentIntelligenceError
198+
Name Description
199+
DocumentIntelligenceError
200200
The error object.
201201

202-
DocumentIntelligenceErrorResponse
202+
DocumentIntelligenceErrorResponse
203203
Error response object.
204204

205-
DocumentIntelligenceInnerError
205+
DocumentIntelligenceInnerError
206206
An object containing more specific information about the error.
207207

208208
DocumentIntelligenceError
209209
The error object.
210210

211-
Name Type Description
212-
code
211+
Name Type Description
212+
code
213213
string
214214

215215
One of a server-defined set of error codes.
216216

217-
details
217+
details
218218
DocumentIntelligenceError[]
219219

220220
An array of details about specific errors that led to this reported error.
221221

222-
innererror
222+
innererror
223223
DocumentIntelligenceInnerError
224224

225225
An object containing more specific information than the current object about the error.
226226

227-
message
227+
message
228228
string
229229

230230
A human-readable representation of the error.
231231

232-
target
232+
target
233233
string
234234

235235
The target of the error.
236236

237237
DocumentIntelligenceErrorResponse
238238
Error response object.
239239

240-
Name Type Description
241-
error
240+
Name Type Description
241+
error
242242
DocumentIntelligenceError
243243

244244
Error info.
245245

246246
DocumentIntelligenceInnerError
247247
An object containing more specific information about the error.
248248

249-
Name Type Description
250-
code
249+
Name Type Description
250+
code
251251
string
252252

253253
One of a server-defined set of error codes.
254254

255-
innererror
255+
innererror
256256
DocumentIntelligenceInnerError
257257

258258
Inner error.
259259

260-
message
260+
message
261261
string
262262

263263
A human-readable representation of the error.
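
The three error definitions above nest inside one another, which can be easier to see in code. The following is a rough sketch only: the class and function names (`ErrorInfo`, `InnerError`, `parse_error_response`, `code_chain`) are hypothetical, and every field is treated as optional because the tables here do not say which fields are required.

```python
from dataclasses import dataclass, field
from typing import List, Optional


@dataclass
class InnerError:
    """Mirrors DocumentIntelligenceInnerError (code, message, nested innererror)."""
    code: Optional[str] = None
    message: Optional[str] = None
    innererror: Optional["InnerError"] = None

    @classmethod
    def from_dict(cls, data: dict) -> "InnerError":
        inner = data.get("innererror")
        return cls(
            code=data.get("code"),
            message=data.get("message"),
            innererror=cls.from_dict(inner) if inner else None,
        )


@dataclass
class ErrorInfo:
    """Mirrors DocumentIntelligenceError (code, message, target, details, innererror)."""
    code: Optional[str] = None
    message: Optional[str] = None
    target: Optional[str] = None
    details: List["ErrorInfo"] = field(default_factory=list)
    innererror: Optional[InnerError] = None

    @classmethod
    def from_dict(cls, data: dict) -> "ErrorInfo":
        inner = data.get("innererror")
        return cls(
            code=data.get("code"),
            message=data.get("message"),
            target=data.get("target"),
            details=[cls.from_dict(d) for d in data.get("details") or []],
            innererror=InnerError.from_dict(inner) if inner else None,
        )


def parse_error_response(body: dict) -> ErrorInfo:
    """Unwrap the top-level "error" property of DocumentIntelligenceErrorResponse."""
    return ErrorInfo.from_dict(body.get("error") or {})


def code_chain(err: ErrorInfo) -> List[str]:
    """Collect error codes from the outermost error down through each innererror."""
    codes = [err.code] if err.code else []
    inner = err.innererror
    while inner is not None:
        if inner.code:
            codes.append(inner.code)
        inner = inner.innererror
    return codes
```

Walking the `innererror` chain this way surfaces the most specific code last, which tends to be the most useful one to log.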
@@ -439,7 +439,7 @@ Find more samples on GitHub:
439439
440440
> [!NOTE]
441441
>
442-
> For extracting text from external images like labels, street signs, and posters, use the [Azure AI Image Analysis v4.0 Read](../..//Computer-vision/concept-ocr.md) feature optimized for general, non-document images with a performance-enhanced synchronous API that makes it easier to embed OCR in your user experience scenarios.
442+
> To extract text from external images like labels, street signs, and posters, use the [Azure AI Image Analysis v4.0 Read](../../Computer-vision/concept-ocr.md) feature optimized for general (not document) images with a performance-enhanced synchronous API. This capability makes it easier to embed OCR in real-time user experience scenarios.
443443
>
444444
445445
Document Intelligence Read Optical Character Recognition (OCR) model runs at a higher resolution than Azure AI Vision Read and extracts print and handwritten text from PDF documents and scanned images. It also includes support for extracting text from Microsoft Word, Excel, PowerPoint, and HTML documents. It detects paragraphs, text lines, words, locations, and languages. The Read model is the underlying OCR engine for other Document Intelligence prebuilt models like Layout, General Document, Invoice, Receipt, Identity (ID) document, Health insurance card, W2 in addition to custom models.
@@ -513,11 +513,11 @@ See our [Language Support—document analysis models](../language-support/ocr.md
513513
## Data extraction
514514
515515
> [!NOTE]
516-
> Microsoft Word and HTML file are supported in v3.1 and later versions. Compared with PDF and images, below features are not supported:
516+
> Microsoft Word and HTML files are supported in v4.0. The following capabilities are currently not supported:
517517
>
518-
> * There are no angle, width/height and unit with each page object.
519-
> * For each object detected, there is no bounding polygon or bounding region.
520-
> * Page range (`pages`) is not supported as a parameter.
518+
> * No angle, width/height, and unit returned with each page object.
519+
> * No bounding polygon or bounding region for each object detected.
520+
> * No support for page range (`pages`) as a parameter.
521521
> * No `lines` object.
522522
523523
## Searchable PDF
@@ -526,9 +526,9 @@ The searchable PDF capability enables you to convert an analog PDF, such as scan
526526
527527
> [!IMPORTANT]
528528
>
529-
> * Currently, the searchable PDF capability is only supported by Read OCR model `prebuilt-read`. When using this feature, please specify the `modelId` as `prebuilt-read`, as other model types will return an error.
530-
> * Searchable PDF is included with the 2024-11-30 `prebuilt-read` model with no additional cost for generating a searchable PDF output.
531-
> * Searchable PDF currently only supports PDF files as input. Support for other file types, such as image files, will be available later.
529+
> * Currently, only Read OCR model `prebuilt-read` supports the searchable PDF capability. When using this feature, specify the `modelId` as `prebuilt-read`. Other model types return an error.
530+
> * Searchable PDF is included with the `2024-11-30` `prebuilt-read` model with no added cost for generating a searchable PDF output.
531+
> * Searchable PDF currently only supports PDF files as input.
532532
533533
### Use searchable PDF
534534
