Merge pull request #205405 from laujan/remove-entities

v-stsavell · web-flow · commit a7b8aedf7754 · 2022-07-20T15:20:37.000-05:00
remove entities
diff --git a/articles/applied-ai-services/form-recognizer/concept-general-document.md b/articles/applied-ai-services/form-recognizer/concept-general-document.md
@@ -7,16 +7,15 @@ manager: nitinme
 ms.service: applied-ai-services
 ms.subservice: forms-recognizer
 ms.topic: conceptual
-ms.date: 06/06/2022
+ms.date: 07/20/2022
 ms.author: lajanuar
 recommendations: false
 ---
 <!-- markdownlint-disable MD033 -->
 
 # Form Recognizer general document model (preview)
 
-The General document preview model combines powerful Optical Character Recognition (OCR) capabilities with deep learning models to extract key-value pairs, selection marks, and entities from documents. General document is only available with the preview (v3.0) API.  For more information on using the preview (v3.0) API, see our [migration guide](v3-migration-guide.md).
-
+The General document preview model combines powerful Optical Character Recognition (OCR) capabilities with deep learning models to extract key-value pairs, tables, and selection marks from documents. General document is only available with the preview (v3.0) API.  For more information on using the preview (v3.0) API, see our [migration guide](v3-migration-guide.md).
 
 The general document API supports most form types and will analyze your documents and extract keys and associated values. It's ideal for extracting common key-value pairs from documents. You can use the general document model as an alternative to training a custom model without labels.
 
@@ -27,7 +26,7 @@ The general document API supports most form types and will analyze your document
 
 * The general document model is a pre-trained model; it doesn't require labels or training.
 
-* A single API extracts key-value pairs, selection marks, entities, text, tables, and structure from documents.
+* A single API extracts key-value pairs, selection marks, text, tables, and structure from documents.
 
 * The general document model supports structured, semi-structured, and unstructured documents.
 
@@ -81,21 +80,11 @@ Key-value pairs are specific spans within the document that identify a label or
 
 Keys can also exist in isolation when the model detects that a key exists, with no associated value or when processing optional fields. For example, a middle name field may be left blank on a form in some instances. Key-value pairs are spans of text contained in the document. If you have documents where the same value is described in different ways, for example, customer and user, the associated key will be either customer or user based on context.
 
-## Entities
-
-Natural language processing models can identify parts of speech and classify each token or word. The named entity recognition model is able to identify entities like people, locations, and dates to provide for a richer experience. Identifying entities enables you to distinguish between customer types, for example,  an individual or an organization.
-
-The key-value pair extraction model and entity identification model are run in parallel on the entire document—not just on the values of the extracted key-value pairs. This process ensures that complex structures where a key can't be identified are still enriched by identifying the entities referenced. You can still match keys or values to entities based on the offsets of the identified spans.
-
-* The general document is a pre-trained model and can be directly invoked via the REST API.
-
-* The general document model supports named entity recognition (NER) for several entity categories. NER is the ability to identify different entities in text and categorize them into pre-defined classes or types such as: person, location, event, product, and organization. Extracting entities can be useful in scenarios where you want to validate extracted values. The entities are extracted from the entire content and not just the extracted values.
-
 ## Data extraction
 
-| **Model**   | **Text extraction** |**Key-Value pairs** |**Selection Marks**   | **Tables**   |**Entities** |
-| --- | :---: |:---:| :---: | :---: |:---: |
-|General document  | ✓  |  ✓ | ✓  | ✓  | ✓  |
+| **Model**   | **Text extraction** |**Key-Value pairs** |**Selection Marks**   | **Tables**   |
+| --- | :---: |:---:| :---: | :---: |
+|General document  | ✓  |  ✓ | ✓  | ✓  |
 
 ## Input requirements
 
@@ -114,29 +103,8 @@ The key-value pair extraction model and entity identification model are run in p
 |--------|:----------------------|:---------|
 |General document| <ul><li>English (United States)—en-US</li></ul>| English (United States)—en-US|
 
-### Named entity recognition (NER) categories
-
-| Category | Type | Description |
-|-----------|-------|--------------------|
-| Person | String | A person's partial or full name. |
-| PersonType | String | A person's job type or role.  |
-| Location | String | Natural and human-made landmarks, structures, geographical features, and geopolitical entities. |
-| Organization | String | Companies, political groups, musical bands, sport clubs, government bodies, and public organizations. |
-| Event | String | Historical, social, and naturally occurring events. |
-| Product | String |Physical objects of various categories. |
-| Skill | String | A capability, skill, or expertise. |
-| Address | String | Full mailing addresses. |
-| Phone number | String| Phone numbers. | 
-| Email | String | Email address. |
-| URL | String | Website URLs and links. |
-| IP Address | String | Network IP addresses. |
-| DateTime | String | Dates and times of day. |
-| Quantity | String | Numerical measurements and units. |
-
 ## Considerations
 
-* Extracting entities can be useful in scenarios where you want to validate extracted values. The entities are extracted on the entire contents of the documents and not just the extracted values.
-
 * Keys are spans of text extracted from the document, for semi structured documents, keys may need to be mapped to an existing dictionary of keys.
 
 * Expect to see key-value pairs with a key, but no value. For example if a user chose to not provide an email address on the form.
diff --git a/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-csharp-sdk.md b/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-csharp-sdk.md
@@ -168,7 +168,6 @@ Analyze and extract text, tables, structure, key-value pairs, and named entities
 > * For this example, you'll need a **form document file from a URI**. You can use our [sample form document](https://raw.githubusercontent.com/Azure-Samples/cognitive-services-REST-api-samples/master/curl/form-recognizer/sample-layout.pdf) for this quickstart.
 > * To analyze a given file at a URI, you'll use the `StartAnalyzeDocumentFromUri` method and pass `prebuilt-document` as the model ID. The returned value is an `AnalyzeResult` object containing data about the submitted document.
 > * We've added the file URI value to the `Uri fileUri` variable at the top of the script.
-> * For simplicity, all the entity fields that the service returns are not shown here. To see the list of all supported fields and corresponding types, see the [General document](../concept-general-document.md#named-entity-recognition-ner-categories) concept page.
 
 **Add the following code sample to the Program.cs file. Make sure you update the key and endpoint variables with values from your Azure portal Form Recognizer instance:**
 
diff --git a/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-java-sdk.md b/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-java-sdk.md
@@ -147,7 +147,6 @@ Extract text, tables, structure, key-value pairs, and named entities from docume
 > * For this example, you'll need a **form document file at a URI**. You can use our [sample form document](https://raw.githubusercontent.com/Azure-Samples/cognitive-services-REST-api-samples/master/curl/form-recognizer/sample-layout.pdf) for this quickstart.
 > * To analyze a given file at a URI, you'll use the `beginAnalyzeDocumentFromUrl` method and pass `prebuilt-document` as the model Id. The returned value is an `AnalyzeResult` object containing data about the submitted document.
 > * We've added the file URI value to the `documentUrl` variable in the main method.
-> * For simplicity, all the entity fields that the service returns are not shown here. To see the list of all supported fields and corresponding types, see our [General document](../concept-general-document.md#named-entity-recognition-ner-categories) concept page.
 
 **Add the following code sample to the `FormRecognizer.java` file. Make sure you update the key and endpoint variables with values from your Azure portal Form Recognizer instance:**
 
diff --git a/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-javascript-sdk.md b/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-javascript-sdk.md
@@ -112,7 +112,6 @@ Extract text, tables, structure, key-value pairs, and named entities from docume
 > * For this example, you'll need a **form document file from a URL**. You can use our [sample form document](https://raw.githubusercontent.com/Azure-Samples/cognitive-services-REST-api-samples/master/curl/form-recognizer/sample-layout.pdf) for this quickstart.
 > * To analyze a given file from a URL, you'll use the `beginAnalyzeDocuments` method and pass in `prebuilt-document` as the model Id.
 > * We've added the file URL value to the `formUrl` variable near the top of the file.
-> * To see the list of all supported fields and corresponding types, see our [General document](../concept-general-document.md#named-entity-recognition-ner-categories) concept page.
 
 **Add the following code sample to the `index.js` file. Make sure you update the key and endpoint variables with values from your Azure portal Form Recognizer instance:**
 
diff --git a/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-python-sdk.md b/articles/applied-ai-services/form-recognizer/quickstarts/try-v3-python-sdk.md
@@ -88,7 +88,6 @@ Extract text, tables, structure, key-value pairs, and named entities from docume
 > * For this example, you'll need a **form document file from a URL**. You can use our [sample form document](https://raw.githubusercontent.com/Azure-Samples/cognitive-services-REST-api-samples/master/curl/form-recognizer/sample-layout.pdf) for this quickstart.
 > * To analyze a given file at a URL, you'll use the `begin_analyze_document_from_url` method and pass in `prebuilt-document` as the model Id. The returned value is a `result` object containing data about the submitted document.
 > * We've added the file URL value to the `docUrl` variable in the `analyze_general_documents` function.
-> * For simplicity, all the entity fields that the service returns are not shown here. To see the list of all supported fields and corresponding types, see our [General document](../concept-general-document.md#named-entity-recognition-ner-categories) concept page.
 
 <!-- markdownlint-disable MD036 -->
 **Add the following code sample to your form_recognizer_quickstart.py application. Make sure you update the key and endpoint variables with values from your Azure portal Form Recognizer instance:**
diff --git a/articles/applied-ai-services/form-recognizer/v3-migration-guide.md b/articles/applied-ai-services/form-recognizer/v3-migration-guide.md
@@ -7,7 +7,7 @@ manager: nitinme
 ms.service: applied-ai-services
 ms.subservice: forms-recognizer
 ms.topic: how-to
-ms.date: 06/06/2022
+ms.date: 07/20/2022
 ms.author: lajanuar
 recommendations: false
 ---
@@ -21,7 +21,7 @@ recommendations: false
 Form Recognizer v3.0 (preview) introduces several new features and capabilities:
 
 * [Form Recognizer REST API](quickstarts/try-v3-rest-api.md) has been redesigned for better usability.
-* [**General document (v3.0)**](concept-general-document.md) model is a new API that extracts text, tables, structure, key-value pairs, and named entities from forms and documents.
+* [**General document (v3.0)**](concept-general-document.md) model is a new API that extracts text, tables, structure, and key-value pairs, from forms and documents.
 * [**Custom document model (v3.0)**](concept-custom-neural.md) is a new custom model type to extract fields from structured and unstructured documents.
 * [**Receipt (v3.0)**](concept-receipt.md) model supports single-page hotel receipt processing.
 * [**ID document (v3.0)**](concept-id-document.md) model supports endorsements, restrictions, and vehicle classification extraction from US driver's licenses.