You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: articles/logic-apps/parse-document-chunk-text.md
+15-15Lines changed: 15 additions & 15 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -1,26 +1,26 @@
1
1
---
2
2
title: Parse document or chunk text
3
-
description: Parse a document or chunk text for Standard workflows in Azure Logic Apps.
3
+
description: Parse a document or chunk text for workflows in Azure Logic Apps.
4
4
services: azure-logic-apps
5
5
ms.suite: integration
6
6
ms.collection: ce-skilling-ai-copilot
7
7
ms.reviewer: estfan, azla
8
8
ms.topic: how-to
9
-
ms.date: 08/16/2024
10
-
# Customer intent: As a developer using Azure Logic Apps, I want to parse a document or chunk text that I want to use with Azure AI operations for my Standard workflow in Azure Logic Apps.
9
+
ms.date: 12/18/2024
10
+
# Customer intent: As a developer using Azure Logic Apps, I want to parse a document or chunk text that I want to use with Azure AI operations for my workflow in Azure Logic Apps.
11
11
---
12
12
13
-
# Parse or chunk content for Standard workflows in Azure Logic Apps (Preview)
13
+
# Parse or chunk content for workflows in Azure Logic Apps (Preview)
> This capability is in preview and is subject to the
19
19
> [Supplemental Terms of Use for Microsoft Azure Previews](https://azure.microsoft.com/support/legal/preview-supplemental-terms/).
20
20
21
21
Sometimes you have to convert content into tokens, which are words or chunks of characters, or divide a large document into smaller pieces before you can use this content with some actions. For example, the **Azure AI Search** or **Azure OpenAI** actions expect tokenized input and can handle only a limited number of tokens.
22
22
23
-
For these scenarios, use the **Data Operations** actions named **Parse a document** and **Chunk text** in your Standard logic app workflow. These actions respectively transform content, such as a PDF document, CSV file, Excel file, and so on, into tokenized string output and then split the string into pieces, based on the number of tokens. You can then reference and use these outputs with subsequent actions in your workflow.
23
+
For these scenarios, use the **Data Operations** actions named **Parse a document** and **Chunk text** in your logic app workflow. These actions respectively transform content, such as a PDF document, CSV file, Excel file, and so on, into tokenized string output and then split the string into pieces, based on the number of tokens. You can then reference and use these outputs with subsequent actions in your workflow.
24
24
25
25
> [!TIP]
26
26
>
@@ -40,15 +40,15 @@ This how-to guide shows how to add and set up these operations in your workflow.
40
40
41
41
* An Azure account and subscription. If you don't have an Azure subscription, [sign up for a free Azure account](https://azure.microsoft.com/free/?WT.mc_id=A261C142F).
42
42
43
-
* A Standard logic app workflow with an existing trigger because the **Parse a document** and **Chunk text** operations are available only as actions. Make sure that the action that retrieves the content that you want to parse or chunk precedes these data operations.
43
+
* A Consumption or Standard logic app workflow with an existing trigger because the **Parse a document** and **Chunk text** operations are available only as actions. Make sure that the action that retrieves the content that you want to parse or chunk precedes these data operations.
44
44
45
45
## Parse a document
46
46
47
47
The **Parse a document** action converts content, such as a PDF document, CSV file, Excel file, and so on, into a tokenized string. For this example, suppose your workflow starts with the **Request** trigger named **When a HTTP request is received**. This trigger waits to receive an HTTP request sent from another component, such as an Azure function, another logic app workflow, and so on. The HTTP request includes the URL for a new uploaded document that is available for the workflow to retrieve and parse. An **HTTP** action immediately follows the trigger, and sends an HTTP request to the document's URL, and returns with the document content from its storage location.
48
48
49
49
If you use other content sources, such as Azure Blob Storage, SharePoint, OneDrive, File System, FTP, and so on, you can check whether triggers are available for these sources. You can also check whether actions are available to retrieve the content for these sources. For more information, see [Built-in operations](/azure/logic-apps/connectors/built-in/reference/) and [Managed connectors](/connectors/connector-reference/connector-reference-logicapps-connectors).
50
50
51
-
1. In the [Azure portal](https://portal.azure.com), open your Standard logic app resource and workflow in the designer.
51
+
1. In the [Azure portal](https://portal.azure.com), open your logic app resource and workflow in the designer.
52
52
53
53
1. Under the existing trigger and actions, [follow these general steps to add the **Data Operations** action named **Parse a document**](create-workflow-with-trigger-or-action.md#add-action) to your workflow.
54
54
@@ -70,11 +70,11 @@ If you use other content sources, such as Azure Blob Storage, SharePoint, OneDri
70
70
71
71
In this example, the **Parse a document** action references the **Body** output from the **HTTP** action.
72
72
73
-
:::image type="content" source="media/parse-document-chunk-text/select-http-body.png" alt-text="Screenshot shows Standard workflow designer, and action named Parse a document with open dynamic content list. and selected Body output from HTTP action." lightbox="media/parse-document-chunk-text/select-http-body.png":::
73
+
:::image type="content" source="media/parse-document-chunk-text/select-http-body.png" alt-text="Screenshot shows workflow designer, which has an action named Parse a document with opened dynamic content list and selected Body output from HTTP action.":::
74
74
75
75
The **Body** output now appears in the **Document Content** box:
76
76
77
-
:::image type="content" source="media/parse-document-chunk-text/parse-document.png" alt-text="Screenshot shows sample workflow with Body output in the action named Parse a document." lightbox="media/parse-document-chunk-text/parse-document.png":::
77
+
:::image type="content" source="media/parse-document-chunk-text/parse-document.png" alt-text="Screenshot shows sample workflow with Body output in the action named Parse a document.":::
78
78
79
79
1. Under the **Parse a document** action, add the actions that you want to work with the tokenized string output, for example, **Chunk text**, which this guide describes later.
80
80
@@ -102,7 +102,7 @@ The **Chunk text** action splits content into smaller pieces for subsequent acti
102
102
> Preceding actions that use chunking don't affect the **Chunk text** action,
103
103
> nor does the **Chunk text** action affect subsequent actions that use chunking.
104
104
105
-
1. In the [Azure portal](https://portal.azure.com), open your Standard logic app resource and workflow in the designer.
105
+
1. In the [Azure portal](https://portal.azure.com), open your logic app resource and workflow in the designer.
106
106
107
107
1. Under the **Parse a document** action, [follow these general steps to add the **Data Operations** action named **Chunk text**](create-workflow-with-trigger-or-action.md#add-action).
108
108
@@ -128,11 +128,11 @@ The **Chunk text** action splits content into smaller pieces for subsequent acti
128
128
129
129
In this example, the **Chunk text** action references the **Parsed result text** output from the **Parse a document** action.
130
130
131
-
:::image type="content" source="media/parse-document-chunk-text/select-parsed-result-text.png" alt-text="Screenshot shows Standard workflow designer, action named Chunk text with open dynamic content list, and selected output from Parse a document action." lightbox="media/parse-document-chunk-text/select-parsed-result-text.png":::
131
+
:::image type="content" source="media/parse-document-chunk-text/select-parsed-result-text.png" alt-text="Screenshot shows workflow designer, action named Chunk text with open dynamic content list, and selected output from Parse a document action.":::
132
132
133
-
The **Parsed result action**output now appears in the **Text**box:
133
+
The **Text**box now shows the **Parsed result action**output:
134
134
135
-
:::image type="content" source="media/parse-document-chunk-text/chunk-text.png" alt-text="Screenshot shows sample workflow with selected parsed result text output in the action named Chunk text." lightbox="media/parse-document-chunk-text/chunk-text.png":::
135
+
:::image type="content" source="media/parse-document-chunk-text/chunk-text.png" alt-text="Screenshot shows sample workflow with selected parsed result text output in the action named Chunk text.":::
136
136
137
137
1. Complete the setup for the **Chunk text** action, based on your selected strategy and scenario. For more information, see [Chunk text - Reference](#chunk-text---reference).
138
138
@@ -171,7 +171,7 @@ Now, when you add other actions that expect and use tokenized input, such as the
171
171
172
172
The following example includes other actions that create a complete workflow pattern to ingest data from any source:
0 commit comments