You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: ui/enriching/image-descriptions.mdx
+5Lines changed: 5 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -39,6 +39,11 @@ Line breaks have been inserted here for readability. The output will not contain
39
39
}
40
40
```
41
41
42
+
<Note>
43
+
The `image_base64` field is generated only for documents or PDF pages that are [partitioned](/ui/partitioning) by using the High Res strategy. This field is not generated for
44
+
documents or PDF pages that are partitioned by using the Fast or VLM strategy.
45
+
</Note>
46
+
42
47
For workflows that use [chunking](/ui/chunking), note the following changes:
43
48
44
49
- Each `Image` element is replaced by a `CompositeElement` element.
Copy file name to clipboardExpand all lines: ui/enriching/table-descriptions.mdx
+11-1Lines changed: 11 additions & 1 deletion
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -43,14 +43,24 @@ Line breaks have been inserted here for readability. The output will not contain
43
43
}
44
44
```
45
45
46
+
<Note>
47
+
The `image_base64` field is generated only for documents or PDF pages that are [partitioned](/ui/partitioning) by using the High Res strategy. This field is not generated for
48
+
documents or PDF pages that are partitioned by using the Fast or VLM strategy.
49
+
</Note>
50
+
46
51
Here are two examples of the descriptions for detected tables. These descriptions are generated with GPT-4o by OpenAI:
47
52
48
53

49
54
50
55

51
56
52
57
The generated table's summary will overwrite any text that Unstructured had previously extracted from that table into the `text` field.
53
-
The table's original content is available in the `image_base64` field.
58
+
The table's original content is available in the `image_base64` field.
59
+
60
+
<Note>
61
+
The `image_base64` field is generated only for documents or PDF pages that are [partitioned](/ui/partitioning) by using the High Res strategy. This field is not generated for
62
+
documents or PDF pages that are partitioned by using the Fast or VLM strategy.
63
+
</Note>
54
64
55
65
For workflows that use [chunking](/ui/chunking), note the following changes:
Copy file name to clipboardExpand all lines: ui/enriching/table-to-html.mdx
+5-2Lines changed: 5 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -60,14 +60,17 @@ Line breaks have been inserted here for readability. The output will not contain
60
60
}
61
61
```
62
62
63
+
<Note>
64
+
The `image_base64` field is generated only for documents or PDF pages that are [partitioned](/ui/partitioning) by using the High Res strategy. This field is not generated for
65
+
documents or PDF pages that are partitioned by using the Fast or VLM strategy.
66
+
</Note>
67
+
63
68
For workflows that use [chunking](/ui/chunking), note the following changes:
64
69
65
70
- If a `Table` element must be chunked, the `Table` element is replaced by a set of related `TableChunk` elements.
66
71
- Each of these `TableChunk` elements will contain HTML table output for only its own element.
67
72
- None of the these `TableChunk` elements will contain an `image_base64` field.
Copy file name to clipboardExpand all lines: ui/summarizing.mdx
+5Lines changed: 5 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -71,6 +71,11 @@ Line breaks have been inserted here for readability. The output will not contain
71
71
}
72
72
```
73
73
74
+
<Note>
75
+
The `image_base64` field is generated only for documents or PDF pages that are [partitioned](/ui/partitioning) by using the High Res strategy. This field is not generated for
76
+
documents or PDF pages that are partitioned by using the Fast or VLM strategy.
0 commit comments