You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: open-source/core-functionality/partitioning.mdx
+4-12Lines changed: 4 additions & 12 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -229,17 +229,13 @@ elements = partition_email(text=text, include_headers=True)
229
229
230
230
`partition_email` includes a `max_partition` parameter that indicates the maximum character length for a document element. This parameter only applies if `"text/plain"` is selected as the `content_source`. The default value is `1500`, which roughly corresponds to the average character length for a paragraph. You can disable `max_partition` by setting it to `None`.
231
231
232
-
You can optionally partition e-mail attachments by setting `process_attachments=True`. If you set `process_attachments=True`, you’ll also need to pass in a partitioning function to `attachment_partitioner`. The following is an example of what the workflow looks like:
232
+
You can optionally partition e-mail attachments by setting `process_attachments=True`. The following is an example of what the workflow looks like:
233
233
234
234
```python
235
-
from unstructured.partition.auto import partition
236
235
from unstructured.partition.email import partition_email
elements = partition_email(filename=filename, process_attachments=True)
243
239
```
244
240
245
241
@@ -377,17 +373,13 @@ elements = partition_msg(filename="example-docs/fake-email.msg")
377
373
378
374
`partition_msg` includes a `max_partition` parameter that indicates the maximum character length for a document element. This parameter only applies if `"text/plain"` is selected as the `content_source`. The default value is `1500`, which roughly corresponds to the average character length for a paragraph. You can disable `max_partition` by setting it to `None`.
379
375
380
-
You can optionally partition e-mail attachments by setting `process_attachments=True`. If you set `process_attachments=True`, you’ll also need to pass in a partitioning function to `attachment_partitioner`. The following is an example of what the workflow looks like:
376
+
You can optionally partition e-mail attachments by setting `process_attachments=True`. The following is an example of what the workflow looks like:
381
377
382
378
```python
383
-
from unstructured.partition.auto import partition
384
379
from unstructured.partition.msg import partition_msg
0 commit comments