Skip to content

FR/add partition option to return pdf images and pass it through the apiΒ #306

@ron-unstructured

Description

@ron-unstructured

Description
This parameter is available in the partition_pdf but not through the API. With the new GPT-4V multimodal model, extracting images from source documents will be helpful.

To Reproduce

from unstructured.partition.pdf import partition_pdf
partition_pdf(filename, strategy="hi_res", extract_images_in_pdf=True)

Metadata

Metadata

Assignees

Labels

No labels
No labels

Type

No type

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions