Consider using Document Intelligence method for extracting figure images #2311

@pamelafox

Description

We currently extract figure images with Pillow plus PyMuPDF, but apparently Document Intelligence (DI) has its own method for this as well:
https://learn.microsoft.com/en-us/python/api/overview/azure/ai-documentintelligence-readme?view=azure-python-preview#extract-figures-from-documents

We should try it and compare the results against our current approach on quality, latency, and cost.
We could also look at their code to see whether it basically does the same thing we do.
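
A rough sketch of how the comparison could be run is below. It only times each approach and counts what comes back. The `Extractor` signature (PDF bytes in, a dict of figure id to image bytes out) and the names `measure` and `compare` are assumptions made for illustration, not existing code in the repo or the DI SDK. The current Pillow/PyMuPDF path and the DI call would each need to be wrapped to fit that shape.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, List

# Hypothetical shape for an extractor: PDF bytes in, {figure_id: image_bytes} out.
# Both the existing Pillow/PyMuPDF code and the DI method would need a thin wrapper
# to match this; neither wrapper is shown here.
Extractor = Callable[[bytes], Dict[str, bytes]]


@dataclass
class ExtractionReport:
    name: str
    seconds: float      # wall-clock latency of one extraction run
    figure_count: int   # how many figures the approach found
    total_bytes: int    # combined size of the returned images


def measure(name: str, extractor: Extractor, pdf_bytes: bytes) -> ExtractionReport:
    """Run one extractor once and record latency and output size."""
    start = time.perf_counter()
    figures = extractor(pdf_bytes)
    elapsed = time.perf_counter() - start
    total = sum(len(image) for image in figures.values())
    return ExtractionReport(name, elapsed, len(figures), total)


def compare(pdf_bytes: bytes, extractors: Dict[str, Extractor]) -> List[ExtractionReport]:
    """Run every extractor on the same document, in the order given."""
    return [measure(name, fn, pdf_bytes) for name, fn in extractors.items()]
```

Figure count and byte size are only crude proxies. Image quality would still need a visual check on a few documents, and cost has to come from the DI pricing rather than from this script.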

Labels: vision (Related to the multimodal feature that can ingest figures and answer questions based off images)