Skip to content

Replacing image/extracting text layer from pdf? #951

@vs-777

Description

@vs-777

I have a two layered pdf - the background layer is an image and the front layer is text obtained from an OCR engine. I need to replace the image with another while keeping the text layer the same. Or, if it is easier, extract the text layer and place it with the same coordinates on the other image. Is either of these possible with PyMuPDF? I have looked at the issue #338, in order to remove the image and place a pixmap of the new image onto the pdf. You mention that it's not possible to completely remove the image, maybe a new feature has been added since then to allow for this?

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions