it confuses me that how we can extract the non-rectangular images #2339

BriskyGates · 2023-04-13T09:19:51Z

BriskyGates
Apr 13, 2023

def extract_images(pdf_path, output_folder):
    doc = fitz.open(pdf_path)
    for page_num in range(len(doc)):
        page = doc.load_page(page_num)
        image_list = page.get_images(full=True)
        for img_index, img in enumerate(image_list):
            xref = img[0]
            base_image = doc.extract_image(xref)
            image_bytes = base_image["image"]

            # save to the png file
            with open(f"{output_folder}/image_p{page_num + 1}_i{img_index + 1}.png", "wb") as img_out:
                img_out.write(image_bytes)

pdf_font_garbled.pdf
eg. page2
there exists two kinds of images, the portrait and the watermark, but the former is non-rectangular, how we fill it with black background

thanks in advance :>

Answered by JorjMcKie

Apr 13, 2023

Looking closer at the images on page 2, you will see that a number of them has masks, these items have a second entry > 0, e.g. (53, 90, 173, 173, 8, 'DeviceRGB', '', 'Image53', 'DCTDecode') has 90 there. This is an image mask which must be applied to get the full image. The following snippet only extracts images with a mask and recovers the full picture by applying the mask to the base image:

for item in page.get_images():
    xref = item[0]  # base image xref
    mask = item[1]  # mask xref
    if mask == 0: continue  # ignore if no masked image
    pix0 = fitz.Pixmap(doc, xref)  # pixmap of base image
    if pix0.alpha: pix0 = fitz.Pixmap(pix0, 0)  # remove alpha channel if present
    p…

View full answer

JorjMcKie · 2023-04-13T10:01:16Z

JorjMcKie
Apr 13, 2023
Maintainer

Looking closer at the images on page 2, you will see that a number of them has masks, these items have a second entry > 0, e.g. (53, 90, 173, 173, 8, 'DeviceRGB', '', 'Image53', 'DCTDecode') has 90 there. This is an image mask which must be applied to get the full image. The following snippet only extracts images with a mask and recovers the full picture by applying the mask to the base image:

for item in page.get_images():
    xref = item[0]  # base image xref
    mask = item[1]  # mask xref
    if mask == 0: continue  # ignore if no masked image
    pix0 = fitz.Pixmap(doc, xref)  # pixmap of base image
    if pix0.alpha: pix0 = fitz.Pixmap(pix0, 0)  # remove alpha channel if present
    pixm = fitz.Pixmap(doc, mask)  # pixmap of mask
    pix = fitz.Pixmap(pix0,pixm)  # merge base and mask pixmap to one transparent pixmap
    fname=f"{xref}.png"  # output filename
    pix.save(fname)  # save recovered picture

delivers this for base image xref 53:

0 replies

BriskyGates · 2023-04-17T05:17:37Z

BriskyGates
Apr 17, 2023
Author

thanks a lot!!! :>

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

it confuses me that how we can extract the non-rectangular images #2339

Uh oh!

{{title}}

Uh oh!

Replies: 2 comments

Uh oh!

{{title}}

Uh oh!

Uh oh!

{{title}}

Uh oh!

Select a reply

Uh oh!

it confuses me that how we can extract the non-rectangular images #2339

Uh oh!

BriskyGates Apr 13, 2023

Replies: 2 comments

Uh oh!

JorjMcKie Apr 13, 2023 Maintainer

Uh oh!

BriskyGates Apr 17, 2023 Author

BriskyGates
Apr 13, 2023

JorjMcKie
Apr 13, 2023
Maintainer

BriskyGates
Apr 17, 2023
Author