PDF #12217

Taghreed7878 · 2023-08-27T16:05:24Z

Taghreed7878
Aug 27, 2023

Is there a way to make the PaddleOCR object accept bytes for PDF files, the way it already accepts bytes for images?

nullgogo · 2023-08-28T08:53:24Z

nullgogo
Aug 28, 2023

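Use the fitz library (PyMuPDF) to convert the PDF to images, reading it with fitz.open(stream=pdf_bytes, filetype='pdf'). The PaddleOCR source code opens the file path directly.

I hope support for PDF bytes can be integrated as well.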

Taghreed7878 · 2023-08-28T08:59:45Z

Taghreed7878
Aug 28, 2023
Author

But this will write the PDF bytes into a file, right?

nullgogo · 2023-08-28T09:06:14Z

nullgogo
Aug 28, 2023

But this will write the PDF bytes into a file, right?

No, there is no need to write it to a file.

See the code below; imgs can be passed in directly as the input argument.

import cv2
import fitz  # PyMuPDF
import numpy as np
from PIL import Image

imgs = []
# pdf_bytes holds the raw PDF content; the stream is opened in memory, no file is written
with fitz.open(stream=pdf_bytes, filetype='pdf') as pdf:
    for pg in range(pdf.page_count):
        page = pdf[pg]
        # render at 2x zoom so small text is sharper
        mat = fitz.Matrix(2, 2)
        pm = page.get_pixmap(matrix=mat, alpha=False)

        # if width or height > 2000 pixels, don't enlarge the image
        if pm.width > 2000 or pm.height > 2000:
            pm = page.get_pixmap(matrix=fitz.Matrix(1, 1), alpha=False)

        # pixmap -> PIL image -> BGR array, the channel order OpenCV uses
        img = Image.frombytes("RGB", (pm.width, pm.height), pm.samples)
        img = cv2.cvtColor(np.array(img), cv2.COLOR_RGB2BGR)
        imgs.append(img)

nullgogo · 2023-08-28T09:19:30Z

nullgogo
Aug 28, 2023

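With imgs passed as a list, det has to be set to False... and even then it does not seem to work. I still have to pass imgs[0], imgs[1], ... in one page at a time. I'm not sure whether I got something wrong.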
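
The page-by-page workaround described above can be sketched roughly as follows. `run_ocr_per_page` is a hypothetical helper, not part of PaddleOCR, and `ocr_fn` is a stand-in for whatever callable runs OCR on a single image; the stub `fake_ocr` is there only so the loop runs without PaddleOCR installed.

```python
from typing import Any, Callable, List, Sequence


def run_ocr_per_page(pages: Sequence[Any], ocr_fn: Callable[[Any], Any]) -> List[Any]:
    """Call ocr_fn once per page image and keep the results in page order."""
    return [ocr_fn(page) for page in pages]


def fake_ocr(page: Any) -> List[str]:
    """Stand-in for a real OCR call (assumption: one image in, one result out)."""
    return [f"text from {page}"]


# Strings stand in for the per-page image arrays (imgs[0], imgs[1], ...)
results = run_ocr_per_page(["page0", "page1"], fake_ocr)
```

With PaddleOCR this might be called as `run_ocr_per_page(imgs, lambda img: ocr.ocr(img, cls=True))`, assuming `ocr` is a `PaddleOCR` instance; the exact shape of each result depends on the installed version.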

Taghreed7878 · 2023-09-10T16:00:48Z

Taghreed7878
Sep 10, 2023
Author

I think the issue with this approach is that each page is processed separately, so the bboxes from the individual pages are not accumulated, right?
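
One possible way to accumulate the per-page boxes afterwards, as a sketch only: tag every box with its page index and, if a single coordinate system is wanted, shift its y values by the heights of the preceding pages, as if the pages were stacked vertically. `merge_page_boxes` and the record layout are hypothetical, and the boxes are assumed to be lists of four (x, y) corner points in pixels.

```python
from typing import Dict, List, Sequence, Tuple

Box = List[Tuple[float, float]]  # four (x, y) corner points, in pixels


def merge_page_boxes(pages: Sequence[Tuple[int, List[Box]]]) -> List[Dict]:
    """Merge per-page boxes into one list.

    pages: one (page_height_px, boxes) tuple per page, in page order.
    Each record keeps the page index, the original box, and a copy of the
    box shifted down by the total height of all earlier pages.
    """
    merged: List[Dict] = []
    y_offset = 0
    for page_index, (height, boxes) in enumerate(pages):
        for box in boxes:
            merged.append({
                "page": page_index,
                "box": box,
                "stacked_box": [(x, y + y_offset) for x, y in box],
            })
        y_offset += height
    return merged


# Two pages: the first is 100 px tall, the second 200 px tall
example = merge_page_boxes([
    (100, [[(0, 0), (10, 0), (10, 5), (0, 5)]]),
    (200, [[(1, 2), (3, 2), (3, 4), (1, 4)]]),
])
```

Keeping both the page index and the original box means nothing is lost if the stacked coordinates turn out not to be needed.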

BrownTen · 2023-11-28T06:41:52Z

BrownTen
Nov 28, 2023

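May I ask why this enlarge step is needed? That said, when I call get_pixmap() directly, the results are indeed not very good.
Also, could you explain what exactly this matrix means? Is it just a resize, or does it apply some kind of enhancement?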
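
As far as I know, fitz.Matrix(2, 2) is a plain scaling matrix: the page is rasterized at twice the default resolution in x and y, and no image enhancement is applied. PDF page sizes are given in points (72 per inch), so the default get_pixmap() output is roughly 72 dpi, which is often too coarse for small text. The arithmetic can be sketched like this; both helper names are hypothetical and the pixel size is approximate, since the renderer may round differently.

```python
from typing import Tuple


def zoom_to_dpi(zoom: float, base_dpi: float = 72.0) -> float:
    """Effective resolution when rendering with a scale factor of `zoom`."""
    return zoom * base_dpi


def rendered_size(width_pt: float, height_pt: float, zoom: float) -> Tuple[int, int]:
    """Approximate pixmap size in pixels for a page measured in points."""
    return round(width_pt * zoom), round(height_pt * zoom)


# An A4 page is about 595 x 842 points
a4_default = rendered_size(595, 842, 1)
a4_zoomed = rendered_size(595, 842, 2)
dpi_zoomed = zoom_to_dpi(2)
```

This also shows why the quoted code checks for 2000 pixels: an A4 page at zoom 2 stays below that limit, while a much larger page would exceed it and is re-rendered at zoom 1.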