Replies: 1 comment
-
See also https://gist.github.com/mara004/881d0c5a99b8444fd5d1d21a333b70f8 |
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Hi community,
as a related project of interest, I started to experiment with JPype API bindings to Apache PDFBox.
Like pdfium, it is (I think) liberal-licensed and can render PDFs.
Currently, all Python pdfbox bindings known to me use the CLI, either via subprocess (pdf2jpg), or by calling CLI entrypoint functions (python-pdfbox).1 This has some disadvantages, e.g. callers are restricted to a few pre-defined workflows, all memory transfer needs to go through files, etc. Using
subprocess
leads to further performance drawbacks.I now wrote a small gist that uses the actual pdfbox API with JPype, showing init, rendering, buffer transfer, and PIL image retrieval: https://gist.github.com/mara004/51c3216a9eabd3dcbc78a86d877a61dc
This makes me wonder if it would be possible to create a similar bindings project like pypdfium2 for pdfbox,
i.e. a setup infrastructure to download/bundle a PDFBox Jar, and a set of helpers & tests.
Setup should be easier because it's just one Jar, not platform specific binaries.
However, I'm afraid my work load is high and I may not be able to cope with a second bindings project, but wanted to share the idea here.
Footnotes
None of which seem to be actively maintained, though ↩
Beta Was this translation helpful? Give feedback.
All reactions