Replies: 1 comment 1 reply
-
This is typical "Discussions" item - no bug. So let me transfer this first. |
Beta Was this translation helpful? Give feedback.
1 reply
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
🐛 Describe the bug
Hey, it seems that in one of my docs, the library is extracting invisible characters/characters of the same colors as the background.
Is there a way to parse ONLY visible chars / all characters that have a different color from the BG?
👨🔬 To Reproduce
I'd love to upload the file as well but it's not for general distribution.
If you give me a way to check if it's the former or the latter, I'll gladly check out and tell you more!
🖼️ Screenshots
green area (highlighted)

⚙️ Configuration
Linux - Ubuntu 22.04.2 LTS (Jammy Jellyfish)
3.10.6 (main, Nov 14 2022, 16:10:14) [GCC 11.3.0]
PyMuPDF 1.21.1: Python bindings for the MuPDF 1.21.1 library.
Version date: 2022-12-13 00:00:01.
Built for Python 3.10 on linux (64-bit).
Beta Was this translation helpful? Give feedback.
All reactions