Skip to content
Discussion options

You must be logged in to vote

Are you talking about incomplete font files?

Yes, exactly. Could be that the font in question only has /ToUnicode entries for numbers and maybe always returns the space character code for all other glyphs.

WRT to "incomplete" fonts: The /ToUnicode information is optional for a font. There is no law or rule prescribing that a font must provide this information at all. Please recall that PDF has originally been created to display information for human reception. Not as a data store - things like text extraction, image extraction, etc. came later and - as I wrote - are not necessarily reliable.

Replies: 4 comments 3 replies

Comment options

You must be logged in to vote
2 replies
@ousia
Comment options

@JorjMcKie
Comment options

Comment options

You must be logged in to vote
0 replies
Comment options

You must be logged in to vote
1 reply
@JorjMcKie
Comment options

Answer selected by JorjMcKie
Comment options

You must be logged in to vote
0 replies
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
3 participants
Converted from issue

This discussion was converted from issue #2516 on July 05, 2023 15:02.