This repository was archived by the owner on Apr 15, 2024. It is now read-only.

Implemented OneByteIdentityH/V encoding cmap (with test pdf) #179

Open
hynekcer wants to merge 1 commit into euske:master from hynekcer:byte-identity

@hynekcer

Some types of CID fonts with an "Identity" cmap are problematic: nothing is extracted from them. An example is http://stackoverflow.com/questions/43832499/pdfminer-empty-output

I fixed it. The simple test file is taken from the question http://stackoverflow.com/questions/38119176/itextsharp-font-widths-definition-not-correctly-loaded

A better or more general solution might be to use the existing /ToUnicode map, but my solution follows the approach of /Identity-H, which also ignores the Unicode map and assumes that the mapping is an identity.
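
To illustrate the difference between a two-byte and a one-byte identity mapping, here is a minimal sketch. It is not the code from this pull request: the function names `decode_two_byte_identity` and `decode_one_byte_identity` are hypothetical, and the assumption is that a two-byte identity cmap reads each pair of bytes as one big-endian CID while a one-byte identity cmap reads each byte as one CID.

```python
import struct


def decode_two_byte_identity(code: bytes) -> tuple:
    """Hypothetical helper: read each pair of bytes as one big-endian CID."""
    n = len(code) // 2
    # A trailing odd byte is dropped; an input shorter than 2 bytes yields ().
    return struct.unpack('>%dH' % n, code[:2 * n]) if n else ()


def decode_one_byte_identity(code: bytes) -> tuple:
    """Hypothetical helper: read each single byte as one CID."""
    return tuple(code)


# The same string decoded under both assumptions.
text = b'AB'
print(decode_two_byte_identity(text))   # one CID built from both bytes
print(decode_one_byte_identity(text))   # one CID per byte
print(decode_two_byte_identity(b'A'))   # too short for a two-byte code
```

If a font actually uses one-byte codes, decoding it with the two-byte rule merges or drops bytes, which is one plausible way the extracted text could come out wrong or empty.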

eladkehat added a commit to eladkehat/yapdfminer that referenced this pull request May 1, 2019
pantuts added a commit to atlas-health/pdfminer that referenced this pull request Dec 12, 2019
pantuts added a commit to atlas-health/pdfminer that referenced this pull request Dec 12, 2019