-
Notifications
You must be signed in to change notification settings - Fork 10k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Some characters in the text layer are wrong compared to what is displayed #13260
Comments
I greped |
Please note that Adobe Reader, i.e. the reference implementation, is "wrong" as well here. (The real bug is in the incomplete /Encoding-data of the font in question.)
Given that the specification mentions "The names may appear in any order." for |
I noticed the sentence "The names may appear in any order." too and I don't really understand how |
I extracted the font from the pdf and I ran:
and I got I'm not sure there is an encoding issue: https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=271 |
Sure, but the |
I've got a potential patch for this locally, but I still need to run all tests and think through the various edge-cases involved. |
Attach (recommended) or Link to PDF file here:
https://web.archive.org/web/20091223123331/http://sci2s.ugr.es/keel/pdf/specific/articulo/alp99.pdf
(aka
tests/pdfs/issue1936.pdf
)When I copy/paste the multiplication sign on the first line of the first page, I got a
£
.And when I do the same in evince I got a
×
(the same in chrome).The font descriptor is:
So it should be possible to guess the correct character in using the
CharSet
entry:https://www.adobe.com/content/dam/acom/en/devnet/pdf/pdfs/PDF32000_2008.pdf#page=291
The text was updated successfully, but these errors were encountered: