-
Notifications
You must be signed in to change notification settings - Fork 10k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Text selection causes unnecessary line breaks #2569
Comments
Another PDF with the same issue: http://archive.cs.uu.nl/mirror/CTAN/macros/latex/contrib/chessboard/chessboard.pdf#page=43&zoom=auto,0,770. Also, spaces are not copied in your PDF if you copy 'The Mozilla Manifesto'. |
Another example: http://sundoc.bibliothek.uni-halle.de/diss-online/05/05H152/t4.pdf |
Well the top post states it is the issue with wkhtmltopdf, but in the case above, it has nothing to do with it. The bug has been around more than 1 year. |
@ReporterX I have updated the title of this issue to reflect the current state. The problem is that each character is put in a separate div. When you copy that, you get the abovementioned behaviour. IIRC work is being done to refactor the text layer, i.e., to reduce the amount of divs (combining them into one). |
Is there any solution to this? |
@timvandermeij Does it still look like the current refactoring work will alleviate this issue? |
@jasonparallel I'm afraid not. There are some PRs open for text layer alignment, but combining |
Is this related to #2989? In any case here is another example: https://www.ietf.org/proceedings/82/slides/rtcweb-13.pdf
|
The original PDF issue was fixed by #6590 (?). Rest of the problematic PDFs might have different problems -- the new issues shall be created for them. Closing as fixed. |
No, it was not. The issue is closed once the reporter's issue is resolved.
The #2989 talks about using span elements vs div and/or using events to replace the clipboard.
Thank you. |
Okay, I've created a new issue for this: #6659 |
PDFs created with "Print Pages to Pdf" addon based upon the opensource library wkhtmltopdf have an annoying issue when opened with pdf.js: if you copy and paste the textcontent of the PDF, each character is separated by a line break.
E.g.: With this PDF, if you copy and paste "The Mozilla Manifesto", you'll get:
T
h
e
M
o
z
i
l
l
a
M
a
n
i
f
e
s
t
o
The text was updated successfully, but these errors were encountered: