-
Notifications
You must be signed in to change notification settings - Fork 2.2k
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
The ruby rtc element is incorrectly processed. #1294
Comments
final merge for the progress report
Thanks -- fixed. I brought the implementation up to the current spec as defined by WHATWG (scroll to 'A start tag whose tag name is one of: "rb", "rtc"' and the following "rp" / "rt" section). Note that in jsoup (and other browsers), tags which aren't explicitly defined are still supported, they get default treatment. The bug described above was not because "rtc" was not known, it was because there was explicit handling for it -- and the spec changed since implemented. I'd appreciate it if interested users can test and review, and raise any issues found. |
Also, I checked that Tag defines the tags Those are all marked as inline (phrasing). Other tags ( |
Given the markup in the example from https://www.w3.org/TR/2001/REC-ruby-20010531/#complex:
the jsoup parser treats the
rtc
element as an unknown element that gets closed immediately. This causes it to serialize in xml mode as:I have checked the behaviour of Firefox and Chrome, and they preserve the
rtc
element structure, e.g.:The
rtc
element is supported in the W3C HTML spec [1], but not the WHATWG spec. Also, even though therbc
element is not listed in either of those (only in the Ruby Annotations specification), the jsoup parser preserves therbc
element structure.[1] https://www.w3.org/TR/2014/REC-html5-20141028/text-level-semantics.html#the-rtc-element
The text was updated successfully, but these errors were encountered: