Add feature to allow for parsing of self closing script tags #124
Thanks @Seanstoppable, will have a look.
Hi @Seanstoppable, I'm more in favor of removing features than adding new ones ;-). Can you please tell me a bit about the motivation behind this?
Sure, absolutely understood. I currently maintain a bunch of crawlers. At one point, we apparently decided we wanted to capture self-closing script tags in our parsing (despite them being non-compliant). In these cases, we still attempt to parse the attributes, in particular the script src attribute. We have a fork for essentially these lines, which I am trying to bring up to date. Rather than maintaining that fork, I am hoping to contribute the change back so we can return to using the official published jars. Since self-closed iframes are already supported (despite also being non-compliant), I was hoping this would bring parity for this tag and be a reasonable addition.
Absolutely. Tests added.
@Seanstoppable will merge this and have a look.
@Seanstoppable fixed (see commit) and made a new snapshot build available. Thanks for your contribution.
Awesome, thanks!
I discovered that my company was maintaining a fork to handle self-closing script tags.
Since functionality already exists to handle self-closed iframes and other self-closed tags, this adds a similar feature for script tags to close that gap.
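
For readers unfamiliar with the problem, here is a minimal, self-contained sketch of why self-closing script tags matter to a crawler. It is not the parser's actual implementation and does not use its API: the class `SelfClosingScriptDemo`, the method `scriptSrcs`, and the `allowSelfClosing` flag are hypothetical names chosen for illustration, and the regex-based scan only handles quoted `src` values. In strict mode the trailing `/` on `<script .../>` is ignored, so everything up to the next `</script>` is treated as script content; in lenient mode the self-closed tag is treated as a complete element.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Illustrative only: a toy scanner, not the real parser.
class SelfClosingScriptDemo {
    // Group 1 = attribute text, group 2 = "/" when the tag is written as self-closing.
    private static final Pattern SCRIPT_START =
            Pattern.compile("<script\\b([^>]*?)(/?)>", Pattern.CASE_INSENSITIVE);
    private static final Pattern SCRIPT_END =
            Pattern.compile("</script\\s*>", Pattern.CASE_INSENSITIVE);
    // Quoted src attribute values only (an assumption made to keep the sketch short).
    private static final Pattern SRC =
            Pattern.compile("\\bsrc\\s*=\\s*[\"']([^\"']*)[\"']", Pattern.CASE_INSENSITIVE);

    // Collects the src values of all script start tags the scanner recognizes.
    public static List<String> scriptSrcs(String html, boolean allowSelfClosing) {
        List<String> srcs = new ArrayList<>();
        Matcher start = SCRIPT_START.matcher(html);
        Matcher end = SCRIPT_END.matcher(html);
        int pos = 0;
        while (start.find(pos)) {
            Matcher src = SRC.matcher(start.group(1));
            if (src.find()) {
                srcs.add(src.group(1));
            }
            pos = start.end();
            boolean selfClosed = !start.group(2).isEmpty();
            if (!(allowSelfClosing && selfClosed)) {
                // Treat what follows as script content: skip to the closing tag.
                if (end.find(pos)) {
                    pos = end.end();
                } else {
                    break; // unterminated script: the rest of the input is content
                }
            }
        }
        return srcs;
    }

    public static void main(String[] args) {
        String html = "<script src=\"a.js\"/><script src=\"b.js\"></script>";
        System.out.println(scriptSrcs(html, false)); // prints [a.js]
        System.out.println(scriptSrcs(html, true));  // prints [a.js, b.js]
    }
}
```

In strict mode the second script tag is swallowed as the content of the first, so its `src` is never seen; with the lenient flag both are captured, which is the behavior our crawlers rely on.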