403 error for drupal.org domain #197
Comments
Such a link is behind authentication; when I visit it, I get https://www.drupal.org/user/login?destination=user/3013749. I had some ideas around this: #86. But for now, if it's an annoyance, I would suggest setting up the |
Also closing as a dupe of #86 |
@gjtorikian thanks, that makes sense. I didn't even realize that the What's odd, though, is that the last two errors in the build definitely do not require authentication but are yielding 403 errors:
If you have thoughts on those two please let me know. |
Now that is indeed interesting. 😄 |
I'm seeing this on several sites, including |
Both this and #200 (which may be related) seem domain based. Perhaps a test-wide domain blacklist? |
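To make the domain blacklist idea concrete, here is a minimal sketch in plain Ruby. It is not html-proofer's API: the `SKIPPED_DOMAINS` constant and the `skip_url?` helper are hypothetical names, and the listed domains are only examples taken from this thread.

```ruby
require "uri"

# Hypothetical skip list; the domains are illustrative, not a recommendation.
SKIPPED_DOMAINS = %w[drupal.org tumblr.com].freeze

# Returns true when the URL's host is a skipped domain or a subdomain of one.
# URLs that cannot be parsed, or that have no host, are never skipped.
def skip_url?(url, skipped = SKIPPED_DOMAINS)
  host = URI.parse(url).host
  return false if host.nil?

  host = host.downcase
  skipped.any? { |domain| host == domain || host.end_with?(".#{domain}") }
rescue URI::InvalidURIError
  false
end

links = [
  "https://www.drupal.org/project/drupal",
  "https://example.com/about"
]

# Only links outside the skip list would be passed on to the checker.
links_to_check = links.reject { |link| skip_url?(link) }
```

Matching on the host (with a leading dot for subdomains) rather than on a substring of the URL avoids accidentally skipping unrelated hosts whose names merely end in the same letters.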
This is definitely related to #200. Compare
to
|
@gjtorikian you're 100% right. Reached out to friends at Drupal and WP to see what's up. |
I appreciate that. Still, the problem seems to expand beyond just the scope of those large providers:
I'm thinking to try the initial /cc @i0rek as well since he may be unaware of this. |
I was indeed unaware. That's interesting, thanks for the heads up. Is there anything I can do? |
Links to Tumblr or WordPress don't allow pings. Check this issue for more information: gjtorikian/html-proofer#197.
I don't think so, at least as it pertains to this project. But you might ask future users to try changing their user-agent if Typhoeus fails. |
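As a rough illustration of the user-agent suggestion, the sketch below sends a HEAD request with a custom `User-Agent` header. It deliberately uses Ruby's standard `net/http` instead of Typhoeus so that it runs without extra gems; the `CUSTOM_USER_AGENT` string and both helper names are hypothetical, and this is not how html-proofer itself issues requests.

```ruby
require "net/http"
require "uri"

# Hypothetical user-agent string; choose one that identifies your project
# and gives site operators a way to contact you.
CUSTOM_USER_AGENT = "my-link-checker/1.0 (+https://example.com/contact)"

# Builds a HEAD request carrying the custom User-Agent header.
# Nothing is sent over the network here.
def build_head_request(url, user_agent = CUSTOM_USER_AGENT)
  request = Net::HTTP::Head.new(URI.parse(url))
  request["User-Agent"] = user_agent
  request
end

# Sends the request and returns the numeric HTTP status code.
def link_status(url, user_agent = CUSTOM_USER_AGENT)
  uri = URI.parse(url)
  request = build_head_request(url, user_agent)
  Net::HTTP.start(uri.host, uri.port, use_ssl: uri.scheme == "https") do |http|
    http.request(request).code.to_i
  end
end
```

Calling `link_status("https://drupal.org/project/drupal")` would then report whatever status the server returns for that user agent. A site that blocks by user agent may still answer 403 to a different string, so this is a workaround to try, not a guarantee.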
I checked with my engineering team. We blocked the Typhoeus user agent back on April 2nd because we had a user that was causing a denial of service on Drupal.org by attempting to mirror every piece of the site. That's not the fault of html-proofer per se, but there was definitely a user abusing Drupal.org with it. Given the size of Drupal.org and the age of some of our pages, I'm not sure that running html-proofer on the entire domain is best practice. If you want to do something like this for research, let me know and we can set up a rate-limited test on our staging site rather than production. If I'm missing why there is a need to run Drupal.org through html-proofer, let me know. We are open to working on something that improves the quality of our site. |
@joshuami thanks for your reply. I'm definitely not interested in running all of Drupal.org through html-proofer. Rather, I use html-proofer to check the validity of external links on my company's site. I'd like html-proofer to know that, for example, I linked correctly to https://drupal.org/project/drupal instead of https://drupal.org/project/drupl. Check out the build at the top of this issue for more detail: https://travis-ci.org/savaslabs/savaslabs.github.io/builds/57088282 |
@kostajh The next version of html-proofer will have a fix for this. @joshuami Thanks a bunch for the reply. I think my only concern here is to notify Typhoeus of the User-Agent ban so that legit users/projects (like this one!) are aware. It sucks that someone abused Typhoeus in that way. |
Any idea why html-proofer is returning "403 No error" for drupal.org links? See for example the output from https://travis-ci.org/savaslabs/savaslabs.github.io/builds/57088282

Other https links on that site work just fine.