IAFD 403 forbidden error #121

Closed
DrunkSith opened this issue Jul 13, 2024 · 16 comments
Labels
bug (Something isn't working), Cloudflare issue (The issue is due to a Cloudflare challenge or protection.)

Comments

@DrunkSith

I can't seem to scrape anything from IAFD.

@wkearney99

wkearney99 commented Jul 14, 2024

Same here. I get a 403 error within AMM. Using the same URL in a browser loads the page successfully.

And if I take the title.rme/id= field and use that as a manual ID it also throws the 403 error.

Is the User Agent string editable in AMM? Might that be something iafd is blocking?
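
The thread does not say whether AMM exposes its User-Agent. One way to test the hypothesis outside AMM is to request the same page with a browser-like User-Agent and compare status codes. The following is a minimal sketch using only Python's standard library; `TEST_URL` and `BROWSER_UA` are placeholder values chosen for illustration, not values taken from AMM.

```python
import urllib.error
import urllib.request

# Placeholder values for illustration only; not taken from AMM.
TEST_URL = "https://www.iafd.com/"
BROWSER_UA = (
    "Mozilla/5.0 (Windows NT 10.0; Win64; x64; rv:128.0) "
    "Gecko/20100101 Firefox/128.0"
)


def build_request(url: str, user_agent: str) -> urllib.request.Request:
    """Return a GET request that carries the given User-Agent header."""
    return urllib.request.Request(url, headers={"User-Agent": user_agent})


def fetch_status(url: str, user_agent: str, timeout: float = 15.0) -> int:
    """Fetch the URL and return the HTTP status code, including error codes."""
    request = build_request(url, user_agent)
    try:
        with urllib.request.urlopen(request, timeout=timeout) as response:
            return response.status
    except urllib.error.HTTPError as err:
        # urllib raises on 4xx/5xx; the code is still the answer we want.
        return err.code
```

Calling `fetch_status(TEST_URL, BROWSER_UA)` and getting 403 again would suggest the User-Agent alone is not what is being blocked.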

@wkearney99

Is this related? #108 (comment)

AMM utilizes DuckDuckGo for searches on the IAFD if no results are found during the onsite search. Occasionally, DuckDuckGo blocks too many searches, so AMM attempts accessing DuckDuckGo via the Tor network. However, in your case, this was unsuccessful five times. I'll take a look at the DDG search.
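
The fallback order described in that quoted comment can be sketched roughly as below. This is not AMM's code: the function name, the callable parameters, and the convention that a blocked search returns `None` are all assumptions made for illustration, and the default of five Tor attempts only mirrors the "five times" mentioned above.

```python
from typing import Callable, List, Optional

# Hypothetical signature: returns result URLs, or None when the search was blocked.
SearchFn = Callable[[str], Optional[List[str]]]


def search_with_fallback(
    query: str,
    onsite: SearchFn,
    ddg: SearchFn,
    ddg_via_tor: SearchFn,
    tor_attempts: int = 5,
) -> List[str]:
    """Try the onsite search, then DuckDuckGo, then DuckDuckGo over Tor."""
    # 1. Onsite search; fall through when it is blocked or finds nothing.
    results = onsite(query)
    if results:
        return results

    # 2. Direct DuckDuckGo search; None means DuckDuckGo blocked the request.
    results = ddg(query)
    if results is not None:
        return results

    # 3. Retry DuckDuckGo through Tor a limited number of times.
    for _ in range(tor_attempts):
        results = ddg_via_tor(query)
        if results is not None:
            return results
    return []
```

Passing the three searches in as callables keeps the sketch testable without any network access.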

@adultmm
Owner

adultmm commented Jul 14, 2024

Guys, please attach the corresponding log file. Thanks

@wkearney99

It's repeatable. No scrapes are successful from iafd. This is a simple test. It has one movie in the database, and ONLY the iafd scraper is enabled.

2024-07-15.csv

@adultmm
Owner

adultmm commented Jul 15, 2024

Thank you. I can see the 403 error in the log, but it works for me. I'll try to reproduce.

@wkearney99

It would really help if the log gave a bit more indication of how each connection was being made: directly, through a proxy, over Tor, etc. That would help point the finger toward the actual problem.
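
As a rough illustration of that request, a log line could record the route next to the status code. The sketch below is hypothetical: the logger name, the function names, and the `route=... status=... url=...` format are assumptions and do not reflect AMM's actual log format.

```python
import logging
from typing import Optional

# Hypothetical logger name; AMM's real logging setup is not shown in this thread.
logger = logging.getLogger("scraper")


def describe_route(proxy: Optional[str], via_tor: bool) -> str:
    """Name the route a request took: tor, a named proxy, or direct."""
    if via_tor:
        return "tor"
    if proxy:
        return f"proxy({proxy})"
    return "direct"


def log_request(
    url: str,
    status: int,
    proxy: Optional[str] = None,
    via_tor: bool = False,
) -> str:
    """Log one line per request, including how the connection was made."""
    line = f"route={describe_route(proxy, via_tor)} status={status} url={url}"
    logger.info(line)
    return line
```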

@adultmm
Owner

adultmm commented Jul 17, 2024

The actual problem, as usual, is the Cloudflare protection. I and many others are working on solving these challenges: FlareSolverr, CF-Clearance-Scraper, etc. If you can download the content of the URL with curl or any other tool, you have solved the problem. Send the solution, and I'll implement it in AMM.
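
When testing a candidate solution with curl or another tool, it helps to tell a Cloudflare block apart from an ordinary error. The function below is a heuristic sketch only, not AMM's detection logic: it relies on response headers Cloudflare commonly sends (`cf-mitigated`, `cf-ray`, `Server: cloudflare`) and on the "Just a moment" text seen on challenge pages, any of which may change. The function name is hypothetical.

```python
from typing import Mapping


def looks_like_cloudflare_block(
    status: int, headers: Mapping[str, str], body: str
) -> bool:
    """Heuristic guess: is this response a Cloudflare challenge or block?"""
    # Header names are case-insensitive, so normalise before comparing.
    lowered = {name.lower(): value.lower() for name, value in headers.items()}

    # Cloudflare marks challenged responses with this header.
    if lowered.get("cf-mitigated") == "challenge":
        return True

    served_by_cf = "cloudflare" in lowered.get("server", "") or "cf-ray" in lowered
    # Assumption: challenge pages contain this phrase; the wording may change.
    challenge_text = "just a moment" in body.lower()
    return served_by_cf and (status in (403, 503) or challenge_text)
```

A 403 that passes through Cloudflare from the origin server would also match, so a `True` result is a hint rather than proof.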

adultmm added the bug and Cloudflare issue labels Jul 17, 2024
@adultmm
Owner

adultmm commented Jul 18, 2024

Fixed in v0.12.12.

adultmm closed this as completed Jul 18, 2024
@adultmm
Owner

adultmm commented Jul 20, 2024

Try out the standalone version that works: v1.0.0-beta1.

adultmm reopened this Jul 21, 2024
@adultmm
Owner

adultmm commented Jul 21, 2024

@wkearney99 @DrunkSith does v1.0.0-beta1 scrape IAFD for you? Thanks

@adultmm
Owner

adultmm commented Jul 21, 2024

Cody has an issue with the v1.0.0-beta1 version as well: #123 (comment)

@wkearney99

wkearney99 commented Jul 25, 2024

Nope, 1.0.0.b1 gives an immediate 403 error, same as with 12.11.
2024-07-26.csv

@Fetterbr61

aaa-004
Can't scrape any Japanese movies, especially femdom sites.

@wkearney99

The problem seems to come and go. What's the code logic behind the scenes that would affect this?

@adultmm
Owner

adultmm commented Aug 19, 2024

As far as I can see, IAFD changed something in their Cloudflare settings, and now AMM can scrape their pages. Please reopen if the problem appears again.

adultmm closed this as completed Aug 19, 2024
@wkearney99

Seems to be a problem again. 403 errors. Even a paste of the manual ID entry fails with a 403.

4 participants