Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Dead links #463

Open
jcubic opened this issue Oct 16, 2024 · 2 comments
Open

Dead links #463

jcubic opened this issue Oct 16, 2024 · 2 comments

Comments

@jcubic
Copy link
Contributor

jcubic commented Oct 16, 2024

I've noticed one or two dead links. One pointed to (via two redirects to betting site) and the other to the marketplace to buy a domain.

If you are willing to merge my PR and I see if I can create GitHub workflow that will scan each link (in cron every week) if the field name appear on the blog.

I write the script locally using Puppeteer to scrap the text from the page and see if all URL have the somewhere on the page the title.

I think that deleting the URLs shoud be done by hand so I can show the list of invalid URLs that can be investigated by hand.

What do you think?

@Ryuno-Ki
Copy link

I wonder whether Puppeteer is needed here (instead of cURL + htmlq for example).

@jcubic
Copy link
Contributor Author

jcubic commented Oct 17, 2024

A lot of people create websites in React that doesn't work without JavaScript.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants