Web Scraping

⚠️ WEB SCRAPING ETIQUETTE⚠️

Always play nice with the websites you're scraping; check out their rules and get the green light if needed. Steer clear of swiping personal stuff and be copyright-conscious. Oh, and stay in the know about the legal side of scraping – we don't want any surprise legal drama, right?

ALWAYS CHECK THE robots.txt file of the website you are scraping, this will show you which pages you can and cannot crawl.

This project includes:

Simple web scraping of a page from the Vegan Society News. I first retrived the news cards on the page, saving them into a CSV file. Next I proceeded by scraping the images on the news, and saving them to my machine.

Popular libraries for web scraping

Scrappy
BeautifulSoup - used in this small project
Selenium

Web scraping Steps

Crawl
Parse and transform
Store

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
scraped_images		scraped_images
README.md		README.md
requirements.txt		requirements.txt
scraping_beautiful_soup.py		scraping_beautiful_soup.py
vegan_news.csv		vegan_news.csv

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Web Scraping

This project includes:

Popular libraries for web scraping

Web scraping Steps

About

Releases

Packages

Languages

fonsecagabriella/Web_Scraping

Folders and files

Latest commit

History

Repository files navigation

Web Scraping

This project includes:

Popular libraries for web scraping

Web scraping Steps

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages