This project is based on Python3 and BeautifulSoup4 library for web scraping. It allows us to make some automatization during information search in Google.
- Web scraper, which allows us to make some automatization during information search in Google.
- Script showing scheme for coding web scraper with BeautifulSoup.
- The main purpouse for coding it was getting familiare with BeautifulSoup library.
- Python 3
- BeautifulSoup4 library
Google can block IP adress used for running this code (necessity to use VPN/proxy).
- Project code starts with importing libraries needed for the rest of the code.
- It opens keywords.txt file to get information about keywords.
- After getting each keyword it makes Google query with that keyword and opens it in browser.
- It scraps information about the total numbers of results.
- It writes this information into CSV file.
- It goes in the loop to every container, which storages a link and scraps it.
- It writes these links into CSV file.
Project is: complete, but some improvement still can be done.
Room for improvement:
- making code more clean through putting functionalities in functions
- handling exceptions (for excemple, when file keywords.txt doesn't exist)
Created by Jacek Mendyk - feel free to contact me!