Google Image Scraper for faces, for training AI models

A library for scraping Google Images for a specified person. The library resizes the images to a specified resolution (standard: 512x512), crops them and makes sure the face is still in the image. This library can be used to train AI models such as Stable Diffusion on a specific person.

Pre-requisites:

Google Chrome
Selenium (pip install Selenium)
Pillow (pip install Pillow)

Setup:

Open command prompt

Clone this repository (or download)

git clone https://github.com/rundfunk47/Google-Image-Scraper

Install Dependencies
```
pip install -r requirements.txt
```

Usage:

This project was created to bypass Google Chrome's new restrictions on web scraping from Google Images.

Type

python main.py --search-key "Elon Musk" --token_name "emsk"

This will search Google Images for "Elon Musk", detect the face, resize the image and keep the face within the frame. Photos will be stored with the names "photos/Elon Musk/emsk (1).jpg", "photos/Elon Musk/emsk (2).jpg" and so on in this example.

Type

python main.py --help

for all the arguments

The app also comes with a script, rename.py, to help you rename files in the generated folder. This is good if you want to manually remove some photos but want to name the files like ("emsk (1).jpg", "emsk (b).jpg") and so on. It is run with the same arguments:

python rename.py --search-key "Elon Musk" --token_name "emsk"

IMPORTANT:

This program will install an updated webdriver automatically. There is no need to install your own.

Name		Name	Last commit message	Last commit date
Latest commit History 93 Commits
webdriver		webdriver
.gitattributes		.gitattributes
.gitignore		.gitignore
GoogleImageScraper.py		GoogleImageScraper.py
ImageProcessor.py		ImageProcessor.py
README.md		README.md
SeleniumScraper.py		SeleniumScraper.py
haarcascade_frontalface_default.xml		haarcascade_frontalface_default.xml
juypter_main.ipynb		juypter_main.ipynb
main.py		main.py
patch.py		patch.py
rename.py		rename.py
requirements.txt		requirements.txt
youtube_thumbnail.PNG		youtube_thumbnail.PNG

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Google Image Scraper for faces, for training AI models

Pre-requisites:

Setup:

Usage:

IMPORTANT:

About

Releases

Packages

Languages

rundfunk47/Google-Image-Scraper

Folders and files

Latest commit

History

Repository files navigation

Google Image Scraper for faces, for training AI models

Pre-requisites:

Setup:

Usage:

IMPORTANT:

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages