A Python script that scrapes wedding location information from a website and stores it in Elasticsearch or in CSV format.
Python 3
- Clone the repository:
git clone https://github.com/ocriado91/python-wedding-scraper
- Install the required Python libraries using pip and the requirements.txt file:
pip install -r requirements.txt
- Start the Elasticsearch server:
sudo systemctl start elasticsearch
- Run the script:
python3 main.py
You can customize the scraping with the following command-line options:
- --first_page: Index of the first page to extract information from
- --last_page: Index of the last page to extract information from
- --remove_index: Remove the Elasticsearch index before injecting new data
- --store_csv: Save the information in CSV format
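As a rough illustration of how these four options could be parsed, here is a minimal sketch using `argparse` from the standard library. The function name `build_parser`, the integer types, the default values, and the choice of boolean switches for `--remove_index` and `--store_csv` are assumptions made for this example; the actual `main.py` may define them differently.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Build a parser for the four options (hypothetical helper, not from main.py)."""
    parser = argparse.ArgumentParser(description="Wedding location scraper (sketch)")
    # Assumption: page indices are integers and default to 1.
    parser.add_argument("--first_page", type=int, default=1,
                        help="Index of the first page to extract information from")
    parser.add_argument("--last_page", type=int, default=1,
                        help="Index of the last page to extract information from")
    # Assumption: both of these are on/off switches that take no value.
    parser.add_argument("--remove_index", action="store_true",
                        help="Remove the Elasticsearch index before injecting new data")
    parser.add_argument("--store_csv", action="store_true",
                        help="Save the information in CSV format")
    return parser


# Parse an explicit argument list instead of sys.argv so the example is reproducible.
args = build_parser().parse_args(["--first_page", "2", "--last_page", "5", "--store_csv"])
print(args.first_page, args.last_page, args.remove_index, args.store_csv)
```

Under these assumptions, the equivalent command line would be `python3 main.py --first_page 2 --last_page 5 --store_csv`.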
This script is for educational purposes only. Use it at your own risk and make sure to respect the website's terms of service and robots.txt.
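One way to check a URL against a site's robots.txt rules before scraping is `urllib.robotparser` from the standard library. The sketch below is not part of this repository: the helper `is_allowed`, the sample rules, and the `example.com` URLs are hypothetical and only illustrate the idea.

```python
from urllib.robotparser import RobotFileParser


def is_allowed(robots_txt: str, url: str, user_agent: str = "*") -> bool:
    """Return True if the given robots.txt content permits fetching the URL."""
    parser = RobotFileParser()
    # Parse rules from a string; for a live site, use set_url() and read() instead.
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


# Hypothetical robots.txt content used only for this example.
EXAMPLE_ROBOTS = """\
User-agent: *
Disallow: /private/
"""

print(is_allowed(EXAMPLE_ROBOTS, "https://example.com/venues?page=1"))
print(is_allowed(EXAMPLE_ROBOTS, "https://example.com/private/data"))
```

With the sample rules, the first URL is permitted and the second is blocked. Note that robots.txt does not cover everything in a site's terms of service, so both should be reviewed.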