Skip to content

ocriado91/python-wedding-scraper

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

8 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

python-wedding-scraper

A Python script that scrapes wedding location information from a website and stores it in Elasticsearch or in CSV format.

image

Prerequisites

Python 3

Beautfilsoup

Elasticsearch

GeoPy

Pandas

Requests

Installing

  1. Clone the repository:
git clone https://github.com/ocriado91/python-wedding-scraper
  1. Install the required Python libraries using pip and the requirements.txt file:
pip install -r requirements.txt

Usage

  1. Start the Elasticsearch server:
sudo systemctl start elasticsearch
  1. Run the script:
python3 main.py

Customization

You can modify the following variables in the script to customize the scraping:

  • --first_page: First index of page to extract information
  • --last_page: Last index of page to extract information
  • --remove_index: Remove Elasticsearch index before inject new data
  • --store_csv: Save information in CSV format

Disclaimer

This script is for educational purposes only. Use it at your own risk and make sure to respect the website's terms of service and robots.txt.

About

A wedding python scraper

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages