This repository maps the Yelp Dataset to a relational database model using SQL Alchemy.
The database model schema can be found in this PDF file.
Download the dataset here: Kaggle. The dataset files must go in the data/real folder.
Create a python virtual environment and install the required packages.
virtualenv venv -p python3;
source venv/bin/activate;
pip install -r requirements.txt;
Launch the init_db.py script.
python src/init_db.py
The sqlite database file will be located at the root of the repository,
under the name db.sqlite
.