This project was written for CITS3401 Data Warehousing, Semester 1 2019. It is a two-part project: Data Warehousing (40 marks) and Data Mining (20 marks), with 2 bonus marks available in Data Mining.
The project uses Kaggle's PetFinder Adoption Prediction data set. ETL is performed in Python, and the concept hierarchies are drawn in draw.io.
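Both paths below are relative to the project root, so return to it before running the second pair of commands. To run the ETL:
$ cd submission/etl
$ python etl.py
To build the associations:
$ cd submission/associations
$ python build_associations.py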
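The commands above run the project's own scripts, whose contents are not reproduced here. As a rough illustration of what an ETL step over this kind of data might look like, the sketch below uses only the Python standard library. It is not the code in `etl.py`. The column names follow the public PetFinder `train.csv`, but the sample rows, the `age_group` thresholds, the `IsFree` flag, and every function name are assumptions made for the example.

```python
import csv
import io

# Hypothetical sample rows. The column names match the public PetFinder
# train.csv, but the values are made up for illustration.
SAMPLE_CSV = """PetID,Type,Age,Fee,AdoptionSpeed
a1,1,3,0,2
b2,2,14,50,4
c3,1,70,0,1
"""

# In the PetFinder data, Type 1 is a dog and Type 2 is a cat.
TYPE_LABELS = {"1": "Dog", "2": "Cat"}


def age_group(months):
    """Roll an age in months up one level of an assumed concept hierarchy."""
    if months < 12:
        return "Young"
    if months < 60:
        return "Adult"
    return "Senior"


def extract(text):
    """Extract: parse CSV text into a list of dictionaries."""
    return list(csv.DictReader(io.StringIO(text)))


def transform(rows):
    """Transform: decode labels and derive the hierarchy-level attributes."""
    out = []
    for row in rows:
        out.append({
            "PetID": row["PetID"],
            "Type": TYPE_LABELS.get(row["Type"], "Unknown"),
            "AgeGroup": age_group(int(row["Age"])),
            "IsFree": int(row["Fee"]) == 0,
            "AdoptionSpeed": int(row["AdoptionSpeed"]),
        })
    return out


def load(rows):
    """Load: serialise the rows as CSV text, ready for a bulk import."""
    buf = io.StringIO()
    writer = csv.DictWriter(buf, fieldnames=list(rows[0].keys()),
                            lineterminator="\n")
    writer.writeheader()
    writer.writerows(rows)
    return buf.getvalue()


fact_rows = transform(extract(SAMPLE_CSV))
print(load(fact_rows))
```

In a real run, `extract` would read the downloaded Kaggle file and `load` would write a file for SQL Server to import. Both are kept in memory here so the sketch runs without any external data.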
- Microsoft SQL Server
- Microsoft Visual Studio SSDT
- Microsoft SQL Server Management Studio
- Python 3
- Weka
- PowerBI
- draw.io
These files should be used only as a guide to understanding the project. Copying the code in any way risks a finding of Academic Misconduct at the University of Western Australia. Discretion is advised.