E6895 Big Data Final Project UNI: rz2357 yz2996
We got the data from NYC opendata for 2016 crimes and 2005-2015 crimes.
The temperature data of New York City was obtained from Prof. John Kissock’s website.
The Analysis
folder includes scripts of preprocessing and analysis of the data.
The Prediction
folder includes jupyter notebook scrips of clean and build the model step by step. Used smaller dataset.
It trained both Naive Bayes model and Random Forest Model.
The py file is used for processing all the 10 year data.