Hotspots detection with unsupervised ML

The task at hand is to detect hotspots for Uber pickup in NY to better help drivers be where they are needed.

The data

The dataset is obtained from the Uber Pickups in New York City dataset on Kaggle, and focuses on the month of May 2014.

Clustering

Ouliers are initially removed using DBScan. Hotspots are then determined using KMeans on 9 centroids. The ideal number of centroids is based on the evaluation of the silhoutte score and inertia score of KMeans models train on different number of clusters. The clusters are determined on an hour-by-hour and day-by-day basis.

Data display

Clusters are displayed on an interactive Plotly graph

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
Hour_map.html		Hour_map.html
README.md		README.md
Uber_project_notebook.ipynb		Uber_project_notebook.ipynb
uber-raw-data-may14.csv.zip		uber-raw-data-may14.csv.zip

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Hotspots detection with unsupervised ML

The data

Clustering

Data display

About

Releases

Packages

Languages

HelenaCGarry/Hotspot-detection-with-ML

Folders and files

Latest commit

History

Repository files navigation

Hotspots detection with unsupervised ML

The data

Clustering

Data display

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages