Skip to content

juliast224/group_project_ny_taxi_data

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

5 Commits
 
 
 
 
 
 

Repository files navigation

Group project focused on analysing New York Taxi Data via PySpark (receiving 20/20 points)

  1. Finding out where to put up bus routes
  2. Multinomial logistic regression to classify into no tips / low tips / high tips
  3. Helping taxi drivers where in the city they should go next
  4. K-means clustering to find out where to put taxi stands
  5. Page rank algorithm to find important traffic nodes