Group project analysing New York taxi data with PySpark (awarded 20/20 points).

  1. Identifying where to set up bus routes
  2. Multinomial logistic regression to classify trips into no tips / low tips / high tips
  3. Helping taxi drivers decide where in the city to go next
  4. K-means clustering to find out where to put taxi stands
  5. PageRank algorithm to find important traffic nodes
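
As a rough illustration of item 5, the idea behind PageRank on trip data can be sketched in plain Python: each pickup/drop-off pair is treated as a directed edge, and a zone ranks highly when many trips flow into it from other well-ranked zones. This is a minimal sketch and not the project's PySpark code. The function name `pagerank`, the zone names, and the trip list are hypothetical, and the damping factor of 0.85 and 50 iterations are assumed defaults.

```python
def pagerank(edges, damping=0.85, iterations=50):
    """Rank the nodes of a directed graph given as (source, destination) pairs."""
    nodes = sorted({node for edge in edges for node in edge})
    out_links = {node: [] for node in nodes}
    for src, dst in edges:
        out_links[src].append(dst)

    n = len(nodes)
    rank = {node: 1.0 / n for node in nodes}
    for _ in range(iterations):
        # Rank held by nodes with no outgoing trips is spread evenly over all nodes.
        dangling = sum(rank[node] for node in nodes if not out_links[node])
        base = (1.0 - damping + damping * dangling) / n
        new_rank = {node: base for node in nodes}
        for src, targets in out_links.items():
            if not targets:
                continue
            # Each node passes an equal share of its damped rank to every destination.
            share = damping * rank[src] / len(targets)
            for dst in targets:
                new_rank[dst] += share
        rank = new_rank
    return rank


# Hypothetical trips as (pickup zone, drop-off zone); not taken from the dataset.
trips = [
    ("Midtown", "JFK"),
    ("JFK", "Midtown"),
    ("Harlem", "Midtown"),
    ("Brooklyn", "Midtown"),
    ("Midtown", "Brooklyn"),
]
ranks = pagerank(trips)
most_important = max(ranks, key=ranks.get)
```

On a real dataset the same computation would run distributed, for example over a PySpark DataFrame of trips grouped by zone, rather than over an in-memory list.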