Big Data Analysis Practice Projects

This repository collects all the practice projects completed in a Big Data course. A description of each project can be found in its folder.

Source

  • Hadoop
    • Common Friends
    • Top-10 Common Friends pairs
    • Yelp Dataset Top 10 rating businesses information
    • Yelp Dataset Palo Alto businesses Rating
  • Spark
    • Common Friends
    • Top-10 Common Friends pairs
    • Yelp Dataset Top 10 rating businesses information
    • Yelp Dataset Palo Alto businesses Rating
  • Spark Stream
    • Movie Clustering (spark-mllib, K-means)
    • Collaborative filtering: measure the accuracy (MSE) of an ALS model
    • App Data correction and App Prediction (KMM)
    • Twitter Sentiment Analyzer (Spark Streaming and Kafka)
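
As a rough illustration of the idea behind the Common Friends and Top-10 Common Friends pairs projects, here is a minimal plain-Python sketch of the usual map/reduce formulation. It is not the code in the project folders: the function names (`map_phase`, `reduce_phase`, `common_friends`, `top_pairs`) and the in-memory adjacency dictionary are assumptions made for illustration, and the actual Hadoop and Spark implementations may differ.

```python
from collections import defaultdict


def map_phase(user, friends):
    """For each friend of `user`, emit (sorted pair, user's full friend set)."""
    for friend in friends:
        pair = tuple(sorted((user, friend)))
        yield pair, set(friends)


def reduce_phase(records):
    """Group records by pair and intersect the two friend sets for each pair."""
    grouped = defaultdict(list)
    for pair, friend_set in records:
        grouped[pair].append(friend_set)
    # A pair has two records only when the friendship is listed on both sides.
    return {
        pair: set.intersection(*sets)
        for pair, sets in grouped.items()
        if len(sets) == 2
    }


def common_friends(adjacency):
    """Hypothetical driver: run the map and reduce steps over an in-memory graph."""
    records = [
        record
        for user, friends in adjacency.items()
        for record in map_phase(user, friends)
    ]
    return reduce_phase(records)


def top_pairs(common, n=10):
    """Return the n pairs with the most common friends (ties broken by pair name)."""
    return sorted(common.items(), key=lambda kv: (-len(kv[1]), kv[0]))[:n]


if __name__ == "__main__":
    # Toy friendship graph, invented for this example.
    graph = {
        "A": ["B", "C", "D"],
        "B": ["A", "C", "D"],
        "C": ["A", "B"],
        "D": ["A", "B"],
    }
    for pair, shared in top_pairs(common_friends(graph)):
        print(pair, sorted(shared))
```

In a real Hadoop or Spark job the grouping by pair is done by the shuffle stage rather than by a dictionary, but the intersection step per pair is the same.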