Analyzes book review data from Amazon and the Amazon-Vine program utilizing PySpark and Amazon Web Service's Relational Database Service (AWS RDS)
-
Updated
Sep 12, 2024 - Jupyter Notebook
Analyzes book review data from Amazon and the Amazon-Vine program utilizing PySpark and Amazon Web Service's Relational Database Service (AWS RDS)
Tutorials on Big Data essentials: Hadoop, MapReduce, Spark.
Practice tasks in Python programming language using Hadoop, MRJob, PySpark for Big Data Analytics.
RECUPERACIÓ DE LA INFORMACIÓ Curs 2023-24 EPSEVG
Big Data analysis project using MapReduce in Python to process movie ratings. Includes scripts for aggregating ratings and identifying the most rated movies, demonstrating data analysis on a large scale.
Exercises in the Scala programming language with an emphasis on big data programming and applications in Apache Hadoop and Apache Spark.
Samples related to data engineering, e.g. spark, embulk, airflow, etc.
The largest collection of publicly accessible Progressive Web Apps*
Analyzing Amazon product reviews
ETH analysis using big data for the QMUL Big Data Processing module. Intended to promote analysis of data retrieved via big data processing
In this project, I used Decision Tree Learning Model as the main algorithm to build the model. Due to the big amount of flight data, we implement the project using MRJob, PySpark and Spark's MLlib then compare the performance and accuracy of those implementations.
Criando seu Ecossistema de Big Data na Nuvem
En esta práctica se empaqueta y distribuye una aplicación Python que descarga y analiza tweets en función de puntuaciones de sentimiento. Los resultados del análisis se guardan en una base de datos MongoDB, y la información se muestra en la web.
Add a description, image, and links to the mrjob topic page so that developers can more easily learn about it.
To associate your repository with the mrjob topic, visit your repo's landing page and select "manage topics."