Data Analytics Laboratory
-
Updated
Sep 23, 2024 - R
Data Analytics Laboratory
hadoop mapreduce algorithm with hadoop streaming (Python)
Worked on Hadoop file streaming
Leveraging the mapreduce paradigm we propose a solution to parallelize the feedforward operation of neural networks in order to speed it up for sufficiently large NN architectures and for sufficiently large datasets. Tested Using the MNIST dataset results can be found in the results.html and results.ipynb files.
Installation and configuration of Hadoop on Google Colaboratory
Hadoop Projects
Text Processing Using Hadoop
A Hadoop MapReduce application to find the maximum temperature in every day of the years 1901 and 1902 from the NCDC weather records.
Learning Hadoop MapReduce Using Python
Market Basket Analysis using Hadoop MapReduce in Python
Построение рекомендательной системы на основе алгоритма коллаборативной фильтрации и технологии Hadoop Streaming
A small library example how to work with binary files with Hadoop Streaming.
Processing and transforming data via Hadoop Ecosystem
Implementation of Word2Vec for large datasets as a Map-Reduce Job using Hadoop Streaming.
A REST-based service that translates the SQL query into MapReduce and Spark jobs. It runs these jobs and provides the JSON object. SQL to MapReduce and Spark translator.
Bootcamp ministrado pela IGTI com o objetivo de abordar de forma intensiva conceitos e práticas da análise de dados, habilitando o aluno para atuar profissionalmente na área.
Mutations
Add a description, image, and links to the hadoop-streaming topic page so that developers can more easily learn about it.
To associate your repository with the hadoop-streaming topic, visit your repo's landing page and select "manage topics."