emr-cluster

Player Unknown's Battlegrounds (PUBG), is a first person shooter game where the goal is to be the last player standing. You are placed on a giant circular map that shrinks as the game goes on, and you must find weapons, armor, and other supplies in order to kill other players / teams and survive.

spark hive aws-lambda api-gateway bigdata s3-bucket tableau aws-cloudformation emr-cluster

Updated Jan 27, 2021
Python

a-Imantha / Mahout-Tutorial

Star

Building a Recommender with Apache Mahout on Amazon Elastic MapReduce (EMR) Tutorial

emr s3-bucket mahout hdfs awscli emr-cluster

Updated Mar 20, 2021
Python

donjude / data-lakes-with-spark

Star

This project is about building a data lake and creating an ETL pipeline in Spark that loads data from Amazon S3, processes the data into analytics tables, and loads them back into S3

python spark apache-spark hadoop ec2 s3 aws-cli hdfs mapreduce amazon-web-services datalake aws-athena spark-sql emr-cluster etl-pipeline

Updated Jun 15, 2021
Python

anthonywong611 / Batch-ETL-with-AWS-EMR-and-MWAA

Star

Create a data pipeline on AWS to execute batch processing in a Spark cluster provisioned by Amazon EMR. ETL using managed airflow: extracts data from S3, transform data using spark, load transformed data back to S3.

airflow s3-bucket aws-cloudformation batch-processing emr-cluster

Updated Jul 12, 2021
Python

praveen-gopal-reddy / ETL-Spark-EMR-AWS-MusicData

Star

To implement a data lake using S3 and Spark on an EMR cluster using AWS Cloud9 environment and develop an ETL pipeline for a Data Lake that extracts data from S3, processes the data using Spark, and loads the data back into S3 as a set of dimensional tables.

python bootstrap spark pyspark cloud9 s3-storage datalake emr-cluster

Updated Jul 30, 2021
Python

demousersrccode / emr-on-airflow-toolkit

Star

A template for creating Amazon EMR clusters using either Amazon MWAA or a Dockerized Airflow Container as a workflow environment

aws airflow emr-cluster mwaa

Updated Aug 10, 2021
Python

Tanay0510 / Data-Lake-with-Spark

Star

Load data from S3, process the data into analytics tables using Spark and load them back into S3. Deployed this Spark process on a cluster using AWS EMR

spark s3 datalake emr-cluster etl-pipeline

Updated Aug 17, 2021
Python

mikeacosta / florasense

Star

Orchestrating Cloud ETL Workloads

aws cloudformation apache-spark lambda-functions data-warehouse data-lake kinesis-stream redshift step-functions emr-cluster etl-pipeline redshift-spectrum

Updated Sep 19, 2021
Python

Improve this page

Add a description, image, and links to the emr-cluster topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the emr-cluster topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

emr-cluster

Here are 37 public repositories matching this topic...

sepulworld / serverless-aws-emr-boilerplate

carlossanchezvega / twitter

darkhipo / emr-example

alikemalocalan / alibaba-cloud-emr-create-examples

deepakag5 / Cloud-Computing-AWS

san089 / goodreads_etl_pipeline

JohnnyLVP / Project-Standar-Documentation

AmandaJunqueira / BigData

jpsalado92 / Udacity-DEND_DataLake-AWSEMR

ucaiado / etl-spark-aws

ucaiado / etl-intraday-bidask

JevyanJ / emr-helper

nileshsingal / PUBG-DATA-ANALYSIS

a-Imantha / Mahout-Tutorial

donjude / data-lakes-with-spark

anthonywong611 / Batch-ETL-with-AWS-EMR-and-MWAA

praveen-gopal-reddy / ETL-Spark-EMR-AWS-MusicData

demousersrccode / emr-on-airflow-toolkit

Tanay0510 / Data-Lake-with-Spark

mikeacosta / florasense

Improve this page

Add this topic to your repo