- spark_emr_dev - Demo of submitting Hadoop ecosystem jobs to AWS EMR
- spark-etl-pipeline - Demo of various Spark ETL processes
- utility_Scala - Scala/Spark programming basic demo
# ├── README.md
# ├── athena : athena query
# ├── build.sbt : build.sbt build sbt dev env
# ├── config : config for cres access AWS, 3rd party services
# ├── data : sample data for script tes
# ├── doc : ref docs
# ├── hive : hive scripts
# ├── project : sbt project files
# ├── pyspark : pyspark code
# ├── quick_start.sh : help script run sbt/spark commands
# ├── script : help script
# ├── src : main scala spark ETL code
# ├── target : compiled java file
# └── task_step : json files define tasks at EMR