Skip to content

prakass1/udacity-sparkify-etl-cassandra

Repository files navigation

udacity-sparkify-etl-cassandra

Udacity Nanodegree - Project-2 Sparify Cassandra ETL

Introduction

The sparkify team is interested in modelling a cassandra database based on their analytical questions. You as a data engineer are supposed to work with the team and their underlying data to create a new cassandra database, and model it in accordance to the queries.

Goals

  1. Modelling the Database.
  2. Understanding of primary key, composite key, and clustering columns.
  3. Node paritioning based on keys.

TODO

  1. Setup a docker-compose for cassandra.

About

Udacity Nanodegree - Project-2 Sparify Cassandra ETL

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published