Skip to content

Latest commit

 

History

History
13 lines (10 loc) · 562 Bytes

README.md

File metadata and controls

13 lines (10 loc) · 562 Bytes

udacity-sparkify-etl-cassandra

Udacity Nanodegree - Project-2 Sparify Cassandra ETL

Introduction

The sparkify team is interested in modelling a cassandra database based on their analytical questions. You as a data engineer are supposed to work with the team and their underlying data to create a new cassandra database, and model it in accordance to the queries.

Goals

  1. Modelling the Database.
  2. Understanding of primary key, composite key, and clustering columns.
  3. Node paritioning based on keys.

TODO

  1. Setup a docker-compose for cassandra.