Skip to content

Sample Apache Airflow DAG to do the orchestration of etl tasks

Notifications You must be signed in to change notification settings

mshobana/stackoverflow_etl

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

3 Commits
 
 
 
 

Repository files navigation

stackoverflow_etl

Sample Apache Airflow DAG to do the orchestration of etl tasks

Problem

Domain: Stackoverflow create report where we need the usernames of stackoverflow posts with top scores

Data source * users - postgres * posts - MySQL

  • Extract the data in one place as JSON (GCS)

  • Move to a staging area (working area) (Bigquery)

  • Transform the data to a format from which users can easily generate the report

  • Load into a datasource (Bigquery)

  • Generate the report

About

Sample Apache Airflow DAG to do the orchestration of etl tasks

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages