Skip to content

KanikaGaikwad/Uber-data-engineering-project-ETL-pipeline

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

19 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Uber Data Analytics Project

106656783-1597073190121-gettyimages-1137475171-GERMANY_UBER This project analyzes Uber data using SQL on Google BigQuery to gain insights into ride-sharing patterns and optimize operations.

Project Objectives

  • Explore and understand the Uber data schema (star schema with fact and dimension tables)
  • Build an ETL pipeline using Mage.ai to load data from Google Cloud Storage to BigQuery
  • Utilize SQL queries in BigQuery to generate reports and answer business questions
  • Create an analytics table for building interactive dashboards in Looker

Technical Stack

Project Setup

Getting Started

  • This project requires familiarity with SQL, BigQuery, and potentially Looker.
  • The Jupyter notebooks contain the data transformation code.
  • The Mage.ai pipeline automates the data loading process to BigQuery.
  • SQL queries written for BigQuery are available within the project directory (or specific location).

Further Exploration

  • Explore specific business questions related to Uber's ride-sharing data.
  • Utilize advanced SQL functions for deeper data analysis.
  • Develop interactive dashboards in Looker to showcase insights

Star Schema Diagram

uber_table

Looker Dashboard

Uber Looker Dashboard

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published