NYC DCA ETL

This project is an ETL pipeline built in Azure Data Factory.

I began by extracting data from New York City Open Data: https://opendata.cityofnewyork.us/
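As a rough illustration of the extraction step, here is a minimal Python sketch for pulling a dataset down as CSV. It assumes the dataset is exposed through the portal's Socrata endpoint at `data.cityofnewyork.us`. The dataset ID `abcd-1234` and the helper names `build_csv_url` and `download_csv` are hypothetical placeholders, not values taken from this project.

```python
import shutil
import urllib.request
from urllib.parse import urlencode

# Assumption: the dataset is served from the Socrata API host used by NYC Open Data.
BASE_URL = "https://data.cityofnewyork.us/resource"


def build_csv_url(dataset_id: str, limit: int = 1000) -> str:
    """Build the CSV export URL for a dataset, capped at `limit` rows."""
    # Keep the "$" in "$limit" unescaped so the query parameter stays readable.
    query = urlencode({"$limit": limit}, safe="$")
    return f"{BASE_URL}/{dataset_id}.csv?{query}"


def download_csv(dataset_id: str, dest_path: str, limit: int = 1000) -> str:
    """Stream the CSV export to a local file and return the file path."""
    url = build_csv_url(dataset_id, limit)
    with urllib.request.urlopen(url) as response, open(dest_path, "wb") as out:
        shutil.copyfileobj(response, out)
    return dest_path


if __name__ == "__main__":
    # "abcd-1234" is a hypothetical ID; substitute the real dataset identifier.
    print(build_csv_url("abcd-1234", limit=50))
```

The downloaded file could then be uploaded to the blob container by hand or with whichever Azure tooling is already in use.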

From there, I created a blob container within an existing storage account. I then set up Azure Data Factory to run a series of T-SQL transformations on the CSV files, with the goal of loading the data into a Parquet file. The dataflow looks like this:

The ETL process ultimately produced a Parquet file hosted within a generated blob in the container:

I then went back into my container to look for any issues to troubleshoot. There were none to resolve, so I monitored activity in my container in a private dashboard:
