Skip to content

Obtaining aggregated demographic features (age, race, sex) from 2010 US Census Summary Files and import into MongoDB

Notifications You must be signed in to change notification settings

Tporkka/Transform-and-Load-US-Census-Data-to-MongoDB

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 

Repository files navigation

ETL Script for loading aggregated demographic data from US Census Summary File.

This script extracts and aggregates demographic metrics from the 2010 US Census Summary File 1. My goal was to create JSON objects to display total populations grouped by age, gender, and race for each area of interest. Once I obtained the following form JSON, I exported the JSON objects to MongoDB.

Sample JSON object:

    {  
    "_id" : "asdfasdfa102838",  
    "LOGRECNO" : "000023445",  
    "FIPS" : "55025",  
    "name" : "Dane County",  
    "age" : "15-17",  
    "sex" : "Male",  
    "race" : "Asian",  
    "count" : 26541,  
    }  
];  

The script is flexible and will obtain data for any state, county, subcounty/place with some minor adjustments. For sake of simplicity and processing time I decided to only obtain county level data within Wisconsin. From there I exported all the demographic data for a single county (Dane County).

About

Obtaining aggregated demographic features (age, race, sex) from 2010 US Census Summary Files and import into MongoDB

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages