The US pipelines incidents

From 2010 to October 2017

A Jupyter notebook to analyze trends in pipeline incidents in the US from 2010 to October 2017. Different aspects of the incidents are considered to answer these five questions:

How common are spills?
What is their spatial and temporal distributions?
What is their scale regarding volume and cost?
What are the main causes of spills?
What places have a higher risk?

How it works?

At this moment, all the code is in pipeline.ipynb file. It mainly performs the following actions:

Checks the latest update in the dataset. If the date is more than number_of_days variable, it downloads the latest dataset from the server and replaces the old dataset locally.
Extracts the required columns from the dataset, cleans the values (both text and NaN) and converts the units to more useful ones. Finally it saves the cleaned dataset locally. It also exports a json file containing the summary of data. This file will be used to create a website which shows the summary using D3 library.
Plots multiple figures showing temporal and spatial trends in spills and their financial and environmental damage.
Some figures are exported to be used in the README file and the final report.

Required libraries

numpy
pandas
matplotlib
plotly

Dataset

The dataset contains 'Flagged Incidents' from PHMSA Pipeline Safety website.

Authors

Mahdi Sadjadi - http://mahdisadjadi.com/

Reference

This repository is also published as a blog post.

License

This project is licensed under the MIT License - see the LICENSE.md file for details. The dataset is downloaded from PHMSA.

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
hl2010toPresent.csv		hl2010toPresent.csv
hl2010toPresent_cleaned.csv		hl2010toPresent_cleaned.csv
incident_ditribution.png		incident_ditribution.png
incident_states.png		incident_states.png
pipeline.ipynb		pipeline.ipynb

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

The US pipelines incidents

From 2010 to October 2017

How it works?

Required libraries

Dataset

Authors

Reference

License

About

Releases

Packages

Languages

License

Mahdisadjadi/pipeline-incidents

Folders and files

Latest commit

History

Repository files navigation

The US pipelines incidents

From 2010 to October 2017

How it works?

Required libraries

Dataset

Authors

Reference

License

About

Topics

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages