Everything you need to run the notebook comes bundled in the Anaconda python installation
If you don't want to install from anaconda, you will need
- Python:
- Jupyter notebooks: https://jupyter.org/install
- Numpy Python's scientific computing package. http://www.numpy.org/
- Pandas A very useful data analysis and manipulation package http://pandas.pydata.org/
- Scikit-learn A comprehensive machine learning library http://scikit-learn.org/stable/
- Matplotlib Package for creating charts and other visualizations https://matplotlib.org/
The data exploration part of this lesson is deried from a tutorial on Algorithmic Fairness presented by Flavio Calmon and Caitlin Kuhlman at the Broadening Participation in Data Mining Workshop (BPDM) @KDD 2017 in Halifax, Nova Scotia, and by Caitlin Kuhlman at BPDM @ Howard University in 2019.