Skip to content

Supporting materials for DSVIL 2017, the Data Science & Visualization Institute hosted by NCSU Libraries. April 26, 2017

License

Notifications You must be signed in to change notification settings

libjohn/DSVIL2017

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

DSVIL2017

Supporting materials for a DSVIL presentation on April 26, 2017

Slides

You can find (read, share, etc.) my presentation slides

Time

9:00 am - 11:00 am Web Scraping: Gathering Data from Websites, Parsing HTML & JSON, Orchestrating APIs, and Gathering Twitter Streams (John Little, Duke University)

Description

Preexisting and clean data sets such as the General Social Survey (GSS) or Census data are readily available, cover long periods of time, and have well documented codebooks. Meanwhile, researchers increasingly want to gather their own data from websites which provides a different layer of complexity; accessing content from these sources requires different tools and new techniques. In this workshop we will use an open-source data wrangling tool (OpenRefine) to gather and clean data from webpages, and "crawl" whole websites, discuss and use Application Programming Interfaces (API), and give examples of how APIs are used with social media sources such as Twitter.

Data Science and Visualization Institute for Librarians

About

Supporting materials for DSVIL 2017, the Data Science & Visualization Institute hosted by NCSU Libraries. April 26, 2017

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published