Lupus Big Data Project

The purpose of this project is to analyze data from Twitter using Big Data tools to see what Lupus patients, and others affected by Lupus, are saying. This information can be used to help direct Lupus research.

Gathering Data

Twitter offers an array of APIs to developers, but we mostly use the search APIs.

To run our code and gather data through these APIs, you'll need your own API access tokens from the Twitter developer page. (We keep our private access tokens out of this public repo.)
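One common way to keep tokens out of a public repo is to read them from environment variables at run time. The sketch below assumes that approach; the variable names and the `load_twitter_credentials` helper are hypothetical and are not taken from this project's code.

```python
import os

# Hypothetical variable names; pick whatever fits your own setup.
REQUIRED_KEYS = (
    "TWITTER_APP_KEY",
    "TWITTER_APP_SECRET",
    "TWITTER_OAUTH_TOKEN",
    "TWITTER_OAUTH_TOKEN_SECRET",
)


def load_twitter_credentials(env=None):
    """Return the four credentials as a dict, or raise if any are missing."""
    # Default to the real process environment; a plain dict can be passed
    # instead, which is handy for testing.
    env = os.environ if env is None else env
    missing = [key for key in REQUIRED_KEYS if not env.get(key)]
    if missing:
        raise RuntimeError("Missing environment variables: " + ", ".join(missing))
    return {key: env[key] for key in REQUIRED_KEYS}
```

Set the four variables in your shell (or in a local file that is excluded from version control) before running any script that calls this helper.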

Analyzing Data

We mostly use two languages, R and Python 3.x, because both are well suited to data analysis and easy to get started with for a project like this.

To make gathering data easier, we use packages such as rtweet (for R) and Twython (for Python).
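As a rough illustration of what a search with Twython can look like, here is a minimal sketch. It is not this project's actual collection script: the helper names, the default query "lupus", and the fields kept from each tweet are assumptions, and it presumes your account has access to the standard search endpoint.

```python
def make_client(app_key, app_secret, oauth_token, oauth_token_secret):
    """Build a Twython client from your own credentials."""
    # Imported here so the rest of the module works without Twython installed.
    from twython import Twython

    return Twython(app_key, app_secret, oauth_token, oauth_token_secret)


def search_tweets(client, query="lupus", count=100):
    """Run one search request and keep a few fields from each tweet.

    `client` is anything with a Twython-style `search` method.
    """
    # tweet_mode="extended" asks the API for untruncated text ("full_text").
    response = client.search(q=query, count=count, tweet_mode="extended")
    tweets = []
    for status in response.get("statuses", []):
        tweets.append(
            {
                "id": status["id_str"],
                "created_at": status["created_at"],
                # Fall back to "text" if the extended field is absent.
                "text": status.get("full_text") or status.get("text", ""),
            }
        )
    return tweets
```

Typical usage would be `search_tweets(make_client(...))` with your four credentials. Passing the client in as an argument keeps the parsing logic easy to test with a stand-in object.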

Findings & Notebooks

Here are notebooks with our findings: