In order to be able to repeat the clustering on the same dataset, it can make sense to first gather a set of tweets for processing.
-
Get Twitter API key
-
Create Twitter Account
-
Create Twitter Application (https://apps.twitter.com/)
-
Create consumer key and secret
-
Save keys in
config.txt
separated by line break in the following order:-
consumer_key
-
consumer_secret
-
access_token
-
access_token_secret
-
-
-
Run script:
python parse_data.py
two new files are created containing the gathered tweets
-
twitter.dat: tweets in JSON-format
-
twitter.db: same tweets in sqlite database
-