- Sign-up for a Twitter developer account on this link
- Create a Bearer Token (documentation)
- Fill in the field
BEARER_TOKEN
in the.env
file - Create a Reddit developed application on this link (documentation)
- Fill in the fields
CLIENT_ID
,SECRET_TOKEN
,USERNAME
andPASSWORD
in the.env
file - Install and run Kafka (documentation)
- Create a MongoDB database in the cloud (free) or install the server (documentation)
- Fill in the fields
CONNECTION_STRING
in the.env
file
docker-compose -f docker-compose.yml up
- Clone the repository
git clone https://github.com/baptiste-pasquier/trend-tracker
- Install the project
- With
poetry
(installation) :
poetry install
- With
pip
:
pip install -e .
- Install pre-commit
pre-commit install
Warning Each script must be run in a separate console
- Twitter streaming:
python all_services/ingest_tweets/app.py
- Reddit streaming:
python all_services/ingest_reddit/app.py
- Data preprocessing:
python all_services/tsf_data/app.py
- Data clustering:
python all_services/cluster_data/app.py
- Data storage on MongoDB:
python all_services/store_data/app.py
Real-time visualization:
streamlit run streamlit_app.py