lol22_ids20_jan24_project

Introductory sentiment analysis ML project for the Metis Intro to Data Science course

Feb 7 Questions:

1) What is the question you hope to answer?

Perform sentiment analysis on tweets about US airlines to answer the question: given a particular tweet, is it positive, negative, or neutral? How accurate is the prediction? Can the accuracy be improved further by using different features, different models, or hyperparameter tuning?

2) What data are you planning to use to answer that question?

Twitter US Airline Sentiment (Kaggle): "Analyze how travelers in February 2015 expressed their feelings on Twitter." https://www.kaggle.com/crowdflower/twitter-airline-sentiment

3) What do you know about the data you're using so far?

I am familiar with Twitter and the structure of tweets in general, but this is the first time I've seen this dataset. It is fairly self-explanatory: the main feature is the tweet itself ('text' in this dataset), and the label is 'airline_sentiment' (positive, negative, or neutral).
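As a minimal sketch of that structure, the snippet below reads a CSV with pandas and keeps only the feature and label columns. The sample rows, the `airline` column, and the `load_tweets` helper are made up for illustration; only the 'text' and 'airline_sentiment' column names and the three label values come from the description above.

```python
import io

import pandas as pd

# Made-up rows in the same shape as the dataset; not real tweets.
SAMPLE_CSV = io.StringIO(
    "airline_sentiment,airline,text\n"
    "negative,ExampleAir,my flight was delayed three hours\n"
    "positive,ExampleAir,great crew and an early arrival\n"
    "neutral,ExampleAir,what time does check-in open?\n"
)

VALID_LABELS = {"positive", "negative", "neutral"}


def load_tweets(source):
    """Read a CSV and keep only the feature ('text') and the label ('airline_sentiment')."""
    df = pd.read_csv(source, usecols=["text", "airline_sentiment"])
    # Drop rows where either the tweet or its label is missing.
    df = df.dropna(subset=["text", "airline_sentiment"])
    # Fail early if the label column holds anything other than the three classes.
    unexpected = set(df["airline_sentiment"]) - VALID_LABELS
    if unexpected:
        raise ValueError(f"unexpected labels: {sorted(unexpected)}")
    return df[["text", "airline_sentiment"]].reset_index(drop=True)


tweets = load_tweets(SAMPLE_CSV)
print(tweets)
```

The same helper should accept a file path in place of the in-memory sample, for example the CSV downloaded from Kaggle.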

4) Why did you choose this topic?

NLP is widely used today, and analyzing text is something I find interesting.

Feb 14 Questions:

1) What data have you gathered, and how did you gather it?

The data was downloaded from Kaggle (the Twitter US Airline Sentiment dataset linked above).

2) How have you explored the data and what insights have you gained as a result?

EDA was performed using Pandas and Seaborn. The main insights were that the dataset is imbalanced across the sentiment classes, which words appear most often in positive and negative tweets, and that libraries such as NLTK and WordCloud can be used to analyze the tweets further.
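The two checks mentioned here, class balance and most frequent words per sentiment, can be sketched with the standard library alone. The tweets below are made up, and `class_shares` and `top_words` are hypothetical helpers rather than the notebook's actual code; no stop-word removal is applied, so common words like "the" would show up on real data.

```python
import re
from collections import Counter

# Made-up (text, label) pairs standing in for the real tweets.
tweets = [
    ("flight delayed again and bag lost", "negative"),
    ("delayed two hours with no update", "negative"),
    ("worst service ever", "negative"),
    ("thanks for the great flight", "positive"),
    ("what time does boarding start", "neutral"),
]


def class_shares(rows):
    """Return the fraction of rows that carry each sentiment label."""
    counts = Counter(label for _, label in rows)
    total = sum(counts.values())
    return {label: n / total for label, n in counts.items()}


def top_words(rows, label, n=3):
    """Return the n most frequent lowercase words among tweets with the given label."""
    words = Counter()
    for text, row_label in rows:
        if row_label == label:
            words.update(re.findall(r"[a-z']+", text.lower()))
    return words.most_common(n)


shares = class_shares(tweets)
# A model that always predicts the largest class reaches this accuracy,
# so it is a useful floor when the classes are imbalanced.
majority_baseline = max(shares.values())

print(shares)
print(top_words(tweets, "negative", n=1))
print(majority_baseline)
```

Comparing a model's accuracy against `majority_baseline` is one way to tell whether it has learned anything beyond the class imbalance.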

3) Will you be able to answer your question with this data, or do you need to gather more data (or adjust your question)?

Yes, the question can be answered with this data; no additional data is needed.

4) What modeling approach are you using to answer your question?

The plan is to use Logistic Regression, since it performs well on supervised classification problems and gives easy-to-interpret results. It can then be compared against another model such as Naive Bayes, which also does well on basic NLP problems.
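A rough sketch of that comparison, assuming scikit-learn and TF-IDF features (the feature choice is an assumption, not something stated above). The training and test tweets are made up, and `evaluate` is a hypothetical helper; scores on such a tiny toy set say nothing about performance on the real data.

```python
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.metrics import accuracy_score
from sklearn.naive_bayes import MultinomialNB
from sklearn.pipeline import make_pipeline

# Made-up tweets; a real run would split the Kaggle data instead.
train_texts = [
    "flight delayed and bag lost",
    "worst service ever",
    "cancelled flight no refund",
    "great crew thanks",
    "love the new seats",
    "smooth flight and friendly staff",
    "what time does boarding start",
    "is there wifi on this flight",
    "which gate for the morning flight",
]
train_labels = ["negative"] * 3 + ["positive"] * 3 + ["neutral"] * 3

test_texts = ["bag lost again", "friendly crew", "what gate is boarding"]
test_labels = ["negative", "positive", "neutral"]


def evaluate(classifier):
    """Fit a TF-IDF + classifier pipeline and return accuracy on the held-out tweets."""
    model = make_pipeline(TfidfVectorizer(), classifier)
    model.fit(train_texts, train_labels)
    return accuracy_score(test_labels, model.predict(test_texts))


scores = {
    "logistic_regression": evaluate(LogisticRegression(max_iter=1000)),
    "naive_bayes": evaluate(MultinomialNB()),
}
print(scores)
```

Both models share the same vectorizer, so any difference in the scores comes from the classifier. On the real dataset, a stratified split (for example `train_test_split` with its `stratify` argument) would keep the class imbalance consistent between the training and test sets.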
