The literature provides different expectations about the relationship between the emphasis of an issue through established parties, the vote share of the party 'owning' this issue (i.e. the party being associated with it), and the importance of an issue among the general public.
This project sets out to provide a new measure of parties' emphasis of the immigration issue. It does so by applying supervised machine learning techniques to press releases of German mainstream parties. Additionally, this data will be aggregated monthly and merged with data on public salience (from Google Trends), and polling performance of the German radical right party AfD (from PollOfPolls). Lastly, the project will analyse these data using time-series regression, in order to disentangle the relationship between mainstream parties' issue emphasis of immigration, the issue's salience, and the AfD's vote share in the polls.
- Mainstream party communication on immigration is not affected by the vote share of the AfD in recent polls.
- The AfD vote share in current polls is not affected by mainstream parties' communication on immigration.
- The AfD vote share in current polls is unaffected by public salience of immigration.
- Public salience of immigration increases in response to mainstream parties' press releases on the topic.
- Public salience of immigration decreases the share of mainstream parties' press releases related to immigration.
- A short theoretical introduction to the project that formulates some hypotheses to be tested and describes the output to be produced.
- The code used to scrape the links;
- The code used to scrape the actual press releases.
- The code pre-processing the text.
- A notebook showing how the training and test set for the classifier were coded.
- A notebook used to train the classifier.
- A notebook preparing the variables for time-series estimation.
- The notebook analyzing the data to test the initially formulated hypotheses using time series estimation (VAR).
- A set of slides showcasing the preliminary results presented at the Hot Politics Lab (University of Amsterdam).
Additionally, the used and collected data are provided here:
- A dataset of AfD-polls from PollOfPolls.
- A dataset of monthly Google searches relating to immigration, taken from Google Trends.
- A dataset of the speeches including the coding
- A merged dataset of the aggregated monthly share of mainstream parties' press releases, merged with data on the Google searches about immigration, as well as the the monthly polling of the AfD vote share, based on poll of polls.
- A folder with preliminary data used to construct the coded and aggregated datasets.
1: This project started out as the final project for Damian Trilling's Big Data and Automated Content Analysis course.