The scope of this project is to extract information from media text (Dow Jones Newswires, Handelsblatt etc.), perhaps social media text and call transcripts. Information to extract concerns the following.
- Does the given article mention the cause of inflation?
- if yes, what are the causes?
Before extracting information from the data, we need to filter out non-relevant articles by keywords and tags (if they exist).
- keywords
- en: {"inflation", "prices"}
- de: {}
- tags
- DJN: {"N/DJIB", "N/DJG", "N/GPRW", "N/DJAN", "AWSJ", "WSJE", "N/PREL", "N/NRG", "N/DJBN", "N/AWP", "N/BRNS", "N/JNL", "N/WAL", "N/WLS", "N/WSJ"}
- FAZ: {}
- SZ: {}
- Handelsblatt: {}
Year | #Articles | #Articles (deduplicated) | #Articles mentioning inflation cause |
---|---|---|---|
2018 | 153600 | 152698 | 16114 |
2019 | 139939 | 139529 | 13750 |
2020 | 133093 | 132722 | 14010 |
2021 | 145854 | 144772 | 17526 |
2022 | 187663 | 186971 | 26111 |
2023 | 13762 | 13701 | 1779 |
- Binary Classification
- Zero-shot prompting
- Few-shot prompting
- tba
Resulting articles mentioning the cause of inflation are considered to be a filtered set of articles that will go to the second task.
- Multiclass Classification following schema defined by Andre et al.
- Zero-shot prompting
- Few-shot prompting
- tba
- Dataset creation
- Target data
- Methodology validation: Dow Jones Newswires (en) <-> Andre et al.
- Generalization: FAZ, SZ, Handelsblatt (de) <-> Ontology from Uli and Max survey
- Filtering
1. Regex (inflation, prices)2. Article tags (quite aggressive, but do not know the inner mechanism. DJN has it, not sure about FAZ, SZ and Handelsblatt) - Annotation
- 500 news articles for each dataset
- For DJN, we follow Andre et al., ontology to build DAG
- For German dataset, we need to wait unutil the survey is done.
- Additionally, we want the direction of inflation and factors to be annotated
- Target data
- Tasks
- Has causes?
- What causes?
- Sentiment analysis (document level) -> should be generalizable from public corpora
- prompts
- consider direction (inflation and deflation)
- compare inflation and rise of prices, deflation and drop of prices