POPQUORN: the Potato-Prolific dataset for Question-Answering, Offensiveness, text Rewriting and politeness rating with demographic Nuance
POPQUORN is a collaborative effort between Potato and Prolific to create an open-source dataset that includes both individual annotations and annotators' backgrounds. POPQUORN contains 45,000 annotations from 1,484 annotators drawn from a sample representative of the US population (by age, sex, and race). This GitHub repo contains the raw annotations as well as detailed demographic information. POPQUORN is released with the following paper:
Jiaxin Pei and David Jurgens. When Do Annotator Demographics Matter? Measuring the Influence of Annotator Demographics with the POPQUORN Dataset. The 17th Linguistic Annotation Workshop (LAW-XVII) @ ACL 2023.
All of the annotation interfaces are available in the potato project-hub. Each one can be fetched, launched, and opened locally with the commands below.
Install potato-annotation
pip install potato-annotation
[fetch] potato get offensiveness
[launch] potato start offensiveness -p 8000
[Annotate] http://localhost:8000/?PROLIFIC_PID=user
[fetch] potato get reading_comprehension
[launch] potato start reading_comprehension -p 8000
[Annotate] http://localhost:8000/?PROLIFIC_PID=user
[fetch] potato get text_rewriting
[launch] potato start text_rewriting -p 8000
[Annotate] http://localhost:8000/?PROLIFIC_PID=user
[fetch] potato get politeness_rating
[launch] potato start politeness_rating -p 8000
[Annotate] http://localhost:8000/?PROLIFIC_PID=user
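The four tasks above follow the same fetch, launch, annotate pattern, which can be sketched as a small shell helper. This is a minimal sketch, not part of potato itself: `print_steps` is a hypothetical function name, and the port and `PROLIFIC_PID` defaults simply mirror the values used above. It only prints the commands instead of running them, on the assumption that `potato start` keeps running as a server until stopped, so each task is best launched in its own terminal.

```shell
#!/bin/sh
# Hypothetical helper (not shipped with potato): print the three steps
# for one project-hub task. It does not execute anything.
#   $1 = task name, $2 = port (default 8000), $3 = annotator id (default "user")
print_steps() {
  task="$1"
  port="${2:-8000}"
  pid="${3:-user}"
  echo "potato get ${task}"
  echo "potato start ${task} -p ${port}"
  echo "http://localhost:${port}/?PROLIFIC_PID=${pid}"
}

# Task names are the four listed in this README.
for task in offensiveness reading_comprehension text_rewriting politeness_rating; do
  print_steps "$task"
done
```

To try a task on a different port or with a different annotator id, pass them as extra arguments, for example `print_steps text_rewriting 9000 alice`.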