Code for the paper *Quantifying Stereotypes in Language* (accepted at EACL 2024).
| train | test | val |
| --- | --- | --- |
| 2,260 | 358 | 358 |
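As a quick sanity check, the sketch below loads each split with pandas and prints its size; the paths assume the repo's `data/` layout used by the commands below.

```python
import pandas as pd

# Print the number of rows in each split; expected sizes are
# train=2,260, test=358, val=358 (see the table above).
for split in ("train", "test", "val"):
    df = pd.read_csv(f"data/{split}.csv")
    print(split, len(df))
```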
You can run the following command to train a model (choose one of the listed backbones):

```bash
python train.py \
  --mode train \
  --pre_trained_model_name_or_path [bert-base-uncased, distilbert-base-uncased, roberta-base] \
  --train_path data/train.csv \
  --val_path data/val.csv \
  --test_path data/test.csv \
  --lr 0.00001 \
  --max_len 50 \
  --max_epochs 30 \
  --batch_size 128 \
  --model_saving_path models
```
The trained weights are saved to the `models` folder.
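If you want to fine-tune all three backbones in one go, a minimal convenience sketch (using only the flags shown above) could look like this:

```python
import subprocess

# Fine-tune each supported backbone with the settings from the command above.
for backbone in ("bert-base-uncased", "distilbert-base-uncased", "roberta-base"):
    subprocess.run(
        [
            "python", "train.py",
            "--mode", "train",
            "--pre_trained_model_name_or_path", backbone,
            "--train_path", "data/train.csv",
            "--val_path", "data/val.csv",
            "--test_path", "data/test.csv",
            "--lr", "0.00001",
            "--max_len", "50",
            "--max_epochs", "30",
            "--batch_size", "128",
            "--model_saving_path", "models",
        ],
        check=True,  # stop if a run fails
    )
```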
You can run the following command to predict stereotype scores with a trained model:

```bash
python train.py \
  --mode predict \
  --pre_trained_model_name_or_path models/bert-base-uncased \
  --predict_data_path data/predict/cp_sentence.csv \
  --test_saving_path results/bert_cp_sentence_results.csv
```
The prediction results are saved to the `results` folder.
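To get a quick look at the predictions, you can load the output CSV with pandas; the exact column names depend on `train.py`'s output format, so inspect the header first:

```python
import pandas as pd

results = pd.read_csv("results/bert_cp_sentence_results.csv")
print(results.columns.tolist())  # check which column holds the scores
print(results.head())
print(results.describe())        # summary statistics of the numeric columns
```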
You can also download the fine-tuned weights for direct use from Hugging Face:
For the BERT model:

```python
# Load the fine-tuned BERT model from the Hugging Face Hub
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("lauyon/quantifying-stereotype-bert")
model = AutoModelForSequenceClassification.from_pretrained("lauyon/quantifying-stereotype-bert")
```
For the DistilBERT model:

```python
# Load the fine-tuned DistilBERT model from the Hugging Face Hub
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("lauyon/quantifying-stereotype-distilbert")
model = AutoModelForSequenceClassification.from_pretrained("lauyon/quantifying-stereotype-distilbert")
```
For the RoBERTa model:

```python
# Load the fine-tuned RoBERTa model from the Hugging Face Hub
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("lauyon/quantifying-stereotype-roberta")
model = AutoModelForSequenceClassification.from_pretrained("lauyon/quantifying-stereotype-roberta")
```
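Once a checkpoint is loaded, scoring a sentence looks like the minimal sketch below. It assumes the checkpoints expose a single-logit regression head, so the raw logit is read as the predicted stereotype score; the input sentence is only an illustrative placeholder.

```python
import torch
from transformers import AutoTokenizer, AutoModelForSequenceClassification

tokenizer = AutoTokenizer.from_pretrained("lauyon/quantifying-stereotype-bert")
model = AutoModelForSequenceClassification.from_pretrained("lauyon/quantifying-stereotype-bert")
model.eval()

sentence = "Example sentence to score."
inputs = tokenizer(sentence, return_tensors="pt", truncation=True, max_length=50)
with torch.no_grad():
    logits = model(**inputs).logits

# With a single output neuron the logit itself is the stereotype score;
# if the head turns out to be multi-class, apply a softmax over the logits instead.
print(logits.squeeze().item())
```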
If this work has helped you, please cite it as follows:
```bibtex
@inproceedings{liu-2024-quantifying,
    title = "Quantifying Stereotypes in Language",
    author = "Liu, Yang",
    editor = "Graham, Yvette and Purver, Matthew",
    booktitle = "Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics (Volume 1: Long Papers)",
    month = mar,
    year = "2024",
    address = "St. Julian{'}s, Malta",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2024.eacl-long.74",
    pages = "1223--1240",
    abstract = "A stereotype is a generalized perception of a specific group of humans. It is often potentially encoded in human language, which is more common in texts on social issues. Previous works simply define a sentence as stereotypical and anti-stereotypical. However, the stereotype of a sentence may require fine-grained quantification. In this paper, to fill this gap, we quantify stereotypes in language by annotating a dataset. We use the pre-trained language models (PLMs) to learn this dataset to predict stereotypes of sentences. Then, we discuss stereotypes about common social issues such as hate speech, sexism, sentiments, and disadvantaged and advantaged groups. We demonstrate the connections and differences between stereotypes and common social issues, and all four studies validate the general findings of the current studies. In addition, our work suggests that fine-grained stereotype scores are a highly relevant and competitive dimension for research on social issues. The models and datasets used in this paper are available at https://anonymous.4open.science/r/quantifying{\_}stereotypes{\_}in{\_}language.",
}
```