A translation of the SICK dataset for evaluating relatedness and entailment models in Dutch. SICK-NL was obtained by semi-automatically translating SICK (Marelli et al., 2014). In addition, we provide two stress tests derived from our translation, which deal with semantically equivalent but syntactically different phrasings of the same sentence.
We display some of the evaluation results below. For full details, please refer to our EACL 2021 paper, which we ask you to cite if you use any of our code, data, or information from the paper:
@inproceedings{wijnholds-etal-2021-sicknl,
title = "SICK-NL: A Dataset for Dutch Natural Language Inference",
author = "Wijnholds, Gijs and Moortgat, Michael",
booktitle = "Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics",
month = apr,
year = "2021",
address = "Online",
publisher = "Association for Computational Linguistics",
url = "https://www.aclweb.org/anthology/2021.eacl-main.126/",
}
The code implements the evaluation of English and Dutch BERT, RoBERTa, and multilingual BERT models on SICK, SICK-NL, and the two stress tests, treated as Natural Language Inference (NLI) tasks. We use the HuggingFace Transformers library to load and train the models; for the Dutch models, we evaluate BERTje and RobBERT. A minimal sketch of this setup is given below.
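The sketch below illustrates fine-tuning a Dutch model on SICK-NL as a three-way NLI task with HuggingFace Transformers. It is not the repository's exact training script: the Hub id GroNLP/bert-base-dutch-cased (BERTje), the file name SICK_NL.txt, and the SICK-style column names (sentence_A, sentence_B, entailment_label, SemEval_set) are assumptions.

```python
# Minimal sketch, not the repository's exact training script.
# Assumed: BERTje under the Hub id "GroNLP/bert-base-dutch-cased", and a
# tab-separated SICK_NL.txt with SICK-style columns (sentence_A, sentence_B,
# entailment_label, SemEval_set).
import pandas as pd
import torch
from torch.utils.data import Dataset
from transformers import (AutoModelForSequenceClassification, AutoTokenizer,
                          Trainer, TrainingArguments)

LABELS = {"NEUTRAL": 0, "ENTAILMENT": 1, "CONTRADICTION": 2}

class NLIDataset(Dataset):
    """Sentence pairs encoded for a three-way sequence classification model."""
    def __init__(self, frame, tokenizer):
        self.encodings = tokenizer(list(frame["sentence_A"]),
                                   list(frame["sentence_B"]),
                                   truncation=True, padding=True)
        self.labels = [LABELS[label] for label in frame["entailment_label"]]

    def __len__(self):
        return len(self.labels)

    def __getitem__(self, idx):
        item = {k: torch.tensor(v[idx]) for k, v in self.encodings.items()}
        item["labels"] = torch.tensor(self.labels[idx])
        return item

df = pd.read_csv("SICK_NL.txt", sep="\t")
tokenizer = AutoTokenizer.from_pretrained("GroNLP/bert-base-dutch-cased")
model = AutoModelForSequenceClassification.from_pretrained(
    "GroNLP/bert-base-dutch-cased", num_labels=3)

train_set = NLIDataset(df[df["SemEval_set"] == "TRAIN"], tokenizer)
test_set = NLIDataset(df[df["SemEval_set"] == "TEST"], tokenizer)

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="nli_out", num_train_epochs=3,
                           per_device_train_batch_size=16),
    train_dataset=train_set,
)
trainer.train()

# Accuracy on the test split.
output = trainer.predict(test_set)
accuracy = (output.predictions.argmax(-1) == output.label_ids).mean()
print(f"NLI accuracy: {accuracy:.4f}")
```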
As a baseline, we also evaluate static embeddings on the relatedness task of SICK and SICK-NL, using the English skipgram vectors of word2vec and Dutch skipgram vectors; a sketch of this baseline follows.
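The following sketch of the static-embedding baseline is likewise illustrative rather than the repository's exact script: the vector file name is hypothetical, sentences are represented by a plain average of in-vocabulary word vectors compared with cosine similarity, and Spearman correlation is used here only as an example metric (see the paper for the exact evaluation setup).

```python
# Minimal sketch of the static-embedding relatedness baseline.
# Assumed: word2vec-format skipgram vectors in "skipgram_vectors.bin"
# (hypothetical filename) and SICK-style columns sentence_A / sentence_B /
# relatedness_score; the correlation measure here is illustrative.
import numpy as np
import pandas as pd
from gensim.models import KeyedVectors
from scipy.stats import spearmanr

vectors = KeyedVectors.load_word2vec_format("skipgram_vectors.bin", binary=True)

def sentence_vector(sentence):
    """Average the vectors of all in-vocabulary tokens."""
    tokens = [t for t in sentence.lower().split() if t in vectors]
    if not tokens:
        return np.zeros(vectors.vector_size)
    return np.mean([vectors[t] for t in tokens], axis=0)

def cosine(u, v):
    denom = np.linalg.norm(u) * np.linalg.norm(v)
    return float(u @ v / denom) if denom else 0.0

df = pd.read_csv("SICK_NL.txt", sep="\t")
predictions = [cosine(sentence_vector(a), sentence_vector(b))
               for a, b in zip(df["sentence_A"], df["sentence_B"])]
rho, _ = spearmanr(predictions, df["relatedness_score"])
print(f"Relatedness correlation: {rho:.4f}")
```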
Relatedness results:

| Model (EN) | SICK | Model (NL) | SICK-NL |
|---|---|---|---|
| Skipgram | 69.49 | Skipgram | 56.94 |
| BERT<sub>cls</sub> | 50.78 | BERTje<sub>cls</sub> | 49.06 |
| BERT<sub>avg</sub> | 61.36 | BERTje<sub>avg</sub> | 55.55 |
| RoBERTa<sub>cls</sub> | 46.62 | RobBERT<sub>cls</sub> | 43.93 |
| RoBERTa<sub>avg</sub> | 62.71 | RobBERT<sub>avg</sub> | 52.33 |
NLI results (accuracy):

| Model (EN) | SICK | Model (NL) | SICK-NL |
|---|---|---|---|
| BERT | 87.34 | BERTje | 83.94 |
| mBERT | 87.02 | mBERT | 84.53 |
| RoBERTa | 90.11 | RobBERT | 82.02 |