GitHub - alvations/stasis: Semantic Textual Similarity in Python

Stasis - Python wrapper for Semantic Similarity datasets

Under the auspice of the EXPERT project (http://expert-itn.eu/), we have written a python wrapper to the STS datasets and we hope that it helps anyone with easy manipulation the datasets.

If you just need a tab-separated file, you can easily find the sts.csv available in the same repository. The repo also contains other (maybe) useful datasets that are manually compiled by the maintainer when they are free.

Disclaimer: The repository comes as it is. It should NOT be considered as the official SemEval's (Semantic Textual Similarity) STS data and it is not affiliated with the STS organizers. We've created this so that people can easily do something like pandas.read_csv('sts.csv') or graphlab.SFrame('sts.csv') and work with the dataframes with little hassle.

Datasets

Below is a list of datasets/wrappers you can find here

STS: SemEval Semantic Textual Similarity (STS2012 - 2015)
CLSS: SemEval Cross-level Semantic Similarity (CLSS)
SICK: Sentences Involving Compositional Knowledge

Contribute

Please feel free to add datasets/wrappers to the repository. Or post an issue to request for wrappers to the repository.

Cite

Please cite the respective references for the datasets when using them in your publication!

If you want to cite this repository, you can cite this paper where we created used the sts.csv in SemEval-2015

Name		Name	Last commit message	Last commit date
Latest commit History 135 Commits
CLSS-data		CLSS-data
MegaEXPERT-2016		MegaEXPERT-2016
SICK-data		SICK-data
STS-data		STS-data
notebooks		notebooks
other-data		other-data
rakusis		rakusis
sts2016-annotated		sts2016-annotated
sts2016-english-v1.1		sts2016-english-v1.1
README.md		README.md
USAAR-SHEFF_2015_modely_features.sh		USAAR-SHEFF_2015_modely_features.sh
clss-text.csv		clss-text.csv
clss_data.py		clss_data.py
csv2tsv.py		csv2tsv.py
mwa_prop.py		mwa_prop.py
sts.csv		sts.csv
sts2016-test.DLS2014.csv		sts2016-test.DLS2014.csv
sts2016_test.stasis.csv		sts2016_test.stasis.csv
sts2016_train.stasis.csv		sts2016_train.stasis.csv
sts2017.csv		sts2017.csv
sts2017_data.py		sts2017_data.py
sts_compose.py		sts_compose.py
sts_data.py		sts_data.py
sts_glove.py		sts_glove.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Stasis - Python wrapper for Semantic Similarity datasets

Datasets

Contribute

Cite

About

Releases

Packages

Languages

alvations/stasis

Folders and files

Latest commit

History

Repository files navigation

Stasis - Python wrapper for Semantic Similarity datasets

Datasets

Contribute

Cite

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages