NLP_SemanticTextualSimilarity

SemEval 12's Semantic Textual Similarity task, specifically in paraphrase detection

Introduction

SemEval (Semantic Evaluation Exercises) are a series of workshops which have the main aim of the evaluation and comparison of semantic analysis systems. The data and corpora provided by them have become a ’de facto’ set of bench- marks for the NLP comunity.

The SemEval event provides data and evaluation frameworks for several tasks. Task 6 is Semantic Textual Similarity (STS), the purpose of this project.

The description of the event is available at:

http://ixa2.si.ehu.es/starsem/proc/pdf/STARSEM-SEMEVAL051.pdf

and the proceedings of the workshop at:

http://ixa2.si.ehu.es/starsem/proc/program.semeval.html

IHLT STS Project

Statement

This project revolves around utilizing the dataset and task description from SemEval 2012's Semantic Textual Similarity. The primary objective is to implement various approaches for paraphrase detection, focusing on sentence similarity metrics. The project entails:

Exploring lexical dimensions.
Exploring the syntactic dimension independently.
Exploring the combination of both lexical and syntactic dimensions.
Optionally adding new components.

Pre-generated word or sentence embeddings models, including BERT, are not permitted. The project concludes with a comprehensive comparison and commentary on the achieved results, both internally among the approaches and against official benchmarks.

Downloading the data

bash get_data.sh

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.vscode		.vscode
experiments		experiments
features		features
new_features		new_features
src		src
.gitignore		.gitignore
NLP_semantic_textual_similarity.ipynb		NLP_semantic_textual_similarity.ipynb
README.md		README.md
get_data.ps1		get_data.ps1
get_data.sh		get_data.sh
presentation_semantic_textual_similarity.pptx		presentation_semantic_textual_similarity.pptx
requirements_lin.txt		requirements_lin.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

NLP_SemanticTextualSimilarity

Introduction

IHLT STS Project

Statement

Downloading the data

About

Releases

Packages

Contributors 2

Languages

BecTome/NLP_SemanticTextualSimilarity

Folders and files

Latest commit

History

Repository files navigation

NLP_SemanticTextualSimilarity

Introduction

IHLT STS Project

Statement

Downloading the data

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Contributors 2

Languages

Packages