A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
-
Updated
Sep 19, 2024 - Python
A CoNLL-U parser that takes a CoNLL-U formatted string and turns it into a nested python dictionary.
An NLP library for Uralic languages such as Finnish, Skolt Sami, Moksha and so on. Also supporting some non-Uralic languages such as Spanish, French, Arabic, Swedish, Norwegian, Russian and English
Pipeline component for spaCy (and other spaCy-wrapped parsers such as spacy-stanza and spacy-udpipe) that adds CoNLL-U properties to a Doc and its sentences and tokens. Can also be used as a command-line tool.
A simple iterator that reads conll and conllu files (https://universaldependencies.org/format.html) without keeping them in memory. It can iterate over words, sentences, or documents.
Scripts used for the preprocessing of the EstGEC-L2 corpus that contains Estonian L2 learner texts error-annotated in the M2 format.
2019 project - french wikipedia corpus data analysis
Naïve transition-based dependency parser in Gluon
conll2praat : Interfaceur syntaxe / prosodie
CONLL-U to Pandas DataFrame
Add a description, image, and links to the conll-u topic page so that developers can more easily learn about it.
To associate your repository with the conll-u topic, visit your repo's landing page and select "manage topics."