Skip to content

OneDeadKey/kalamine-corpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

6 Commits
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

kalamine-corpus

A corpora collection for the kalamine analyzer.

fr / en

Those corpora and stats come from Don Quixote

fra_mixed-typical_2012_1M-sentences

These stats come from University of Leipzig

Sources

French Mixed-Typical 2012, 1M sentences file has been extracted, and the sentence indices have been stripped with awk '!($1="")' fra_mixed-typical_2012_1M/fra_mixed-typical_2012_1M-sentences.txt > fra_mixed-typical_2012_1M-sentences.txt

Bibtex

@misc{fra_mixed_2012,
    author = {Leipzig Corpora Collection},
    title = {French mixed corpus based on material from 2012},
    howpublished = {https://corpora.uni-leipzig.de?corpusId=fra_mixed_2012},
    note = {Accessed: 2024-11-09}
}

About

A corpus collection for the kalamine analyzer.

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published