A corpora collection for the kalamine analyzer.
These corpora and stats come from Don Quixote.
These stats come from the University of Leipzig corpus
French Mixed-Typical 2012. The 1M-sentence file was extracted, and the
leading sentence indices were stripped with:
awk '!($1="")' fra_mixed-typical_2012_1M/fra_mixed-typical_2012_1M-sentences.txt > fra_mixed-typical_2012_1M-sentences.txt
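
As an illustration of what the awk command above does, here is a minimal sketch run on a single line. It assumes each line of the sentences file has the layout index, tab, sentence. The sample sentence and the variable names are made up for the example. Because awk rebuilds the line after the first field is emptied, the output keeps a leading space and any tabs or repeated spaces are collapsed to single spaces. The final sed step is an optional extra, not part of the original command.

```shell
# Hypothetical sample line in the assumed "index<TAB>sentence" layout.
sample="$(printf '1\tLe chat dort.')"

# Same awk program as above: empty the first field, then print the line.
# awk rejoins the fields with single spaces, so a leading space remains.
stripped="$(printf '%s\n' "$sample" | awk '!($1="")')"

# Optional extra step (an addition, not in the original command):
# drop the leading space left behind by the emptied field.
cleaned="$(printf '%s\n' "$stripped" | sed 's/^ //')"

# Brackets make the leading space visible.
printf '[%s]\n' "$stripped"
printf '[%s]\n' "$cleaned"
```

Whether the leading space matters depends on how the analyzer counts characters. If it does, the sed step can be appended to the pipeline before writing the output file.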
@misc{fra_mixed_2012,
author = {Leipzig Corpora Collection},
title = {French mixed corpus based on material from 2012},
howpublished = {https://corpora.uni-leipzig.de?corpusId=fra_mixed_2012},
note = {Accessed: 2024-11-09}
}