kalamine-corpus

A collection of corpora for the kalamine analyzer.

`fr` / `en`

These corpora and stats come from Don Quixote.

`fra_mixed-typical_2012_1M-sentences`

These stats come from the University of Leipzig.

Sources

The French Mixed-Typical 2012, 1M sentences file has been extracted, and the sentence indices have been stripped with `awk '!($1="")' fra_mixed-typical_2012_1M/fra_mixed-typical_2012_1M-sentences.txt > fra_mixed-typical_2012_1M-sentences.txt`.
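For readers who prefer not to use awk, the same index-stripping step can be sketched in Python. This is a minimal illustration, not the script used to build this repository: it assumes each input line is a numeric index, then whitespace, then the sentence. The names `strip_index` and `strip_file` are hypothetical. Unlike the awk command, which leaves a leading space and collapses runs of whitespace, this sketch drops only the first field and keeps the rest of the line unchanged.

```python
from pathlib import Path


def strip_index(line: str) -> str:
    """Drop the first whitespace-delimited field (assumed to be the sentence index)."""
    parts = line.rstrip("\n").split(None, 1)
    # A line holding only an index (or nothing) yields an empty sentence.
    return parts[1] if len(parts) > 1 else ""


def strip_file(src: Path, dst: Path) -> None:
    """Write every line of src to dst with its leading index removed."""
    with src.open(encoding="utf-8") as fin, dst.open("w", encoding="utf-8") as fout:
        for line in fin:
            fout.write(strip_index(line) + "\n")
```

Calling `strip_file` with the extracted sentences file as `src` and the desired output path as `dst` produces one sentence per line.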

Bibtex

@misc{fra_mixed_2012,
    author = {Leipzig Corpora Collection},
    title = {French mixed corpus based on material from 2012},
    howpublished = {https://corpora.uni-leipzig.de?corpusId=fra_mixed_2012},
    note = {Accessed: 2024-11-09}
}
