Skip to content

ajithalbus/TamilCorpus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

9 Commits
 
 
 
 
 
 
 
 

Repository files navigation

TamilCorpus

Open Source Tamil Corpus of 58M words

Source : Wikipedia,TheHindu(Tamil) 

Usage

Run extract.sh to extract the compressed files.

P.S : A little cleansing might be needed.

About

Open Source Tamil Corpus of 58M words

Topics

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages