Language Identification classification using XGBoost
-
Updated
Feb 27, 2021 - Python
Language Identification classification using XGBoost
π Language identification for Scandinavian languages
Language Identification Models
π 4th year Advanced Object Oriented Programming project. A web-based service capable of identifying the language classification of a submitted body of text. The OutOfPlaceMetric is used to compare the distance i.e. the similarity, of the text and the actual language of the text. A database is built from the subject file and is split into k-mers,β¦
π 4th year Artificial Intelligence project. Using the Encog library, it uses vector hashing in conjunction with K-Fold Cross Validation to train a neural network using the WiLI Language Dataset. This neural network can then be used to predict the language of an input.
Add a description, image, and links to the wili-2018-dataset topic page so that developers can more easily learn about it.
To associate your repository with the wili-2018-dataset topic, visit your repo's landing page and select "manage topics."