Correct spellings of misspelled Odia words.
See the dependencies in requirements.txt
.
The code has been tested with Python 3.6.
We use Fasttext for correcting spelling mistakes. Check out this to get a guide 📘 to Fasttext.
- First download Odia text data with mixed vocabulary.
mkdir data
cd data
!wget https://storage.googleapis.com/ai4bharat-public-indic-nlp-corpora/data/monolingual/indicnlp_v1/sentence/or.txt.gz
tar -zxvf or.txt.gz
head or
- Then download text with clean vocabulary from Kaggle.
Put that inside
data
directory. - Train Fasttext embeddings. See the notebook
fasttext.ipynb
. - Build two sets of vocabulary, one mixed and another clean. See the notebook
vocabulary.ipynb
. - Finally run
controller.py
to start the web app. Go to http://127.0.0.1:31137/spelling to access the web app.
# web app
python controller.py # open http://127.0.0.1:31137/spelling in browser