BNLP is a natural language processing toolkit for Bengali Language.
-
Updated
Sep 11, 2024 - Jupyter Notebook
BNLP is a natural language processing toolkit for Bengali Language.
This repository contains the official release of the model "BanglaBERT" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaBERT: Language Model Pretraining and Benchmarks for Low-Resource Language Understanding Evaluation in Bangla" accpeted in Findings of the Annual Conference of the North American Chap…
This repository contains the code and data of the paper titled "Not Low-Resource Anymore: Aligner Ensembling, Batch Filtering, and New Datasets for Bengali-English Machine Translation" published in Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP 2020), November 16 - November 20, 2020.
Deep learning Bangla resources with TensorFlow
This repository contains the official release of the model "BanglaT5" and associated downstream finetuning code and datasets introduced in the paper titled "BanglaNLG: Benchmarks and Resources for Evaluating Low-Resource Natural Language Generation in Bangla".
Bangla-Bert is a pretrained bert model for Bengali language
✍️ Bengali Alphabet (বাংলা বর্ণমালা)
Awesome datasets for Bangla language computing.
Transformer based Bangla Speech Recognition
Bangla Machine Translator
Nirmol is an open-source dataset and API for detecting Bangla slang words. Detect offensive/bad/slang words in Bangla/Bengali/Banglish sentences. A helpful API and dataset for developers and researchers.
Bangla news classification and generation
Automatic Context Sensitive Spelling Correction for Bangla Text Using Bert and Levenstein Distance
BNLTK(Bangla Natural Language Processing Toolkit): a python package for NLP in Bangla
Bangla word2vec using skipgram approach
This repository contains the code, data, and associated models of the paper titled "BanglaParaphrase: A High-Quality Bangla Paraphrase Dataset", accepted in Proceedings of the Asia-Pacific Chapter of the Association for Computational Linguistics: AACL 2022.
A collection of Bangla newspaper and blog crawlers. Can be used to mine bangla text data for Natural Language Processing tasks.
Bangla NLP toolkit.
The default auto correct dictionary added in avro Bangla keyboard doesn't contain enough word. So, this is my approach to enrich the dictionary. This file contains the correct spelling of commonly used Bangla words.
Different bangla datasets for sentiment analysis on bangla text
Add a description, image, and links to the bangla-nlp topic page so that developers can more easily learn about it.
To associate your repository with the bangla-nlp topic, visit your repo's landing page and select "manage topics."