A simple, consistent and extendable toolkit for IndicTrans2
-
Updated
Aug 27, 2024 - Python
A simple, consistent and extendable toolkit for IndicTrans2
MILU (Multi-task Indic Language Understanding Benchmark) is a comprehensive evaluation dataset designed to assess the performance of LLMs across 11 Indic languages.
Fine-tuned and compared 3 🤗 pre-trained Multilingual LLMs
This repository contains Python implementations for processing multilingual text data, focusing on language classification and translation tasks. The project addresses two distinct tasks: language classification and English translation, each involving different complexities in the processing of text data.
Setu dashboard is a all-in-one streamlit application that allows users to provide feedback on the outputs of the setu data cleaning pipeline for @AI4Bharat
Add a description, image, and links to the ai4bharat topic page so that developers can more easily learn about it.
To associate your repository with the ai4bharat topic, visit your repo's landing page and select "manage topics."