AlphaMWE: Construction of Multilingual Parallel Corpora with MWE Annotations
Updated Mar 29, 2023
Created a multilingual training corpus across 15 Indian languages (including English) by compiling different sources
Scripts used to create an interactive website displaying the stats for the Indic multilingual training corpus - Boli, developed by us