indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
-
Updated
Jan 2, 2024 - Jupyter Notebook
indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagari script.
Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages
Non-contextual : Word2Vec, FastText Contextual : BERT, RoBERTa, ELECTRA, CamemBERT, Distil-BERT, XLM-RoBERTa Analyzed embedding models, used the best one to build a Flask web app for Hindi NER and data collection from user feedback, deployed on AWS.
Contextualized Topic Modeling using Zero-Shot Learning on Indic Languages (IndicCTM)
KPT: Kannada Pre-trained Transformer
We have done cleaning on the Hindi dataset and removed the characters which are not required in it
Add a description, image, and links to the indic-nlp topic page so that developers can more easily learn about it.
To associate your repository with the indic-nlp topic, visit your repo's landing page and select "manage topics."