Publications
DIWALI-Diversity and Inclusivity aWare cuLture specific Items for India: Dataset and Assessment of LLMs for Cultural Text Adaptation in Indian Context
Pramit Sahoo*, Maharaj Brahma*, Maunendra Sankar Desarkar.
EMNLP 2025.
Oral Presentation
MorphTok: Morphologically grounded tokenization for Indic languages
Maharaj Brahma, NJ Karthika, Atul Singh, Devaraj Adiga, Smruti Bhate, Ganesh Ramakrishnan, Rohit Saluja, Maunendra Sankar Desarkar.
TokShop, ICML 2025.
NLIP-Lab-IITH Multilingual MT System for WAT24 MT Shared Task
Maharaj Brahma, Pramit Sahoo, Maunendra Sankar Desarkar.
WAT24 MT Shared Task 2024.
🏆 Best System Submission
NLIP_Lab-IITH Low-Resource MT System for WMT24 Indic MT Shared Task
Pramit Sahoo, Maharaj Brahma, Maunendra Sankar Desarkar.
WAT24 MT Shared Task 2024.
SelectNoise: Unsupervised Noise Injection to Enable Zero-Shot Machine Translation for Extremely Low-resource Languages
Maharaj Brahma*, Kaushal Kumar Maurya*, Maunendra Sankar Desarkar.
EMNLP Findings 2023.
🏆 Best Poster (2nd) at IndoML 24