
Raj Dabre
machine translation
multilingual
benchmark
natural language generation
indian languages
in-context learning
low-resource languages
large language models
computational social science
transfer learning
summarization
neural machine translation
robustness
pretraining
pragmatics
13
presentations
3
number of views
1
citations
SHORT BIO
Raj Dabre received his M.Tech. from IIT Bombay, India and his Ph.D. from Kyoto University, Japan. He is a researcher at NICT, Japan and a visiting researcher at AI4Bharat. His research interests center on natural language processing, particularly neural machine translation for low resource languages, and on model compression and computing efficiency. He has MT and NLG related publications in ACL, EMNLP, AAAI, NAACL, COLING, INTERSPEECH and WMT. He is a current member of the organizing committee of the Workshop on Asian Translation. He has previously conducted tutorials on neural machine translation and multilingual machine translation at IJCNLP 2017 and COLING 2020, respectively.
Presentations

Do Not Worry if You Do Not Have Data: Building Pretrained Language Models Using Translationese
Meet Doshi and 2 other authors

A Morphology-Based Investigation of Positional Encodings
Poulami Ghosh and 3 other authors

How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?
Anushka Singh and 5 other authors

An Empirical Study of In-context Learning in LLMs for Machine Translation
Pranjal Chitale and 2 other authors

PUB: A Pragmatics Understanding Benchmark for Assessing LLMs' Pragmatics Capabilities
settaluri sravanthi and 5 other authors

Kreyòl-MT: Building MT for Latin American, Caribbean and Colonial African Creole Languages
Nathaniel Romney Robinson and 15 other authors

CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation
Aswanth Kumar M and 3 other authors

DecoMT: Decomposed Prompting for Machine Translation Between Related Languages using Large Language Models
Ratish Puduppully and 4 other authors

YANMTT: Yet Another Neural Machine Translation Toolkit
Raj Dabre and 3 other authors

IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages
Raj Dabre and 8 other authors

KreolMorisienMT: A Dataset for Mauritian Creole Machine Translation
Raj Dabre and 1 other author

IndicBART: A Pre-trained Model for Indic Natural Language Generation
Raj Dabre and 5 other authors

Harnessing Cross-lingual Features to Improve Cognate Detection for Low-resource Languages
Diptesh Kanojia and 5 other authors