Mitesh M. Khapra, DSAI, IITM

My research focuses on bringing parity in AI technologies for Indian languages with respect to English with open-source contributions in tools, datasets, neural models, and reference applications. My specific areas of interest are pretrained multilingual models for natural language and speech, neural machine translation, efficient models for automatic speech recognition, and evaluation metrics for natural language generation. Please visit the Nilekani Centre at AI4Bharat to know more about my work.

Mitesh M. Khapra is an Associate Professor in the Department of Data Science and AI (DSAI), Wadhwani School of Data Science and AI at IIT Madras. He heads the AI4Bharat Research Lab at IIT Madras which focuses on building datasets, tools, models and applications for Indian languages. His research work has been published in several top conferences and journals including TACL, ACL, NeurIPS, TALLIP, EMNLP, EACL, AAAI, etc. He has also served as Area Chair or Senior PC member in top conferences such as ICLR and AAAI. Prior to IIT Madras, he was a Researcher at IBM Research India for four and a half years, where he worked on several interesting problems in the areas of Statistical Machine Translation, Cross Language Learning, Multimodal Learning, Argument Mining and Deep Learning. Prior to IBM, he completed his PhD and M.Tech from IIT Bombay in Jan 2012 and July 2008 respectively. His PhD thesis dealt with the important problem of reusing resources for multilingual computation. During his PhD he was a recipient of the IBM PhD Fellowship (2011) and the Microsoft Rising Star Award (2011). He is also a recipient of the Google Faculty Research Award (2018), the IITM Young Faculty Recognition Award (2019), the Prof. B. Yegnanarayana Award for Excellence in Research and Teaching (2020) and the Srimathi Marti Annapurna Gurunath Award for Excellence in Teaching (2022). Srimathi Marti Annapurna Gurunath Award for Excellence in Teaching, 2022
Nasscom AI Game Changer Award, 2021 (Academic Category: Accepted on behalf of Team Samanantar)
Prof. B. Yegnanarayana Award for Excellence in Research and Teaching, 2020
IITM Young Faculty Recognition Award, 2019
Google Faculty Research Award, 2018
Microsoft Rising Star Award, 2011
IBM PhD Fellowship, 2011-2012

Students

PhD: Preksha Nema (Google PhD Fellowship (2017) , with Prof. Ravindran), Ananya B. Sai ( Google PhD Fellowship, 2019), Tahir Javed (Google PhD Fellowship, 2022), Sushane Parthan, Sumanth Doddapaneni (Google PhD Fellowship, 2023), Kaushal Bhogale, Praveen S V (Google PhD Fellowship, 2024), Mohammed Safi Ur Rahman Khan, Oikantik Nath, Sakshi Joshi
MS: Siddharth Arora (2015), Shashank Shrivastava (with Prof. Sutanu 2016), Shreyas Shetty (2018), Nikita Moghe (with Prof. Ravindran 2018), Suman Banerjee (Best MS Thesis Award, INAE Innovative Students Project Award 2020 - Masters Level 2019), Shweta Bhardwaj (2019), Pritha Ganguly (2019), Nitesh Methani (2020), Madhura Pande (with Prof. Pratyush Kumar 2021), Raghavan A K (2023), Yash Madhani (2023), Pranjal Agadh Chitale (2024), Nandini Mundra (2024), Anushka Singh, Srija Anand, Ashwin Sankar, Abhishek Ranjan, Thanmay Jayakumar
M.Tech: Ananya b. Sai (2018), Shubham Patel (Best M.Tech Project Award 2019), Jaya Ingle (Best M.Tech Project Award 2019), Amar Vashishth (2019), Siddharth AP (2019), B Krishnanjali (with Prof. Pratyush Kumar 2020), Bethu Sai Sampath (with Prof. Pratyush Kumar 2020), Nitin John Titus (with Prof. Pratyush Kumar 2020), Manideep Ladi (2022), Aman Kumar (2022), Himani Shrotiya (2022), Prachi Sahu (2022), Priyanka Bedekar (2022), Rigved Sah (2022), Shubham Randive (2022), Sumit Negi (2022), Sushane Parthan (2022), Mohammed Safi Ur Rahman Khan (2023), Siddesh Hegde (2023), Keyur Raval (2023), Sanjanaa GV (2023), Aswanth Kumar (2023), Varun Gumma (2023), Bibhuti Majhi (2024), Sarthak Naithani (2024), Sai Sree Ram Putta (2024), Ravi Prakash Singh (2024), Raj Mahajan (2024), Sathish Kumar Reddy (2024), Tanmay Pramod Garde (2025), Amar Kumar Sharma (2025), Shubhodeep Chanda (2025)
DD: Sanchit Agrawal (Best DDP Award 2018), Gurneet Singh (2018), Ishu Garg (2018), Soham Parikh (2018), Sowjanya Vemuri (2019), Ajaykrishnan Jayagopal (2019), Satishkumar Golla (2019), Umang Sinha (2020), Aashay Doshi (2020), Sandesh Katta (2020), Siddharth Devulapalli (2020), Sahana Ramnath (2020), Monisha Jegadeesan (2020), M Akash Kumar (2021), Dev Sheth (2022), Aghin Shah (2022), Nayan N (2022), Devakrishna Asokar (2022)
B.Tech: Susanna Maria Baby (2017), Revanth Reddy (2018), Aarvith Muthu (2019), Emil Biju (2021), Abishek S (2022)
(the number in the bracket is the year of graduation)

[J12] Praveen S V, Amogh Gulati, Ashwin Sankar, Srija Anand, Anirudh Gupta, Anirudh Mukherjee, Shiva Kumar Marepally, Ankur Bhatia, Saloni Jaju, Suvrat Bhooshan, Mitesh M. Khapra. Rethinking MUSHRA: Addressing Modern Challenges in Text-to-Speech Evaluation. Transactions on Machine Learning Research (TMLR), 2025.
[J11] Sumanth Doddapaneni, Gowtham Ramesh, Mitesh M. Khapra, Anoop Kunchukuttan, Pratyush Kumar. A Primer on Pretrained Multilingual Language Models. ACM Computing Surveys (ACM CSUR), 2025.
[J10] Jay Gala, Pranjal A Chitale, A K Raghavan, Varun Gumma, Sumanth Doddapaneni, Aswanth Kumar M, Janki Atul Nawale, Anupama Sujatha, Ratish Puduppully, Vivek Raghavan, Pratyush Kumar, Mitesh M Khapra, Raj Dabre, Anoop Kunchukuttan. IndicTrans2: Towards High-Quality and Accessible Machine Translation Models for all 22 Scheduled Indian Languages. Transactions on Machine Learning Research ( TMLR ), 2022.
[J09] Aravinth Bheemaraj, Mayank Jobanputra, Raghavan AK, Ajitesh Sharma, Sujit Sahoo, Harshita Diddee, Mahalakshmi J, Divyanshu Kakwani, Navneet Kumar, Aswin Pradeep, Srihari Nagaraj, Kumar Deepak, Vivek Raghavan, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Shantadevi Khapra. Samanantar: The Largest Publicly Available Parallel Corpora Collection for 11 Indic Languages. Transactions of the Association for Computational Linguistics,( TACL ), 2022.
[J08] Ananya B. Sai, M Akash Kumar, Mitesh M. Khapra. A Survey of Evaluation Metrics Used for NLG Systems. ACM Computing Surveys (ACM CSUR), 2021.
[J07] Ananya B. Sai, M Akash Kumar, Siddharatha Arora, Mitesh M. Khapra. Improving Dialog Evaluation with a Multi-reference Adversarial Dataset and Large Scale Pretraining. Transactions of the Association for Computational Linguistics (TACL), 2020.
[J06] Suman Banerjee, Mitesh M. Khapra. Graph Convolutional Network with Sequential Attention for Goal-oriented Dialogue Systems. Transactions of the Association for Computational Linguistics (TACL), 2019.
[J05] Rudra Murthy, Mitesh M. Khapra, Dr. Pushpak Bhattacharyya. Improving NER Tagging Performance in Low-Resource Languages via Multilingual Learning. The ACM Transactions on Asian and Low-Resource Language Information Processing (TALLIP), 2018.
[J04] Anoop Kunchukuttan, Mitesh Khapra, Gurneet Singh, Pushpak Bhattacharyya. Leveraging Orthographic Similarity for Multilingual Neural Transliteration. Transactions of the Association for Computational Linguistics (TACL), 2018.
[J03] Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran. Studying the Plasticity in Deep Convolutional Neural Networks using Random Pruning. To appear in the Journal of Machine Vision and Applications (MVA). Springer.
[J02] Sarath Chandar, Mitesh M. Khapra, Hugo Larochelle, Balaraman Ravindran, Correlational Neural Networks, Neural Computation, February 2016.
[J01] A Kumaran, Mitesh M. Khapra and Pushpak Bhattacharyya, Compositional Machine Transliteration, accepted for publication in Transactions on Asian Language Information Processing ( TALIP ), December 2010.

[C78] Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Dilip Venkatesh, Raj Dabre, Anoop Kunchukuttan, Mitesh M. Khapra: Cross-Lingual Auto Evaluation for Assessing Multilingual LLMs. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria, July 2025.
[C77] Kaushal Santosh Bhogale, Deovrat Mehendale, Tahir Javed, Devbrat Anuragi, Sakshi Joshi, Sai Sundaresan, Aparna Ananthanarayanan, Sharmistha Dey, Sathish Kumar Reddy G, Anusha Srinivasan, Abhigyan Raman, Pratyush Kumar, Mitesh M. Khapra: Towards Bringing Parity in Pretraining Datasets for Low-resource Indian Languages. In Proceedings of IEEE International Conference on Acoustics, Speech, and Signal Processing (ICASSP 2025), Hyderabad, India, April 2025.
[C76] Oikantik Nath, Hanani Bathina, Mohammed Safi Ur Rahman Khan, Mitesh M. Khapra: Can Vision-Language Models Evaluate Handwritten Math?. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria, July 2025.
[C75] Janki Atul Nawale, Mohammed Safi Ur Rahman Khan, Janani D, Mansi Gupta, Danish Pruthi, Mitesh M. Khapra: FairI Tales: Evaluation of Fairness in Indian Contexts with a Focus on Bias and Stereotypes. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria, July 2025.
[C74] Ashwin Sankar, Sparsh Jain, Nikhil Narasimhan, Devilal Choudhary, Dhairya Suman, Mohammed Safi Ur Rahman Khan, Anoop Kunchukuttan, Mitesh M Khapra, Raj Dabre: Towards Building Large Scale Datasets and State-of-the-Art Automatic Speech Translation Systems for 14 Indian Languages. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), Vienna, Austria, July 2025.
[Best Theme Paper Award] [C73] Sakshi Joshi, Eldho Ittan George, Tahir Javed, Kaushal Bhogale, Nikhil Narasimhan, Mitesh M. Khapra: Recognizing Every Voice: Towards Inclusive ASR for Rural Bhojpuri Women. In Proceedings of (INTERSPEECH 2025), Rotterdam, The Netherlands, September 2025.
[C72] Ashwin Sankar, Yoach Lacombe, Sherry Thomas, Praveen Srinivasa Varadhan, Sanchit Gandhi, Mitesh M Khapra: Rasmalai: Resources for Adaptive Speech Modeling in Indian Languages with Accents and Intonations. In Proceedings of (INTERSPEECH 2025), Rotterdam, The Netherlands, September 2025.
[C71] Tahir Javed, Kaushal Bhogale, Mitesh M. Khapra: NIRANTAR: Continual Learning with New Languages and Domains on Real-world Speech Data. In Proceedings of (INTERSPEECH 2025), Rotterdam, The Netherlands, September 2025.
[C70] Praveen Srinivasa Varadhan, Sherry Thomas, Sai Teja M. S., Suvrat Bhooshan, Mitesh M. Khapra: The State Of TTS: A Case Study with Human Fooling Rates. In Proceedings of (INTERSPEECH 2025), Rotterdam, The Netherlands, September 2025.
[C69] Ashwin Sankar, Srija Anand, Praveen Srinivasa Varadhan, Sherry Thomas, Mehak Singal, Shridhar Kumar, Deovrat Mehendale, Aditi Krishana, Giri Raju, Mitesh Khapra: IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS. In Proceedings of The Thirty-Eighth Conference on Neural Information Processing Systems (NeurIPS 2024, Track on Datasets and Benchmarks), New Orleans, Louisiana, USA, December 2024.
[C68] Nandini Mundra, Aditya Nanda Kishore, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M. Khapra: An Empirical Comparison of Vocabulary Expansion and Initialization Approaches for Language Models. In Proceedings of the 28th Conference on Computational Natural Language Learning (CoNLL 2024), Miami, Florida, USA, November 2024.
[C67] Anushka Singh, Ananya Sai, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh Khapra: How Good is Zero-Shot MT Evaluation for Low Resource Indian Languages?. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, August 2024.
[C66] Srija Anand, Praveen Srinivasa Varadhan, Ashwin Sankar, Giri Raju, Mitesh M. Khapra: Enhancing Out-of-Vocabulary Performance of Indian TTS Systems for Practical Applications through Low-Effort Data Strategies. In Proceedings of (INTERSPEECH 2024), Kos, Greece, September 2024.
[C65] Praveen Srinivasa Varadhan, Ashwin Sankar, Giri Raju, Mitesh M. Khapra: Rasa: Building Expressive Speech Synthesis Systems for Indian Languages in Low-resource Settings. In Proceedings of (INTERSPEECH 2024), Kos, Greece, September 2024.
[Outstanding Paper Award] [C64] Sumanth Doddapaneni, Mohammed Safi Ur Rahman Khan, Sshubam Verma, Mitesh M. Khapra: Finding Blind Spots in Evaluator LLMs with Interpretable Checklists. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing (EMNLP 2024), Miami, Florida, USA, November 2024.

[C63] Tahir Javed, Janki Atul Nawale, Eldho Ittan George, Sakshi Joshi, Kaushal Santosh Bhogale, Deovrat Mehendale, Ishvinder Virender Sethi, Aparna Ananthanarayanan, Hafsah Faquih, Pratiti Palit, Sneha Ravishankar, Saranya Sukumaran, Tripura Panchagnula, Sunjay Murali, Kunal Sharad Gandhi, Ambujavalli R, Manickam K M, C Venkata Vaijayanthi, Krishnan Srinivasa Raghavan Karunganni, Pratyush Kumar, Mitesh M. Khapra: IndicVoices: Towards building an Inclusive Multilingual Speech Dataset for Indian Languages. In Findings of the Association for Computational Linguistics: (ACL 2024), Bangkok, Thailand, August 2024.
[C62] Tahir Javed, Janki Nawale, Sakshi Joshi, Eldho George, Kaushal Bhogale, Deovrat Mehendale, Mitesh M. Khapra: LAHAJA: A Robust Multi-accent Benchmark for Evaluating Hindi ASR Systems. In Proceedings of (INTERSPEECH 2024), Kos, Greece, September 2024.
[Outstanding Paper Award] [C61] Mohammed Safi Ur Rahman Khan, Priyam Mehta, Ananth Sankar, Umashankar Kumaravelan, Sumanth Doddapaneni, Suriyaprasaad B, Varun Balan G, Sparsh Jain, Anoop Kunchukuttan, Pratyush Kumar, Raj Dabre, Mitesh M. Khapra: IndicLLMSuite: A Blueprint for Creating Pre-training and Fine-Tuning Datasets for Indian Languages. In Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (ACL 2024), Bangkok, Thailand, August 2024.

[C60] Tahir Javed, Sakshi Joshi, Vignesh Nagarajan, Sai Sundaresan, Janki Nawale, Abhigyan Raman, Kaushal Bhogale, Pratyush Kumar, Mitesh M. Khapra: Svarah: Evaluating English ASR Systems on Indian Accents. In Proceedings of (INTERSPEECH 2023), Dublin, Ireland, August 2023.
[C59] Yash Madhani, Mitesh M. Khapra, Anoop Kunchukuttan: Bhasha-Abhijnaanam: Native-script and Romanized Language Identification for 22 Indic languages. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 2023.
[C58] Ananya B. Sai, Vignesh Nagarajan, Tanay Dixit, Raj Dabre, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra: IndicMT Eval: A Dataset to Meta-Evaluate Machine Translation Metrics for Indian Languages. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 2023.
[C57] Kaushal Santosh Bhogale, Sai Sundaresan, Abhigyan Raman, Tahir Javed, Mitesh M. Khapra, Pratyush Kumar: Vistaar: Diverse Benchmarks and Training Sets for Indian Language ASR. In Proceedings of (INTERSPEECH 2023), Dublin, Ireland, August 2023.
[C56] Aswanth Kumar, Ratish Puduppully, Raj Dabre, Anoop Kunchukuttan: CTQScorer: Combining Multiple Features for In-context Example Selection for Machine Translation. In Findings of the Association for Computational Linguistics: (EMNLP 2023), Singapore, December 2023.
[C55] Sumanth Doddapaneni, Rahul Aralikatte, Gowtham Ramesh, Shreya Goyal, Mitesh M. Khapra, Anoop Kunchukuttan, Pratyush Kumar: Towards Leaving No Indic Language Behind: Building Monolingual Corpora, Benchmark and Models for Indic Languages. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 2023.
[C54] Arnav Mhaske, Harshit Kedia, Sumanth Doddapaneni, Mitesh M. Khapra, Pratyush Kumar, Rudra Murthy V, Anoop Kunchukuttan: Naamapadam: A Large-Scale Named Entity Annotated Data for Indic Languages. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (ACL 2023), Toronto, Canada, July 2023.
[C53] Gokul NC, Manideep Ladi, Sumit Negi, Prem Selvaraj, Pratyush Kumar, Mitesh M. Khapra: Addressing Resource Scarcity across Sign Languages with Multilingual Pretraining and Unified-Vocabulary Datasets. In Proceedings of the 36th Conference on Neural Information Processing Systems (NeurIPS 2022), New Orleans, Louisiana, USA, November 2022.
[C52] Yash Madhani, Sushane Parthan, Priyanka Bedekar, Gokul NC, Ruchi Khapra, Anoop Kunchukuttan, Pratyush Kumar, Mitesh Khapra. Aksharantar: Open Indic-language Transliteration datasets and models for the Next Billion Users. In Proceedings of Empirical Methods in Natural Language Processing Findings (EMNLP Findings 2023), Singapore, December 2023.
[C51] Gokul Karthik Kumar^*, Praveen S V^*, Pratyush Kumar, Mitesh M. Khapra, Karthik Nandakumar. Towards Building Text-To-Speech Systems for the Next Billion Users. In Proceedings of IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June, 2022.
[C50] Kaushal Bhogale, Abhigyan Raman, Tahir Javed, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra. Effectiveness of Mining Audio and Text Pairs from Public Data for Improving ASR Systems for Low-Resource Languages. In 2023 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP 2023), Rhodes Island, Greece, June, 2023.
[C49] Tahir Javed, Kaushal Santosh Bhogale, Abhigyan Raman, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M. Khapra. IndicSUPERB: A Speech Processing Universal Performance Benchmark for Indian languages. In Proceedings of the Thirty-Sixth AAAI Conference on Artificial Intelligence (AAAI 2023), Washington, DC, USA, February, 2023.
[C48] Aman Kumar, Himani Shrotriya, Prachi Sahu, Amogh Mishra, Raj Dabre, Ratish Puduppully, Anoop Kunchukuttan, Mitesh M. Khapra, Pratyush Kumar: IndicNLG Benchmark: Multilingual Datasets for Diverse NLG Tasks in Indic Languages. In Proceedings of Empirical Methods in Natural Language Processing Findings (EMNLP Findings 2022), Abu Dhabi, December 2022.
[C47] Emil Biju, Anirudh Sriram, Pratyush Kumar, Mitesh M. Khapra: Input-specific Attention Subnetworks for Adversarial Detection. Findings of the Association for Computational Linguistics (ACL -Findings 2022), Dublin, Ireland, May 2022.
[C46] Raj Dabre, Himani Shrotriya, Anoop Kunchukuttan, Ratish Puduppully, Mitesh Khapra, and Pratyush Kumar. IndicBART: A Pre-trained Model for Indic Natural Language Generation. In Findings of the Association for Computational Linguistics (ACL -Findings 2022), Dublin, Ireland, May 2022.
[C45] Prem Selvaraj, Gokul Nc, Pratyush Kumar, Mitesh M. Khapra: OpenHands: Making Sign Language Recognition Accessible with Pose-based Pretrained Models across Languages. In Proceedings of the Association for Computational Linguistics (ACL 2022), Dublin, Ireland, May 2022.
[Outstanding Paper Award] [C44] Akash Kumar Mohankumar, Mitesh M. Khapra: Active Evaluation: Efficient NLG Evaluation with Few Pairwise Comparisons. In Proceedings of the Association for Computational Linguistics (ACL 2022), Dublin, Ireland, May 2022.
[C43] Tahir Javed, Sumanth Doddapaneni, Abhigyan Raman, Kaushal Santosh Bhogale, Gowtham Ramesh, Anoop Kunchukuttan, Pratyush Kumar, Mitesh M Khapra. Towards Building ASR Systems for the Next Billion Users. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2022), February, 2022.
[C42] Ananya B. Sai, Tanay Dixit, Dev Sheth, Sreyas Mohan and Mitesh M. Khapra: Perturbation CheckLists for Evaluating NLG Evaluation Metrics. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2021), Punta Cana, Dominican Republic, November, 2021.
[C41] Dev Yashpal Sheth, Sreyas Mohan, Joshua Vincent, Ramon Manzorro, Peter A. Crozier, Mitesh M. Khapra, Eero P. Simoncelli and Carlos Fernandez-Granda: Unsupervised Deep Video Denoising. IEEE/CVF International Conference on Computer Vision (ICCV 2021 ), October, 2021.
[C40] Madhura Pande, Aakriti Budhraja, Preksha Nema, Pratyush Kumar, Mitesh M. Khapra: The heads hypothesis: A unifying statistical approach towards understanding multi-headed attention in BERT. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), February, 2021.
[C39] Pritha Ganguly, Nitesh Methani, Mitesh M. Khapra, Pratyush Kumar: A Systematic Evaluation of Object Detection Networks for Scientific Plots. In Proceedings of the Thirty-Fifth AAAI Conference on Artificial Intelligence (AAAI 2021), February, 2021.
[C38] Emil Biju, Anirudh Sriram, Mitesh M. Khapra, Pratyush Kumar: Joint Transformer/RNN Architecture for Gesture Typing in Indic Languages.In Proceedings of the The 27th International Conference on Computational Linguistics (COLING 2020), Nov-Dec 2020.
[C37] Divyanshu Kakwani, Anoop Kunchukuttan, Satish Golla, Gokul N. C., Avik Bhattacharyya, Mitesh M. Khapra, Pratyush Kumar: iNLPSuite: Monolingual Corpora, Evaluation Benchmarks and Pre-trained Multilingual Language Models for Indian Languages. In Proceedings of Empirical Methods in Natural Language Processing Findings (EMNLP Findings 2020), November 2020.
[C36] M Akash Kumar, Preksha Nema, Sharan Narasimhan, Mitesh M. Khapra, Balaji Vasan Srinivasan, Balaraman Ravindran: Towards Transparent and Explainable Attention Models. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2020), July, 2020.
[C35] Nitesh Methani, Pritha Ganguly, Mitesh M. Khapra, Pratyush Kumar: PlotQA: Reasoning over Scientific Plots. In the Proceedings of the Eighteenth IEEE Winter Conference on Applications of Computer Vision (WACV 2020), Aspen, Colorado, USA, March 2020.
[C34] Preksha Nema, M Akash Kumar, Mitesh M. Khapra, Balaji V Srinivasan, Balaraman Ravindran: Let's Ask Again: Refine Network for Automatic Question Generation. In Proceedings of Empirical Methods in Natural Language Processing ( EMNLP 2019 ), Hong Kong, November 2019.
[C33] Shweta Bhardwaj, Mitesh M. Khapra and Mukundhan Srinivasan: Efficient Video Classification Using Fewer Frames. IEEE International Conference on Computer Vision and Pattern Recognition ( CVPR 2019 ), Long Beach, California, USA, June, 2019
[C32] Siddhartha Arora, Mitesh M. Khapra and Harish G. Ramaswamy: On Knowledge distillation from complex networks for response prediction. In Proceedings of 17th Annual Conference of the North American Chapter of the Association for Computational Linguistics ( NAACL 2019 ), Minneapolis, USA, June 2–7, 2019
[C31] Ananya Sai, Mithun Das Gupta, Mukundhan Srinivasan, Mitesh M. Khapra: Re-evaluating ADEM: A Deeper Look at Scoring Dialogue Responses, In Proceedings of the Thirty-Third AAAI Conference on Artificial Intelligence (AAAI 2019), Honolulu, Hawaii, USA, January - February, 2019
[C30] Anirban Laha, Saneem Ahmed Chemmengath, Priyanka Agrawal, Mitesh M. Khapra, Karthik Sankaranarayanan, Harish Ramaswamy: On Controllable Sparse Alternatives to Softmax, Neural Information Processing Systems (NeurIPS 2018), Montreal, December 2018
[C29] Preksha Nema and Mitesh M. Khapra: Towards a Better Metric for Evaluating Question Generation Systems. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2018), Brussels, Belgium, NOvember 2018.
[C28] Nikita Moghe, Siddhartha Arora, Suman Banerjee and Mitesh M. Khapra: Towards Exploiting Background Knowledge for Building Conversation Systems. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2018), Brussels, Belgium, NOvember 2018.
[C27] Suman Banerjee, Nikita Moghe, Siddhartha Arora, Mitesh M. Khapra: A Dataset for Building Code-Mixed Goal Oriented Conversation Systems. In Proceedings of the The 27th International Conference on Computational Linguistics (COLING 2018), Santa Fe, New-Mexico, USA, August 2018.
[C26] Amrita Saha, Rahul Aralikatte, Mitesh M. Khapra and Karthik Sankaranarayanan: DuoRC: Towards Complex Language Understanding with Paraphrased Reading Comprehension. In Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (ACL 2018), Melbourne, Australia, July, 2018.
[C25] Soham Parikh, Ananya Sai, Preksha Nema, Mitesh M Khapra: ElimiNet: A Model for Eliminating Options for Reading Comprehension with Multiple Choice Questions. In Proceedings of the 27th International Joint Conference on Artificial Intelligence (IJCAI 2018), Stockholm, Sweden, July, 2018.
[C24] Preksha Nema, Shreyas Shetty M, Parag Jain, Anirban Laha, Karthik Sankaranarayanan and Mitesh M. Khapra: Generating Descriptions from Structured Data Using a Bifocal Attention Mechanism and Gated Orthogonalization. In Proceedings of 16th Annual Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2018), New Orleans, June, 2018.
[C23] Amrita Saha, Megha Nawhal, Mitesh M. Khapra, Vikas Raykar: Learning Disentangled Multimodal Representations for the Fashion Domain. In the Proceedings of the Eighteenth IEEE Winter Conference on Applications of Computer Vision (WACV 2018), Lake Tahoe, NV/CA, USA, March 2018.
[C22] Deepak Mittal, Shweta Bhardwaj, Mitesh M. Khapra, Balaraman Ravindran: Recovering from Random Pruning: On the Plasticity of Deep Convolutional Neural Networks. In the Proceedings of the Eighteenth IEEE Winter Conference on Applications of Computer Vision (WACV 2018), Lake Tahoe, NV/CA, USA, March 2018.
[C21] Amrita Saha, Vardaan Pahuja, Mitesh M. Khapra, Karthik Sankaranarayanan, Sarath Chandar : Complex Sequential Question Answering: Towards Learning to Converse Over Linked Question Answer Pairs with a Knowledge Graph. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, USA, February 2018.
[C20] Amrita Saha, Mitesh M. Khapra, Karthik Sankaranarayanan : Towards Building Large Scale Multimodal Domain-Aware Conversation Systems. In Proceedings of the Thirty-Second AAAI Conference on Artificial Intelligence (AAAI 2018), New Orleans, Louisiana, USA, February 2018.
[C19] Preksha Nema, Mitesh M. Khapra, Anirban Laha, Balaraman Ravindran: Diversity driven attention model for query-based abstractive summarization. In the Proceedings of the Fifty-Fifth Annual Meeting of the Association of Computational Linguistics (ACL 2017), Vancouver, Canada, July 2017.
[C18] Sathish Reddy, Dinesh Raghu, Mitesh M. Khapra, Sachindra Joshi: Generating Natural Language Question-Answer Pairs from a Knowledge Graph Using a RNN Based Question Generation Model. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2017), Valencia, Spain, April, 2017.
[C17] Amrita Saha, Mitesh M. Khapra, Sarath Chandar, Janarthanan Rajendran, Kyunghyun Cho: A Correlational Encoder Decoder Architecture for Pivot Based Sequence Generation. Computational Linguistics Conference (COLING 2016), Osaka, Japan, December 2016
[C16] Janarthanan Rajendran, Mitesh M. Khapra, Sarath Chandar, Balaraman Ravindran: Bridge Correlational Neural Networks for Multilingual Multimodal Representation Learning. In North American Association of Computational Linguistics (NAACL 2016), Atlanta, USA, June 2016, pp. 171–181.
[C15] Ruty Rinott, Lena Dankin, Carlos Alzate Perez, Mitesh M. Khapra, Ehud Aharoni, Noam Slonim: Show Me Your Evidence - an Automatic Method for Context Dependent Evidence Detection. In Proceedings of Empirical Methods in Natural Language Processing (EMNLP 2015), Portugal, September 2015, pp. 440-450.
[C14] A. P. Sarath Chandar, Stanislas Lauly, Hugo Larochelle, Mitesh M. Khapra, Balaraman Ravindran, Vikas C. Raykar, Amrita Saha, An Autoencoder Approach to Learning Bilingual Word Representations, Neural Information Processing Systems (NeurIPS 2014), Montreal, December 2014, pp. 1853-1861.
[C13] Mitesh M. Khapra, Ananthakrishnan Ramanathan, Anoop Kunchukuttan, Karthik Visweswariah, Pushpak Bhattacharyya, When Transliteration Met Crowdsourcing : An Empirical Study of Transliteration via Crowdsourcing using Efficient, Non-redundant and Fair Quality Control, Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC 2014), Reykjavik, Iceland, May 2014, pp. 196-202
[C12] Mitesh M. Khapra, Ananthakrishnan Ramanathan, Karthik Visweswariah, Improving reordering performance using higher order and structural features, in North American Association of Computational Linguistics (NAACL 2013), Atlanta, USA, June 2013, pp. 315-324.
[C11] Karthik Visweswariah, Mitesh M. Khapra, Ananthakrishnan Ramanathan, Cut the noise: Mutually reinforcing reordering and alignments for improved machine translation, in Annual Meeting of the Association of Computational Linguistics (ACL 2013), Bulgaria, August 2013, pp. 1275-1284.
[C10] Mitesh M. Khapra, Salil Joshi, Arindam Chatterjee and Pushpak Bhattacharyya, Together We Can: Bilingual Bootstrapping for WSD , Annual Meeting of the Association of Computational Linguistics (ACL 2011) Oregon, USA, June 2011, pp. 561-569.
[C09] Mitesh M. Khapra, Salil Joshi and Pushpak Bhattacharyya, It Takes Two to Tango: A Bilingual Unsupervised Approach for Estimating Sense Distributions using Expectation Maximization , 5th International Conference on Natural Language Processing (IJCNLP 2011), Chiang Mai, Thailand, November 2011, pp. 695-704.
[C08] Mitesh M. Khapra, Raghavendra Udupa, A. Kumaran, and Pushpak Bhattacharya, PR + RQ ≈ P Q: Transliteration Mining Using Bridge Language, in American Association for Artificial Intelligence (AAAI 2010) , July 2010.
[C07] Mitesh Khapra, Anup Kulkarni, Saurabh Sohoney and Pushpak Bhattacharyya, All Words Domain Adapted WSD: Finding a Middle Ground between Supervision and Unsupervision, Conference of Association of Computational Linguistics (ACL 2010), Uppsala, Sweden, July 2010, pp. 1532-1541.
[C06] Harshada Gune, Mugdha Bapat, Mitesh Khapra and Pushpak Bhattacharyya, Verbs are where all the Action Lies: Experinces of Shallow Parsing of a Morphologically Rich Language, Computational Linguistics Conference (COLING 2010), Beijing, China, August 2010, pp. 347-355.
[C05] Mitesh M. Khapra, Saurabh Sohoney, Anup Kulkarni and Pushpak Bhattacharyya, Value for Money: Balancing Annotation Effort, Lexicon Building and Accuracy for Multilingual WSD, Computational Linguistics Conference (COLING 2010), Beijing, China, August 2010, pp. 555-563.
[C04] Raghavendra Udupa and Mitesh M. Khapra, Transliteration Equivalence using Canonical Correlation Analysis, in European Conference on Information Retrieval (ECIR 2010), March 2010, UK, pp. 75-86.
[C03] Mitesh M. Khapra, A Kumaran and Pushpak Bhattacharyya. Everybody loves a rich cousin: An empirical study of transliteration through bridge languages, in North American Association of Computational Linguistics (NAACL 2010, June 2010, Los Angeles, USA, pp. 420-428.
[C02] Raghavendra Udupa and Mitesh M. Khapra. Improving the Multilingual User Experience of Wikipedia Using Cross-Language Name Search, in North American Association of Computational Linguistics (NAACL 2010), June 2010, Los Angeles, USA, pp. 420-428.
[C01] Mitesh M. Khapra, Sapan Shah, Piyush Kedia and Pushpak Bhattacharyya, Projecting Parameters for Multilingual Word Sense Disambiguation, Empirical Methods in Natural Language Processing (EMNLP 2009), Singapore, August, 2009, pp. 459-467.

Last updated on 01/07/2022 at 10:00 a.m. IST

Mitesh M. Khapra

Research

Bio

Awards

Teaching

Students

Publications