Broadly, I am interested in applying machine learning algorithms to different problems in speech, text and image domains.
My Ph.D. research is on unsupervised audio segmentation based on specific events. In this, I am working towards improving the speaker boundaries in conversational speech for the speaker diarization systems. In my work, I am trying to incorporate speaker discriminative representation learning within an unsupervised clustering framework. These representations can be obtained from the neural networks or other discriminative classifiers. I am also working towards continual incremental learning in speaker diarization and Knowledge Distillation for E2E ASR model compression.
Journal Paper:
(Jan 2014 - Feb 2016)
I was involved in the project on Speaker Recognition and Speaker Diarization sponsored by Defence Research & Development Organisation (DRDO), India. I was responsible for parallelizing UBM computations (on Nvidia GPUs using CUDA-C) and developing the speaker diarization system.
(Jan 2016 - Present)
I have been teaching assistant for the following courses.
• Pattern Recognition and Machine Learning (Fall 2019, 2018, 2017)(2016 - 2019)
Throughout my Ph.D. journey I was fortunate to mentor some of the brilliant students from IIT Madras and other Indian institutes with their projects/research at IIT Madras.
I am a Ph.D. candidate in the Department of Computer Science and Engineering at Indian Institute of Technology Madras, India.
I joined IIT Madras in 2014 for Master of Science (by research) programme with Prof. C Chandra Sekhar where I was working on Speaker Verification and Speaker Diarization.
I continued my Ph.D. from 2016 jointly with Prof. C Chandra Sekhar and Prof. Hema A Murthy on the topic of Speaker Diarization.
I also collaborate with Dr. Srikanth Madikeri, Idiap Research Institute, Switzerland.
For my research work, I have been fortunate to collaborate with Prof. Shrikanth Narayanan (USC, USA) and Prof. Mriganka Sur (MIT, USA).
During my Ph.D., I have mentored many students from IIT M and other Indian institutes with their projects/research works.
I was also a visiting research student at Mila - Quebec AI Institute, Canada, under the supervision of Prof. Yoshua Bengio and Dr. Mirco Ravanelli.
Let us be social
Research Scholar, IIT Madras
nauman@cse.iitm.ac.in
• Recipient of STAR Teaching Assistant (STAR-TA) award, Department of Computer Science and Engineering, IIT Madras, 2020.
• Recipient of Kris Gopalakrishnan Endowment Student Travel grant award to visit ICASSP-2019, Brighton, UK, 2019.
• Recipient of STAR Teaching Assistant (STAR-TA) award, Department of Computer Science and Engineering, IIT Madras, May, 2018.
• 1st prize winner in 5-Minute PhD Thesis (5MPT) competition supported by International Speech Communication Association (ISCA) at Summer School on Speech Signal Processing (S4P), DA-IICT, India, July, 2017.
• Recipient of the International Speech Communication Association (ISCA) travel grant award 2016 to visit Interspeech at San Francisco, USA, 2016.
• Secured All India Rank 21 in part A of Joint Entrance Screening Test (JEST) for Computer Science conducted by the Institute of Mathematical Sciences (IMSc), India, 2013.
• Secured All India Rank 6 in SRMEE (Computer Science), an entrance exam for post-graduation conducted by SRM University, India, 2013.