Sherry Thomas

Research Scientist at AI4Bharat, IIT Madras, specializing in Natural Language Processing and Text-to-Speech systems. Published researcher at top-tier venues (NeurIPS, Interspeech) with expertise in building large-scale multilingual AI systems for Indian languages. BS in Data Science (IIT Madras, 9.52 CGPA) with a unique interdisciplinary background combining Philosophy and Computer Science.

Research Highlights

RASMALAI

Interspeech 2025

Introduces RASMALAI, a large multilingual dataset for controllable text-to-speech, and develops IndicParlerTTS, a controllable multilingual text-to-speech model for 24 languages.

The State Of TTS: A Case Study with Human Fooling Rates

Interspeech 2025

Introduces the Human Fooling Rate (HFR), a metric that directly measures how indistinguishable synthesized speech is from human speech. Evaluating TTS systems with HFR reveals that parity claims based on CMOS are misleading.

IndicVoices-R: Unlocking a Massive Multilingual Multi-speaker Speech Corpus for Scaling Indian TTS

NeurIPS 2024

Created IndicVoices-R, a large dataset for multilingual Indian text-to-speech. The accompanying benchmark evaluates zero-shot Indian TTS, and the resulting model improves generalization across diverse speakers and languages.

Work Experience

AI4Bharat, IIT Madras

Research Scientist | January 2025 - Present

Data Science Intern | February 2024 - December 2024

  • Contributing to the TTS Lab by cleaning and preparing data and by training models
  • Building open-source text-to-speech models for the 22 official languages of India
  • Co-authored multiple research papers published at NeurIPS and Interspeech

IIT Madras

ML Research Intern | September 2023 - January 2024

  • Developed content bridging theory and practice in classical machine learning
  • Created high-quality ML notebooks exploring model internals and guiding effective EDA

Teaching Assistant - Machine Learning Techniques (CS2007) | May 2023 - August 2023

  • Prepared comprehensive course notes adopted as standard course material
  • Mentored students and conducted sessions on classical ML topics

Education

Degree | Institution | CGPA | Year
BS in Data Science and Applications | IIT Madras | 9.52/10 | 2025
BA in Philosophy | Christ University, Bengaluru | 9.57/10 | 2021

Technical Skills

Research Areas: Text-to-Speech, Natural Language Processing, Large Language Models, Multilingual AI, Speech Synthesis

Languages: Python, JavaScript, SQL, Java

ML/DL Frameworks: PyTorch, TensorFlow, Hugging Face Transformers, scikit-learn, pandas, OpenCV

Web & Backend: Flask, Django, Vue.js, SQLAlchemy, Celery, Redis

Mobile Development: Flutter, Dart

Tools & Infrastructure: PostgreSQL, MinIO, Docker, Git

NLP/LLM Tools: llama.cpp, LangChain, Llama, Mistral, BLOOM, Flan-T5

Certifications