Principal Speech Scientist
Location: Hyderabad – Work From Office (5 days a week)
Experience: 7–12 Years
Domain: Speech AI | NLP | Machine Learning | Deep Learning
Role Overview
We are looking for a highly experienced Principal Speech Scientist to lead innovation in speech and language technologies. This is a senior individual contributor role with strong technical leadership responsibilities, focused on defining research direction, mentoring teams, and building production-ready speech AI solutions at scale.
Key Responsibilities
Research Strategy & Technical Leadership
- Define and execute the long-term technical vision and roadmap for speech science research
- Drive innovation by translating state-of-the-art academic research into scalable production solutions
- Provide technical leadership and guide teams on best practices in model development and experimentation
Speech & Language Model Development
- Design and develop advanced speech systems including ASR, TTS, speaker verification, diarization, and speech enhancement
- Build high-performance models for speech translation and natural language understanding
- Develop scalable model pipelines and optimize model performance
Cross-Functional Collaboration
- Work closely with product, engineering, and design teams to align research initiatives with business goals
- Influence product direction through applied research and technical insights
- Contribute to architectural decisions related to AI-driven features
Mentorship & Thought Leadership
- Mentor scientists and engineers, fostering a culture of technical excellence and innovation
- Represent the organization at conferences and research forums
- Contribute to publications, patents, and knowledge sharing within the AI research community
Experimentation & Model Optimization
- Establish best practices for experimentation, evaluation metrics, and data pipeline development
- Lead initiatives for model optimization, scalability, and deployment readiness
- Support strategic decisions related to technology adoption and partnerships
Required Skills & Qualifications
- PhD or Master's degree in Speech Processing, Computational Linguistics, Electrical Engineering, Computer Science, or a related field
- 7+ years of industry experience in speech and audio processing
- Strong expertise in one or more of: ASR, TTS, speaker diarization, speech enhancement, or speech translation
- Strong programming skills in Python with experience in PyTorch or TensorFlow
- Experience with large-scale model training and distributed systems
- Ability to convert complex business problems into structured research solutions
- Strong communication skills and experience mentoring technical teams
Nice to Have
- Experience with large language models and multimodal AI architectures
- Experience optimizing models for edge or on-device deployment
- Research publications in leading speech or AI conferences
- Patent contributions in speech or audio technology