Principal Speech Scientist

zyoin group

Hyderabad, India

7-12 Years

Posted a month ago

Job Description

Location: Hyderabad – Work From Office (5 Days)

Experience: 7–12 Years

Domain: Speech AI | NLP | Machine Learning | Deep Learning

Role Overview

We are looking for a highly experienced Principal Speech Scientist to lead innovation in speech and language technologies. This is a senior individual contributor role with strong technical leadership responsibilities, focused on defining research direction, mentoring teams, and building production-ready speech AI solutions at scale.

Key Responsibilities

Research Strategy & Technical Leadership

Define and execute the long-term technical vision and roadmap for speech science research
Drive innovation by translating state-of-the-art academic research into scalable production solutions
Provide technical leadership and guide teams on best practices in model development and experimentation

Speech & Language Model Development

Design and develop advanced speech systems including ASR, TTS, speaker verification, diarization, and speech enhancement
Build high-performance models for speech translation and natural language understanding
Develop scalable model pipelines and optimize model performance

Cross-Functional Collaboration

Work closely with product, engineering, and design teams to align research initiatives with business goals
Influence product direction through applied research and technical insights
Contribute to architectural decisions related to AI-driven features

Mentorship & Thought Leadership

Mentor scientists and engineers, fostering a culture of technical excellence and innovation
Represent the organization at conferences and research forums
Contribute to publications, patents, and knowledge sharing within the AI research community

Experimentation & Model Optimization

Establish best practices for experimentation, evaluation metrics, and data pipeline development
Lead initiatives for model optimization, scalability, and deployment readiness
Support strategic decisions related to technology adoption and partnerships

Required Skills & Qualifications

PhD or Master's degree in Speech Processing, Computational Linguistics, Electrical Engineering, Computer Science, or a related field
7+ years of industry experience in speech and audio processing
Strong expertise in ASR, TTS, speaker diarization, speech enhancement, or speech translation
Strong programming skills in Python with experience in PyTorch or TensorFlow
Experience with large-scale model training and distributed systems
Ability to convert complex business problems into structured research solutions
Strong communication skills and experience mentoring technical teams

Nice to Have