

Speech AI Research Engineer
About the Role
We are building structured, high-quality voice datasets for frontier AI companies working on speech-to-text, speech-to-speech, and multimodal AI systems.
We are looking for a Machine Learning Researcher with a focus on voice and speech AI — someone who can rigorously evaluate datasets across evolving speech models, identify performance gaps across Indic and global languages, and publish those findings as structured research for the broader AI community.
This role sits at the intersection of benchmarking, linguistic diversity, and data strategy. If you are deeply curious about how models fail — especially across underrepresented languages and accents — this is built for you.
What You'll Own
Cross-Model Benchmarking & Evaluation
Model Gap Analysis — Indic & Global Languages
Dataset Quality & Supplier Scoring
Research Publishing & Community Presence
What We're Looking For
Technical Skills
Ideal Mindset
What Success Looks Like in 90 Days
Job ID: 145806079
Skills:
Deep Learning, PyTorch, Python, ASR, Transformers, LLMs, Distributed Training, Retrieval-Augmented Generation, Agentic Systems, Speech Understanding, Optimization, Efficient Inference