Research Engineer

humyn labs

Bengaluru, India

2-4 Years

Save

Posted a month ago
Be among the first 10 applicants

Early Applicant

Job Description

Speech AI Research Engineer

About the Role

We are building structured, high-quality voice datasets for frontier AI companies working on speech-to-text, speech-to-speech, and multimodal AI systems.

We are looking for a Machine Learning Researcher with a focus on voice and speech AI — someone who can rigorously evaluate datasets across evolving speech models, identify performance gaps across Indic and global languages, and publish those findings as structured research for the broader AI community.

This role sits at the intersection of benchmarking, linguistic diversity, and data strategy. If you are deeply curious about how models fail — especially across underrepresented languages and accents — this is built for you.

What You'll Own

Cross-Model Benchmarking & Evaluation

Benchmark voice datasets across ASR and speech models (Whisper, Deepgram, Google STT, Azure Speech, and emerging open-source models)
Measure performance using WER, CER, MOS, robustness, latency, and error pattern analysis
Design structured experiments to understand how dataset characteristics impact model accuracy
Compare performance across multilingual, dialect-heavy, emotional, and noisy speech data

Model Gap Analysis — Indic & Global Languages

Systematically identify where speech models underperform across: Indic languages and dialects (Hindi, Tamil, Telugu, Bengali, Kannada, etc.), code-switching and transliteration, emotional and conversational speech, low-resource language scenarios, and background noise / real-world audio conditions
Quantify model weaknesses through structured, reproducible analysis
Map performance gaps to specific dataset requirements — you will help define what data models actually need next

Dataset Quality & Supplier Scoring

Build a standardized dataset quality scoring rubric with measurable criteria: audio clarity, speaker diversity, annotation accuracy, emotion depth, and accent/dialect coverage
Tag and rank data suppliers based on objective quality signals

Research Publishing & Community Presence

Publish benchmarking findings as blog posts and LinkedIn articles accessible to both technical and non-technical audiences
Contribute to internal evaluation reports tracking performance shifts as new models are released
Stay current on evolving speech model architectures and share outside-in insights with AI research teams and clients

What We're Looking For

2-4 years of experience in speech AI, audio ML, NLP, or applied AI research
Hands-on experience with ASR/TTS systems and understanding of model behaviour
Exposure to running experiments, evaluating models, and designing evaluation frameworks
Strong Python skills and comfort with ML experimentation workflows
Genuine interest in linguistic diversity — particularly Indic languages — and how models perform across them
Strong written communication skills with the ability to turn research into clear, publishable content

Technical Skills

Python (mandatory)
PyTorch or TensorFlow
Whisper, SpeechBrain, Kaldi, or similar toolkits
Familiarity with WER, CER, MOS, SNR metrics
Experience with multilingual or low-resource datasets (preferred)

Ideal Mindset

Curious about model failure modes, not just model capabilities
Analytical and detail-oriented, with a bias for reproducibility
Comfortable reading research papers and independently testing new APIs
Excited to share work publicly — blogs, LinkedIn, open datasets

What Success Looks Like in 90 Days

Benchmarking framework live across at least 3 speech models
First published blog post or LinkedIn article on model gaps in Indic languages
Dataset quality scoring rubric operational
At least 2 model gap analyses mapped to concrete data recommendations

More Info

Job Type:

Industry:

Function:

Employment Type:

About Company

humyn labsJob Source: www.linkedin.com

Job ID: 145806079

Jobs by Skill - IT

Jobs by Skill - Non IT

International Jobs

Last Updated: 20-05-2026 00:33:44 PM

Homejobs in Bengaluru / BangaloreResearch Engineer

Similar Jobs

Principal AI Research Engineer

Level AI

6-8 yrs

Bengaluru, India

Skills:

Deep Learning, Pytorch, Python, ASR, Transformers, LLMs, distributed training, retrieval-augmented generation, agentic systems, speech understanding, Optimization, efficient inference

Senior Research Engineer

Netradyne

6-8 yrs

Bengaluru, India

Skills:

Machine Learning, Python, Computer Vision, Deep Learning, Data-Driven Solutions, Edge Deployments, Driver Assistance Algorithms, Machine Learning Infrastructure

AI Research Engineer

EMERGEnT

5-8 yrs

Bengaluru, India

Skills:

Python, Statistical Analysis, eval frameworks, Transformers, Data Processing, RL for agents, DPO, Go, RLHF