Senior Applied Research Engineer

Spyne

Gurugram, Gurugram, India

2-5 Years

This job is no longer accepting applications

Posted 3 months ago

Job Description

Senior Applied Researcher, AI (Computer Vision / Multimodal)

Location: Gurgaon (In-Office)

Function: AI

Reports To: CTO

Spyne is transforming automotive retail with Generative AI. What began with a simple idea (help dealers sell cars online faster) has evolved into the world's first AI-powered automotive retail ecosystem, reshaping how 350,000+ dealers market and sell vehicles globally.

We are backed by $16M from Vertex Ventures, Accel, and leading investors. With 5× revenue growth in 15 months, we're now pushing for 3-4× more this year. We've launched industry-first AI image and 360° solutions, expanded into the US and Europe, and are now building a full-scale GenAI Automotive Retail Suite.

Explore more:

Studio AI Product: YouTube

VSmartView (Vini Teaser): YouTube

Public News: Spyne raises $16 Million

More: New Office, One of the podcasts, Dave as Head of US Sales

Job Description

We are looking for a Research Lead / Senior Applied Researcher (AI/ML/CV) to join Spyne's Computer Vision team. This is a senior IC (individual contributor) role for someone who enjoys inventing new architectures, pushing boundaries in multimodal intelligence, and building large-scale AI systems that ship to production.

This is not an academic paper-writing job. This is a "build the thing nobody has built before" job.

What You Will Work On

You will design foundational models and agent frameworks across:

Generative Vision

Premium automotive image creation (Diffusion, 3D, NeRFs, SDFs, Gaussian Splatting)

Voice Intelligence

Real-time voice agents (ASR, TTS, latency-optimized dialogue models)

Text Intelligence

Reasoning agents for dealer workflows (lead handling, follow-ups, service flows)

Multimodal & Cross-Modal Learning

Vision ↔ Text ↔ Audio alignment
Embeddings, routing models, and representational unification

Self-Improving Systems

RLHF pipelines
Data-engineered tuning loops
Autonomous feedback-driven model refinement

You will be the architect who prototypes, validates, breaks, rebuilds, and productionizes new ideas before they scale across the company.

Your Responsibilities

Design and train novel CV, LLM, and multimodal architectures optimized for dealership workflows.
Build generative pipelines that convert raw images, video, or speech into structured, high-value outputs.
Execute applied research in diffusion models, retrieval-augmented LLMs, 3D reconstruction, and voice models.
Produce clean, reproducible research code with rigorous experimentation.
Collaborate with infrastructure teams to deploy models at scale with low latency and high uptime.
Work closely with the AI Research Manager and Product teams to define future research directions.

Job Requirements

PhD in CS/Engineering from a top university OR Master's with strong CV/GANs experience.
Publications in peer-reviewed conferences (NeurIPS, CVPR, ICML, ICLR, ICCV, ACL) preferred.
2-5 years of experience in Computer Vision, Deep Learning, Generative Modeling, or LLM research.
Deep understanding of diffusion, transformers, 3D vision, and multimodal architectures.
Ability to train both large and small language models, and debug complex training behaviours.
Strong grounding in optimization, probabilistic modeling, and representation learning.
Experience deploying research models to production is a strong advantage.
Demonstrated ability to independently explore, prototype, and ship results.

Preferred Background

Experience with one or more of the following: