Senior Machine Learning Engineer

Recro

India

4-10 Years

Save

Posted 2 days ago
Be among the first 10 applicants

Early Applicant

Job Description

Senior Machine Learning Engineer (Search, Recommendations & Conversational AI)

Location: Remote

Role Type: Senior IC (P3+)

Team: Discovery Search, Recommendations & AI Experiences

Urgency: Immediate (preIPL launch milestones)

About the Role

We are building next-generation conversational search, personalized recommendations, and AI-driven discovery for one of India's largest entertainment ecosystems. This role is for a hands-on ML Engineer who can design, train, and productionize models powering search relevance, retrieval, personalization, and LLM-based conversational experiences at massive scale.

You will work closely with backend, platform, and catalog enrichment teams to deliver high-quality ML components under tight performance and latency constraints.

Key Responsibilities

Build and improve search ranking, retrieval, and query understanding models.
Develop ML components for Conversational Search:
Multi-turn context handling
Query intent detection and classification
Retrieval-augmented generation (RAG) pipelines
Reasoning workflows (ReAct, static + dynamic agent flows)
Design and optimize embedding models, vector stores, and similarity search systems.
Build personalized ranking and recommendation models using deep learning.
Work on large-scale ML systems optimized for:
Low latency
High throughput
Cost-efficient inference
Implement ML pipeline best practices (versioning, monitoring, A/B testing, observability).
Collaborate with platform teams to integrate ML services across search, recommendations, and conversational agents.
Develop caching strategies (prompt cache, vector cache, similarity caching) to hit strict SLA targets.
Contribute to long-term roadmap: foundational retrieval models, multi-objective optimization, user lifecycle modeling.

Required Qualifications

410 years of experience in Machine Learning / Applied ML engineering.
Strong foundations in ML, deep learning, Transformers, and neural retrieval.
Hands-on experience with:
Search systems (retrieval + ranking)
Recommendation models
Embedding models & vector databases
TensorFlow / PyTorch
Proven experience building production-grade ML systems at scale.
Familiarity with LLMs, RAG architectures, prompt engineering, and agent workflows.
Strong coding skills (Python) and experience with modern ML stack (TensorFlow, PyTorch, Faiss/ScaNN, Triton, etc.).
Ability to work closely with backend teams to deploy models in distributed systems.
Excellent problem-solving skills and comfort working on ambiguous, high-impact problems.

Preferred Qualifications

Experience with conversational AI, chat-based retrieval, or multi-turn dialog modeling.
Experience in media, streaming, sports data or large catalog discovery.
Knowledge of micro-drama, short-video personalization, or multi-objective recommendation systems.
Strong understanding of scalability patterns: batching, async orchestration, caching layers.

Why Join

Work on flagship launches (World Cup IPL) impacting hundreds of millions of users.
Solve some of the most challenging problems in search, discovery, and conversational AI at scale.
Collaborate with a world-class team building foundational discovery platforms for India's largest digital ecosystem.