Lead Generative AI Engineer

Tekskills Inc.

Pune, India

8-10 Years

This job is no longer accepting applications

Posted 2 months ago

Job Description

Role: Lead Generative AI Engineer

Experience: 8+ years

Location: Gurgaon / Pune

Mode of work :WFO

Must have

Gen AI experience -3Y

Agentic Ai experience -1Y

Must have team leading exp

Role Overview

We are looking for a Senior Gen AI Engineer with proven experience in building and deploying Generative AI and scaled ML systems. This role demands end-to-end ownership of AI solutions from model selection, fine-tuning, and pipeline design to deployment on cloud ML platforms (AWS SageMaker).

The ideal candidate will be capable of architecting multi-agent frameworks and MCP pipelines, integrating them into enterprise systems, and delivering scalable, production-grade AI solutions. Along with strong engineering and system design expertise, this role involves mentoring junior engineers, leading PoCs, and collaborating closely with product and research teams to shape the organization's GenAI roadmap.

Key Responsibilities

Solution Architecture & Design

Architect and implement enterprise-grade generative AI pipelines (LLMs, RAG systems, diffusion models).
Design and optimize multi-component pipelines (MCPs) and agentic frameworks for workflow automation.
Evaluate and implement architectural trade-offs for scalability, latency, and cost efficiency.

Model Development & Scaling

Lead the development of traditional ML and statistical models, integrating them with GenAI systems.
Fine-tune LLMs, build RAG pipelines with vector databases (FAISS, Pinecone, PgVector), and optimize inference performance.
Scale training and deployment on AWS SageMaker using distributed training and monitoring techniques.

Engineering & MLOps

Implement CI/CD for ML using GitLab pipelines, ensuring reproducibility and automation across the model lifecycle.
Develop observability, monitoring, and governance frameworks for deployed AI models.
Collaborate with DevOps and Data Engineering teams to integrate AI services into production environments.

Leadership & Collaboration

Mentor and guide junior Gen AI engineers; review code and establish best practices.
Work with cross-functional teams (Product, Research, Data Engineering) to align AI solutions with business goals.
Lead PoCs, pilots, and research initiatives to explore new frameworks and technologies.

Required Skills

Proven experience in enterprise AI/GenAI solution deployment in production environments.
Expertise in LangChain, LlamaIndex, and other orchestration frameworks.
Exposure to distributed systems, GPU optimization, and cost-efficient model scaling.
Programming & Modeling: Advanced proficiency in Python, PyTorch, TensorFlow, Hugging Face, and scikit-learn.
Generative AI Expertise: Hands-on experience with LLMs, transformers, diffusion models, and RAG architectures.
Agentic & MCP: Strong background in agent-based frameworks and multi-component pipelines.
Scaling AI: Practical knowledge of parallel/distributed training, optimization, and scaled inference.
Cloud ML: Deep expertise in AWS SageMaker (training, hyperparameter tuning, endpoints, monitoring).
MLOps: Proficient in CI/CD, model lifecycle management, and monitoring tools such as GitLab, MLflow, and Kubeflow.
System Thinking: Ability to design solutions considering throughput, latency, fault tolerance, and observability.