
Search by job, company or skills
About Client:
Our Client is a leading technology solutions company with a strong engineering pedigree. Headquartered in Pune, India, they are part of the USD 4.4 billion RPG Group, serving over 145 global clients. It has been a public-listed company for over 50 years, disrupting the status quo through our deep engineering capabilities, innovation, and velocity.
Our 10,500+ workforce across 30+ global locations delivers cutting-edge digital solutions, unlocks growth, and empowers our clients to thrive and succeed in a world of constant change.
Job Title :Generative AI Engineer
Location:Pune
·Experience: 9- 12 Years
Work Mode : WFO
Job Type : Full Time
Notice Period:- 0-30 Days
Detailed JD:
Key SkillS: GEN AI+Python
Key Responsibilities:
• Generative AI Systems Design: Architect scalable, high-performance applications powered by Generative AI models (e.g., LLMs, diffusion models), with a focus on real-world usability and business integration.
• Prompt Engineering & RAG Pipelines: Design and implement optimized prompt strategies and Retrieval-Augmented Generation (RAG) workflows using LangChain and similar frameworks.
• Agentic AI & Autonomous Workflows: Develop and manage intelligent agents capable of multi-step reasoning, planning, and decision-making using frameworks like LangChain Agents or custom implementations.
• Vector Store Integration: Design and integrate vector databases for semantic search and document retrieval in GenAI workflows.
• End-to-End AI Application Development: Build and deploy AI-driven applications from scratch using Python and modern ML/AI toolkits. Ensure clean integration with APIs, databases, and frontend services.
• Microservices & API Development: Create modular, maintainable backend services using microservices architecture. Ensure RESTful APIs for smooth communication.
• CI/CD & DevOps: Develop and maintain CI/CD pipelines for model and app deployment. Ensure reproducibility, automated testing, and smooth rollout using tools like GitHub Actions, GitLab CI/CD, or Jenkins.
• Database Management: Use PostgreSQL and NoSQL databases effectively in storing structured and unstructured data, with focus on performance, indexing, and scaling.
• Performance Optimization: Continuously monitor, benchmark, and optimize applications for performance, scalability, and cost-efficiency across cloud and on-prem environments.
• Collaboration & Mentorship: Work closely with product managers, AI researchers, and junior developers. Lead by example and provide technical mentorship in AI system design and production-readiness.
Required Qualifications:
• Technical Expertise:
Strong proficiency in Python (mandatory) and experience with AI/ML libraries like FastAI, Hugging Face Transformers, and OpenAI APIs.
Experience implementing LangChain workflows, prompt chaining, agents, tools, and memory systems.
Hands-on experience building RAG systems using vector DBs like FAISS, Pinecone, Weaviate, or Chroma.
Solid understanding of Generative AI technologies (LLMs, image/video generation, embeddings).
Working knowledge of Agentic AI concepts for building autonomous tools and applications.
• Backend & Architecture:
Strong skills in system design, API development, and microservices-based backend architecture.
Experience with PostgreSQL and NoSQL databases (e.g., MongoDB, DynamoDB).
• Cloud & DevOps:
Familiarity with CI/CD pipelines, containerization (Docker), and cloud deployments (AWS/GCP/Azure).
Git-based version control and collaborative development workflows.
• Communication & Leadership:
Ability to translate AI research into production-ready applications.
Strong collaboration skills and experience leading engineering efforts or mentoring peers.
Preferred (Nice to Have):
• Experience working with LLMOps tools
• Exposure to data labeling, fine-tuning, or custom LLM training.
• Familiarity with streaming architectures or real-time AI inference.
Interested candidate can also share their CV at [Confidential Information]
Job ID: 144784387
Skills:
Gcp, Databricks, Azure, Python, AWS, embeddings, LLaMA, LLMs, Crew.ai, Claude, model orchestration, prompt engineering, AutoGen, LangGraph, Mistral, OpenAI, Gemini, LlamaIndex, synthetic datasets, agentic frameworks
Skills:
Aws Lambda, Python, Api Gateway, AWS, LangChain, Generative AI, AWS Bedrock, LLMs, RAG pipelines, Agent Core
Skills:
Tensorflow, Pytorch, Gcp, AWS, Python, Azure, LangChain, OpenAI, Airflow, LlamaIndex, RAG architecture, Llama, Azure OpenAI, embeddings, Weaviate, Pinecone, Milvus, similarity search, Kubeflow, MLflow, Hugging Face Transformers, knowledge-grounded generation, Anthropic, FAISS, Chroma
Skills:
Azure, Python, MLops, GPT, LlamaIndex, LangChain, Gemini, LLaMA, OpenAI APIs
Skills:
data engineering , Deep Learning, Tensorflow, Pytorch, Nlp, Api Development, Cloud Computing, Python, ML Ops, Model Optimization
We don’t charge any money for job offers