Primary Title: Generative AI Engineer
Industry & Sector: A fast-growing IT services and enterprise software firm operating in Enterprise Generative AI, Cloud ML & Data Engineering. The team builds production-grade LLM-based products and integrations that power intelligent search, automation, and decision support for enterprise customers across industries.
Location & Workplace: Chennai
About The Opportunity
We are hiring a hands-on Generative AI Engineer to design, implement, and productionize LLM-driven features and pipelines. You will translate research prototypes into scalable, secure services, building retrieval-augmented generation (RAG), embeddings pipelines, fine-tuning workflows, and high-throughput inference systems that deliver measurable business outcomes.
Role & Responsibilities
- Design and implement end-to-end GenAI solutions: embeddings generation, vector indexing, RAG pipelines, fine-tuning and serving LLMs for production workloads.
- Develop and maintain backend services (Python) that integrate transformers, LangChain workflows, and vector search to expose robust APIs for applications.
- Build and optimize embedding, retrieval, and inference pipelines to improve latency, throughput, and cost efficiency for live traffic.
- Containerize and deploy model serving components; collaborate on CI/CD, model versioning, and automated rollout strategies for model updates.
- Implement monitoring, observability and automated tests for model drift, data quality and inference performance.
- Partner with Product and Data Science teams to translate requirements into technical design and mentor engineering peers on GenAI best practices.
Skills & Qualifications
Must-Have
- Python
- PyTorch
- Hugging Face Transformers
- LangChain
- FAISS
- Docker
Preferred
- Kubernetes
- Milvus
- AWS SageMaker
Benefits & Culture Highlights
- Work directly on cutting-edge GenAI products with clear customer impact and fast release cycles.
- Learning-first culture with access to training, conferences, and R&D time for experiments.
- Collaborative, engineering-driven environment focused on scalable, production-ready solutions.
To apply, bring proven hands-on experience building and shipping LLM-based features, strong software engineering discipline, and a passion for operationalizing GenAI at scale. The role is on-site in India and prioritizes candidates comfortable working in fast-paced product delivery cycles.
Skills: python,llm,langchain,docker