Job Description
Role - Gen AI Engineer
Experience- 4 - 7 Years
Budget - 23 LPA
Location - GGN
Notice Period - Immediate joiner Only
About The Role
We are seeking a highly motivated
Generative AI Engineer with hands-on experience in
LLMs,
RAG pipelines, and
backend systems to join our growing AI team. You will design, implement, and optimize GenAI-powered applications, with a focus on creating robust RAG solutions integrated into real-world products.
Key Responsibilities
- Design and develop RAG-based systems using state-of-the-art LLMs (e.g., GPT, LLaMA, Mistral, Claude).
- Build and maintain pipelines for document ingestion, chunking, embedding, and vector search (e.g., FAISS, Weaviate, Pinecone).
- Integrate LLMs into product workflows, optimizing for performance, latency, and cost.
- Work on backend services (e.g., REST APIs, microservices) to expose GenAI capabilities to applications.
- Collaborate with data engineers, product managers, and other developers to deploy AI features at scale.
- Implement evaluation frameworks for GenAI outputs and continuously improve prompt engineering and retrieval accuracy.
- Stay current with the latest developments in LLMs, RAG, and open-source AI tools.
Required Skills & Experience
- 3+ years of experience in software engineering with at least 1+ year working with GenAI / LLMs / RAG.
- Strong Python skills and experience with GenAI frameworks like LangChain, LlamaIndex, or Haystack.
- Practical experience with embedding models (e.g., SentenceTransformers, OpenAI Embeddings) and vector databases.
- Solid understanding of RAG architecture and ability to fine-tune or customize it.
- Experience with backend development using frameworks like FastAPI, Flask, or Node.js.
- Familiarity with deploying models using Docker, Kubernetes, and cloud platforms (e.g., AWS, GCP, Azure).
- Strong problem-solving and debugging skills.
Required Skills
[Generative AI]
Additional Information
Only Immediate Joiner