Generative AI Engineer.
Role & Responsibilities
- Design, fine-tune, and deploy generative language models (LLMs) to power recruitment automation, rsum parsing, candidate matching, and conversational interfaces.
- Implement RAG pipelines: build document ingestion, vectorization, indexing and retrieval layers to enable accurate and low-latency QA and assistant flows.
- Develop production-ready microservices for inference and orchestration using containerization (Docker) and CI/CD practices; integrate with internal APIs and data stores.
- Optimize model performance and cost: apply quantization, LoRA/PEFT fine-tuning, batching, and efficient serving strategies for on-prem or cloud deployments.
- Collaborate with data scientists and product managers to translate business requirements into evaluation metrics, A/B tests and monitoring dashboards for drift, latency and accuracy.
- Establish engineering best practices for reproducibility, model versioning, observability and secure data handling across the ML lifecycle.
Skills & Qualifications
Must-Have (Technical Skills)
- Python
- PyTorch
- Hugging Face Transformers
- LangChain
- FAISS
- Docker
Preferred (Qualifications & Nice-to-haves)
- Bachelor's or Master's in Computer Science, Engineering, or related field.
- 3+ years of hands-on experience in ML/LLM model development and production deployment.
- Familiarity with cloud platforms (AWS / GCP / Azure), Kubernetes and MLOps tools (MLflow, Weights & Biases) is advantageous.
Benefits & Culture Highlights
- Work on impactful HR products using state-of-the-art GenAIdirect product-to-user feedback loop.
- Collaborative on-site team environment with clear ownership, rapid release cycles and mentorship opportunities.
- Competitive compensation, learning budget for conferences/certifications, and career growth into technical leadership.
Location: India (On-site). We welcome high-impact engineers who are passionate about practical GenAI applications in HR and are ready to ship reliable, ethical, and scalable solutions.
Skills: docker,gen ai,pytorch,python