A technology and infrastructure company operating in Enterprise AI and Cloud Infrastructure, building LLM-powered applications and intelligent automation for enterprise operations. We focus on delivering production-grade generative AI solutions that drive operational efficiency and product innovation across industries.
Primary job title: Generative AI / LLM Engineer
Role & Responsibilities
- Design, fine-tune, and productionize LLM models and workflowsdata ingestion training/fine-tuning evaluation inference.
- Implement scalable model serving and inference endpoints with performance optimizations (quantization, batching, caching) to meet SLAs.
- Integrate LLMs with orchestration frameworks and toolchains (LangChain), vector search, and external APIs for retrieval-augmented generation (RAG) solutions.
- Build reproducible ML pipelines and CI/CD for model training, deployment, monitoring, and automated retraining using containerization.
- Collaborate with product, data, and security teams to translate requirements into secure, compliant, and observable LLM features.
- Drive cost and performance optimization across model selection, serving infrastructure, and inference strategies; maintain model versioning and rollback processes.
Skills & Qualifications
Must-Have
- Python
- PyTorch
- Hugging Face Transformers
- LangChain
- FAISS
- Docker
Preferred
Qualifications
- Proven track record deploying LLM-based systems in production with attention to latency, throughput, and reliability.
- Strong fundamentals in NLP/ML model evaluation, prompt engineering, bias mitigation, and responsible AI practices.
- Ability to work in a hybrid model from India and collaborate effectively with distributed engineering teams.
Benefits & Culture Highlights
- Hybrid work model with focused in-office collaboration and flexible remote days.
- Competitive compensation, learning stipend, and access to cutting-edge AI tooling and compute.
- High-impact, fast-paced environment with opportunities to own end-to-end AI products.
Skills: pytorch,automation,kubernetes,python,docker