Search by job, company or skills

  • Posted 9 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

A technology and infrastructure company operating in Enterprise AI and Cloud Infrastructure, building LLM-powered applications and intelligent automation for enterprise operations. We focus on delivering production-grade generative AI solutions that drive operational efficiency and product innovation across industries.

Primary job title: Generative AI / LLM Engineer

Role & Responsibilities

  • Design, fine-tune, and productionize LLM models and workflowsdata ingestion training/fine-tuning evaluation inference.
  • Implement scalable model serving and inference endpoints with performance optimizations (quantization, batching, caching) to meet SLAs.
  • Integrate LLMs with orchestration frameworks and toolchains (LangChain), vector search, and external APIs for retrieval-augmented generation (RAG) solutions.
  • Build reproducible ML pipelines and CI/CD for model training, deployment, monitoring, and automated retraining using containerization.
  • Collaborate with product, data, and security teams to translate requirements into secure, compliant, and observable LLM features.
  • Drive cost and performance optimization across model selection, serving infrastructure, and inference strategies; maintain model versioning and rollback processes.

Skills & Qualifications

Must-Have

  • Python
  • PyTorch
  • Hugging Face Transformers
  • LangChain
  • FAISS
  • Docker

Preferred

  • OpenAI API
  • LoRA
  • Kubernetes

Qualifications

  • Proven track record deploying LLM-based systems in production with attention to latency, throughput, and reliability.
  • Strong fundamentals in NLP/ML model evaluation, prompt engineering, bias mitigation, and responsible AI practices.
  • Ability to work in a hybrid model from India and collaborate effectively with distributed engineering teams.

Benefits & Culture Highlights

  • Hybrid work model with focused in-office collaboration and flexible remote days.
  • Competitive compensation, learning stipend, and access to cutting-edge AI tooling and compute.
  • High-impact, fast-paced environment with opportunities to own end-to-end AI products.

Skills: pytorch,automation,kubernetes,python,docker

More Info

Job Type:
Industry:
Employment Type:

Job ID: 144006617

Similar Jobs