Search by job, company or skills

Grapevine Round1 AI

Hiring for AI/ML Engineer | Upto 45-80 LPA

This job is no longer accepting applications

new job description bg glownew job description bg glownew job description bg svg
  • Posted 3 months ago

Job Description

AI Infrastructure Architecture:

  • Design and implement asynchronous multi-agent orchestration
  • Own end-to-end latency from user message to AI response
  • Build resilient inference pipelines that gracefully degrade under load
  • Implement intelligent request routing and load balancing for AI workloads
  • Migrate critical AI conversation flow from monolith to dedicated services
  • Implement WebSocket/streaming infrastructure for real-time chat
  • Design circuit breakers and fallback strategies for AI model failures
  • Build comprehensive observability for AI system performance
  • Optimize credit data retrieval and caching strategies

Requirements

Must-Have Experience:

  • 6+ years building production systems handling >10k concurrent users
  • Proven experience with async/event-driven architectures (not just REST APIs)
  • Hands-on experience scaling ML/AI inference in production
  • Deep understanding of caching strategies (Redis, in-memory, CDN)
  • Experience with message queues and real-time communication protocols

AI-Specific Expertise:

  • Built systems integrating multiple LLM/AI models in production
  • Experience with AI model serving frameworks (TensorFlow Serving, Triton, etc.)
  • Understanding of AI inference optimization (batching, caching, model quantization)
  • Knowledge of conversation state management and context handling
  • Has debugged production issues under high AI inference load

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 124488221