Experience: 4.00 + years
Salary: INR 6000000-10000000 / year (based on experience)
Expected Notice Period: 30 Days
Shift: (GMT+05:30) Asia/Kolkata (IST)
Opportunity Type: Office ()
Placement Type: Full Time Permanent position(Payroll and Compliance to be managed by: Stealth Start-up Dummy)
(*Note: This is a requirement for one of Uplers client - Stealth Start-up Dummy)
What do you need for this opportunity
Must have skills required:
Linux, Cloud, DevOps, Kubernetes, ML, Terraform, vLLM
Stealth Start-up Dummy is Looking for:
We're looking for an AI Cloud Engineer at to own our end-to-end inference infrastructure powering AI experiences for millions of homes. You'll build and operate low-latency, high-throughput LLM serving systems, obsessing over p99 latency, cost efficiency, and reliability at scale.
Responsibilities:
- Own and operate the end-to-end LLM inference infrastructure across cloud environments
- Architect and scale high-throughput model serving systems with sub-100ms latency targets
- Design and implement custom caching layers, batching strategies, and inference optimizations
- Optimize cost-per-token, GPU utilization, and system efficiency at scale
- Build and maintain cloud-native infrastructure using Kubernetes, Terraform, and modern DevOps practices
- Collaborate with ML, product, and platform teams to ship reliable, production-grade AI systems
Requirements:
- 4+ years of experience in ML infrastructure, platform engineering, or large-scale backend systems
- Hands-on experience with LLM serving frameworks such as vLLM, TGI, or similar
- Strong expertise in Kubernetes, Terraform, and cloud platforms (AWS and/or GCP)
- Experience with inference optimization techniques including KV-cache optimization, speculative decoding, or dynamic batching
- Strong Linux fundamentals; Rust experience is a plus
Bonus: experience with GPU cost modeling, multi-region inference, and traffic routing
Interview Process:
- L1: Technical discussion
- L2: Deep-dive technical round (Architecture)
- L3: Culture fit
How to apply for this opportunity
- Step 1: Click On Apply! And Register or Login on our portal.
- Step 2: Complete the Screening Form & Upload updated Resume
- Step 3: Increase your chances to get shortlisted & meet the client for the Interview!
About Uplers:
Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement.
(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).
So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!