Search by job, company or skills

Uplers

AI Cloud Engineer

Save
new job description bg glownew job description bg glow
  • Posted 2 days ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Experience: 4.00 + years

Salary: INR 6000000-10000000 / year (based on experience)

Expected Notice Period: 30 Days

Shift: (GMT+05:30) Asia/Kolkata (IST)

Opportunity Type: Office ()

Placement Type: Full Time Permanent position(Payroll and Compliance to be managed by: Stealth Start-up Dummy)

(*Note: This is a requirement for one of Uplers client - Stealth Start-up Dummy)

What do you need for this opportunity

Must have skills required:

Linux, Cloud, DevOps, Kubernetes, ML, Terraform, vLLM

Stealth Start-up Dummy is Looking for:

We're looking for an AI Cloud Engineer at to own our end-to-end inference infrastructure powering AI experiences for millions of homes. You'll build and operate low-latency, high-throughput LLM serving systems, obsessing over p99 latency, cost efficiency, and reliability at scale.

Responsibilities:

  • Own and operate the end-to-end LLM inference infrastructure across cloud environments
  • Architect and scale high-throughput model serving systems with sub-100ms latency targets
  • Design and implement custom caching layers, batching strategies, and inference optimizations
  • Optimize cost-per-token, GPU utilization, and system efficiency at scale
  • Build and maintain cloud-native infrastructure using Kubernetes, Terraform, and modern DevOps practices
  • Collaborate with ML, product, and platform teams to ship reliable, production-grade AI systems

Requirements:


  • 4+ years of experience in ML infrastructure, platform engineering, or large-scale backend systems
  • Hands-on experience with LLM serving frameworks such as vLLM, TGI, or similar
  • Strong expertise in Kubernetes, Terraform, and cloud platforms (AWS and/or GCP)
  • Experience with inference optimization techniques including KV-cache optimization, speculative decoding, or dynamic batching
  • Strong Linux fundamentals; Rust experience is a plus

Bonus: experience with GPU cost modeling, multi-region inference, and traffic routing

Interview Process:

  • L1: Technical discussion
  • L2: Deep-dive technical round (Architecture)
  • L3: Culture fit

How to apply for this opportunity


  • Step 1: Click On Apply! And Register or Login on our portal.
  • Step 2: Complete the Screening Form & Upload updated Resume
  • Step 3: Increase your chances to get shortlisted & meet the client for the Interview!

About Uplers:


Our goal is to make hiring reliable, simple, and fast. Our role will be to help all our talents find and apply for relevant contractual onsite opportunities and progress in their career. We will support any grievances or challenges you may face during the engagement.

(Note: There are many more opportunities apart from this on the portal. Depending on the assessments you clear, you can apply for them as well).

So, if you are ready for a new challenge, a great work environment, and an opportunity to take your career to the next level, don't hesitate to apply today. We are waiting for you!

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 148223827

Similar Jobs

Bengaluru, India

Skills:

TensorflowNumpyPytorchGcpPandasDockerAzurePythonKubernetesAWSLangChainHugging FaceMLflowOpenAI APIsTransformers

Bengaluru, India

Skills:

bwa Sap HanaPerlLinuxRhelBashRAID managementPythonLinux network managementOS administrationSLES

Bengaluru

Skills:

JavaSpringbootBashAWS CloudWatchElk StackJenkinsShellTerraformECSDynatraceSplunkKubernetesPythonAWSEKSSpinnaker

Bengaluru, India

Skills:

NetworkingContainersSpringAngularPython ScriptingBash ScriptingCloudJava ProgrammingRest ApisMulti-threadingSecurity ConceptsAI technologyTLS concepts

Bengaluru, India

Skills:

DockerKubernetesAgentic AILLMsRAGMicroservices architecture