Search by job, company or skills

G

SDE III - GPU Engineer

4-9 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted a month ago
  • Be among the first 10 applicants
Early Applicant
Quick Apply

Job Description

We are looking for aSenior Software Engineer (SDE III)who will build, profile, and optimize GPU workloads powering next-generation generative AI experiences fromStable Diffusionimage generation to transformer-based multimodal models.

you'll work closely with research and infrastructure teams to make model inference faster, more cost-efficient, and production-ready.

This role is ideal for engineers passionate aboutpushing GPUs to their limits, writing high-performance kernels, and turning cutting-edge research into scalable systems.

Key Responsibilities

  • Develop, optimize, and maintainGPU kernels(CUDA, Triton, ROCm) for diffusion, attention, and convolution operators.
  • Profile end-to-end inference pipelines (data movement, kernel scheduling, memory transfers) to identify and resolve bottlenecks.
  • Apply techniques likeoperator fusion, tiling, caching, and mixed-precision computeto maximize GPU throughput.
  • Collaborate with researchers to productionize experimental layers or model architectures.
  • Buildbenchmarking toolsand micro-tests for latency, memory, and throughput regressions.
  • Integrate kernel improvements into serving stacks, ensuringreliability and repeatable performance.
  • Work with platform teams to tune runtime configurations and job scheduling for GPU utilization.

Required Qualifications

  • 4+ years of experience in systems or ML engineering, with 2+ years working onGPU or accelerator optimization.
  • Strong hands-on skills withCUDA programming, memory hierarchies, warps, threads, and shared memory.
  • Familiarity with profiling tools (Nsight, nvprof, CUPTI) and performance analysis.
  • Working knowledge of PyTorch, JAX, or TensorFlow internals.
  • Proficiency inC++andPython.
  • Experience withmixed precision, FP16/BF16, or quantization.
  • Deep curiosity about system bottlenecks and numerical correctness.

More Info

About Company

Founded in 2019, Glance is a leading consumer technology company building an industry-defining Al commerce platform that reimagines how people shop using generative Al. Its Al architecture combines predictive intelligence, neural visualization, and real-time orchestration across devices from mobile to TV to apps powering a new era of connected, intelligent commerce.

Glance is backed by Google, Jio Platforms, and Mithril Capital, and operates as an unconsolidated subsidiary of InMobi.

Job ID: 131820745