Machine Learning LLM Serving Engineer - Cloud

Job Description

We're looking for Machine Learning LLM Serving Engineers - Cloud!

Responsibilities

  • Improve and optimize key Deep Learning models on Qualcomm AI 100.
  • Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories.
  • Implement kernels for AI workloads.
  • Collaborate and interact with internal teams to analyze and optimize training and inference for deep learning.
  • Build software tools and an ecosystem around the AI software stack.
  • Work on vLLM, Triton, ExecuTorch, Inductor, and TorchDynamo to build abstraction layers for the inference accelerator.
  • Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems.
  • Optimize the entire deep learning pipeline including graph compiler integration.
  • Apply knowledge of software engineering best practices.

Desirable Skills And Aptitudes

  • Deep learning knowledge: LLMs, natural language processing, vision, audio, and recommendation systems.
  • Knowledge of the structure and function of the different components of the PyTorch and TensorFlow software stacks.
  • Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design.
  • Ability to work independently, define requirements and scope, and lead your own development effort.
  • Well versed in open-source development practices.
  • Strong developer with a research mindset who strives to innovate.
  • Avid problem solver, able to find solutions to key engineering and domain problems.
  • Knowledge of tiling and scheduling machine learning operators is a plus.
  • Familiarity with C++14 (advanced features).
  • Familiarity with profiling software and optimization techniques.
  • Familiarity with writing SIMD and/or multi-threaded high-performance code is a plus.
  • Proficiency in ML compilers and automatic code generation (using MLIR) is a plus.
  • Familiarity with running workloads on large scale heterogeneous clusters is a plus.
  • Familiarity with CUDA and cuDNN is a plus.

Qualifications

  • Bachelor's degree in Engineering, Machine Learning/AI, Information Systems, Computer Science, or a related field.

Minimum Qualifications

  • Bachelor's degree in Engineering, Information Systems, Computer Science, or a related field.

(*Note: This is a requirement for one of Scoutit's clients)

Skills: C, Python, machine learning

Job ID: 145628287