We're looking for
Machine Learning LLM Serving Engineers - Cloud!
Responsibilities
- Improve and optimize key Deep Learning models on Qualcomm AI 100.
- Build deep learning framework extensions for Qualcomm AI 100 in upstream open-source repositories.
- Implement kernels for AI workloads.
- Collaborate and interact with internal teams to analyze and optimize training and inference for deep learning.
- Build software tools and ecosystem around AI SW Stack.
- Work on vLLM, Triton, ExecuTorch, Inductor, and TorchDynamo to build abstraction layers for inference accelerators.
- Optimize workloads for both scale-up (multi-SoC) and scale-out (multi-card) systems.
- Optimize the entire deep learning pipeline including graph compiler integration.
- Apply knowledge of software engineering best practices.
Desirable Skills And Aptitudes
- Deep learning knowledge: LLMs, natural language processing, vision, audio, recommendation systems.
- Knowledge of the structure and function of the different components of the PyTorch and TensorFlow software stacks.
- Excellent C/C++/Python programming and software design skills, including debugging, performance analysis, and test design.
- Ability to work independently, define requirements and scope, and lead your own development effort.
- Well versed in open-source development practices.
- Strong developer with a research mindset who strives to innovate.
- Avid problem solver, able to find solutions to key engineering and domain problems.
- Knowledge of tiling and scheduling machine learning operators is a plus.
- Familiarity with advanced C++14 features.
- Familiarity with profiling software and optimization techniques.
- Familiarity with writing SIMD and/or multi-threaded high-performance code is a plus.
- Proficiency in ML compilers and automatic code generation (using MLIR) is a plus.
- Familiarity with running workloads on large scale heterogeneous clusters is a plus.
- Familiarity with CUDA and cuDNN is a plus.
Qualifications
- Bachelor's degree in Engineering, Machine Learning/AI, Information Systems, Computer Science, or related field.
Minimum Qualifications
- Bachelor's degree in Engineering, Information Systems, Computer Science, or related field.
(*Note: This is a requirement for one of Scoutit's clients)
Skills: C, Python, machine learning