Search by job, company or skills

Advanced Micro Devices (AMD)

Principal ML Engineer

4-9 Years

This job is no longer accepting applications

new job description bg glownew job description bg glownew job description bg svg
  • Posted 4 months ago

Job Description

The Role

We are looking for Machine Learning Engineer to join our Models and Applications team. If the challenge of distributed training of large model on large number of GPUs excites you and you are passionate about improving training efficiency and enjoy innovating and coming up with new ideas, then this role is for you.

You will be part of world class team focus on addressing the challenge of training generative AI.

The Person

The ideal candidate should have experience with distributed training pipeline, knowledgeable with distributed training algorithms (Data parallel, Tensor parallel, Pipeline parallel, ZeRO) and familiar with training Large Model.

Key Responsibilities

  • Train large model to convergence on AMD GPUs.
  • Improve the end-to-end training pipeline performance.
  • Optimize the distributed training pipeline and algorithm to scale out.
  • Contribute your changes to open source.
  • Up to date with latest training algorithms.
  • Influence the direction of AMD AI platform.
  • Cross team collaborate with various group and stakeholder.
  • Preferred Experience10+ years of experience.
  • Experience in ML frameworks such as PyTorch, JAX or Tensorflow.
  • Experience with distributed training and distributed training framework such as DeepSpeed.
  • Experience with LLM or Vision, especially large model is a plus.
  • Excellent python programing skills, including debugging, profiling, and perf analysis.
  • Experience with ML pipeline.
  • Strong communication and problem-solving skills.

Academic Credentials

A master s degree in computer science, artificial intelligence, machine learning, or a related field.

More Info

Job Type:
Function:
Employment Type:
Open to candidates from:
Indian

About Company

For nearly 50 years, AMD (NASDAQ: AMD) has driven innovation in high-performance computing, graphics, and visualization technologies the building blocks for gaming, immersive platforms, and the datacenter. Hundreds of millions of consumers, leading Fortune 500 businesses, and cutting-edge scientific research facilities around the world rely on AMD technology daily to improve how they live, work, and play.

Job ID: 122680555

Similar Jobs