Search by job, company or skills

I

AI DevOps Tech Lead

Save
new job description bg glownew job description bg glow
  • Posted 4 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Introduction

At IBM Infrastructure & Technology, we design and operate the systems that keep the world running. From high-resiliency mainframes and hybrid cloud platforms to networking, automation, and site reliability. Our teams ensure the performance, security, and scalability that clients and industries depend on every day. Working in Infrastructure & Technology means tackling complex challenges with curiosity and collaboration. You'll work with diverse technologies and colleagues worldwide to deliver resilient, future-ready solutions that power innovation. With continuous learning, career growth, and a supportive culture, IBM provides the opportunities to build expertise and shape the infrastructure that drives progress.

Your Role And Responsibilities

As a Senior DevOps Platform Engineer, you will be responsible for designing, building, and maintaining a standardized, reusable Devops platform that enables consistent, secure, and efficient delivery of multiple components and teams within a cornerstone initiative of IBM's AI infrastructure portfolio.

This role focuses on enterprise-scale DevOps platform engineering, providing secure, reliable, and highly automated delivery pipelines. You will act as a technical leader, driving DevOps strategy, defining standards, modernizing delivery pipelines using the latest tools and practices, and mentoring engineers to improve overall DevOps maturity.

The CI/CD platform you build will enable the faster and secure development and delivery of multiple components by providing a standardized, scalable, and reusable CI/CD framework shared across teams as well as different architectures.

Preferred Education

Bachelor's Degree

Required Technical And Professional Expertise

  • Design, build, and own a shared architecture-agnostic CI/CD platform across teams.
  • Implement and enhance end-to-end robust delivery pipelines from development and test through staging and production.
  • Design and implement automated testing frameworks for across various test buckets including Component level tests, unit tests, Operation and model-level tests.
  • Collaborate with different component owners and identify the needs in the respective build & test phases and implement it in this common DevOps pipeline.
  • Establish and maintain standardized pipeline templates for build, test, security scanning, packaging, and deployment to align with IBM standards and enforce quality gates, compliance checks, and security controls across all pipelines.
  • Implement controlled deployment strategies such as blue-green, canary, and A/B deployments where applicable.
  • Continuously optimize pipeline performance, reliability, and execution time by adopting latest DevOps tools, technologies, and best practices.
  • Collaborate with senior technical leadership and establish Roadmap for the project.
  • Proactively identify bottlenecks and lead continuous improvement initiatives
  • Act as technical lead for the CI/CD platform initiative by leading architecture and design discussions with senior stakeholders.
  • Provide technical leadership and mentorship, guiding engineers on DevOps, CI/CD, and testing best practices.

Preferred Technical And Professional Experience

  • 12+ years of experience in DevOps, platform engineering, or infrastructure automation.
  • Strong expertise in designing and operating shared, reusable CI/CD platforms supporting multiple teams and components.
  • Strong background in automated testing frameworks and quality engineering
  • Hands-on experience with Docker and modern deployment strategies.
  • Experience leading teams or acting as a technical lead in complex environments.
  • Excellent problem-solving, communication, and stakeholder collaboration skills.
  • Jenkins Mastery with Deep experience in Declarative and Scripted Pipelines, shared libraries, and Jenkins API.
  • Proficient in configuring Jenkins with Git (GitHub) and artifact management tools (Artifactory)
  • Integrate Jenkins with Docker and Kubernetes for containerized application builds and deployments.
  • Strong skills in Python, Bash, and Groovy scripting for pipeline customizations

Nice-to-Have

  • Experience with Kubernetes and cloud-native AI platforms.
  • Working knowledge of AI/ML frameworks, including: PyTorch
  • Building scalable components for monitoring and benchmarking of LLM-powered features; experience working with LLMs will be advantageous.
  • Working knowledge of LLMs and their performance metrics is desirable, particularly in the context of monitoring within DevOps pipelines.
  • Experience with managing both dev, test, and production level containers with proper meta-data such as Manifests, tagging, and export control.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 147507279