IBM

Site Reliability Engineer

5-10 Years
  • Posted 6 days ago

Job Description

Role Summary

IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

Your Role and Responsibilities

In this Site Reliability Engineer role, you will work closely with the entire IBM Cloud organization to maintain and operationally improve the IBM Cloud infrastructure. Your key responsibilities are:

  • Respond promptly to production issues and alerts 24x7.
  • Execute changes in the production environment through automation.
  • Implement and automate infrastructure solutions that support IBM Cloud products and services to reduce toil.
  • Partner with other SRE teams and program managers to deliver mission-critical services to IBM Cloud.
  • Build new tools to improve automated resolution of production issues.
  • Monitor production systems and alerts.
  • Support the compliance and security integrity of the environment.
  • Continually improve systems and processes regarding automation and monitoring.

Required Education

  • Bachelor's Degree

Preferred Education

  • Master's Degree

Required Technical and Professional Expertise

  • Excellent written and verbal communication skills.
  • At least 5 years of experience handling large production system environments.
  • Must be extremely comfortable using and navigating within a Linux environment.
  • Ability to do low-level debugging and problem analysis by examining logs and running Unix commands.
  • Must be efficient in writing and debugging scripts.
  • 3-5+ years of experience in virtualization technologies and automation/configuration management.
  • Automation and configuration management tools/solutions: Ansible, Python, bash, Terraform, GoLang etc. (at least one).
  • Virtualization technologies: Citrix Xen Hypervisor (Preferred), KVM (also preferred), libvirt, VMware vSphere, etc. (at least one).
  • Monitoring technologies: Zabbix, Sysdig, Grafana, Nagios, Splunk, etc. (at least one).
  • Working knowledge of container technologies: Kubernetes, Docker, etc.
  • Flexibility to work in shifts to support production systems.

Preferred Technical and Professional Experience

  • Good experience with public cloud platforms and Kubernetes clusters, and strong Linux skills for managing services across a microservices platform.
  • Good SRE knowledge of cloud compute, storage, and network services.

More Info

Open to candidates from: Indian

About Company

At IBM, we do more than work. We create. We create as technologists, developers, and engineers. We create with our partners. We create with our competitors. If you're searching for ways to make the world work better through technology and infrastructure, software and consulting, then we want to work with you.

Job ID: 117901499
