Search by job, company or skills

IBM

Site Reliability Engineering Professional

10-12 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted 29 days ago
  • Be among the first 40 applicants
Early Applicant
Quick Apply

Job Description

IBM's product and technology landscape includes Research, Software, and Infrastructure. Entering this domain positions you at the heart of IBM, where growth and innovation thrive.

Your Role and Responsibilities

In this Site Reliability Engineer role, you will build and maintain an observability stack for IBM's Cloud Object Storage service using managed services as well as custom built services. This stack is used by Cloud Object Storage SREs and devs to understand the health of the service. Work duties and responsibilities include:

  • Design, setup, configure and implement the COS Monitoring System using technologies such as Elasticsearch, Logstash, Kibana, Kafka, Kafka Mirrors, Filebeat, Grafana and Sysdig.
  • Automate CICD tasks and infrastructure using Ansible, Terraform, Jenkins, and Travis.
  • Experience with microservices and distributed application architecture, such as containers and Kubernetes.
  • Experience with Linux administration and programming languages such as Java, Python and SQL.
  • Performance and configuration tuning to support the increasing load of data flowing into the COS Monitoring System.
  • Provide design recommendations and thought leadership to provide best-in-class observability as part of the COS Monitoring System.
  • Provide 24x7 on-call customer support on a rotational basis.
  • Design and develop dashboards for metrics analysis.
  • Design, Develop and Configure an alerting solution for an end-to-end incident management and recovery process by integrating Sysdig with Pagerduty, Email and Slack.

Required Education

  • Bachelor's Degree

Required Technical and Professional Expertise

  • Ability and tenacity to solve increasingly complex technical issues through analysis and a variety of problem-solving techniques.
  • Working knowledge of Object-Oriented Python with demonstrable experience in applying these skills.
  • Working knowledge of Linux environments.
  • Experience working in an Agile-Scrum development environment.
  • Experience using tools such as Jira, GitHub and Logging and monitoring tools.
  • BS in CS, CE or similar field, plus 10-12 years relevant work experience.

More Info

Job Type:
Industry:
Function:
Employment Type:
Open to candidates from:
Indian

About Company

At IBM, we do more than work. We create. We create as technologists, developers, and engineers. We create with our partners. We create with our competitors. If you're searching for ways to make the world work better through technology and infrastructure, software and consulting, then we want to work with you.

Job ID: 117929259

Similar Jobs