Role Description
Site Reliability of our Development, Test & Prod environments hosted in Azure
- Driving operational excellence for Payments Cloud services to deliver an always on operation, year-round, at the right cost
- Rollout of Infrastructure, Operating System and Application updates with no impact to consumers
- Experience with implementing end to end monitoring & alerting
- Implementing and Delivering robust Infrastructure as code
- Managing desired state configuration of Java Applications hosted on Cloud
- Leading Root Cause Analysis through Blameless Post Mortems of Incidents and Failure Mode Analysis
- Should prepare Run Books, Training Material and conducted sessions
- Converts OPS issues into Stories to fix root cause
Key Responsibilities
Own value stream and application issue resolution to completion