Team Description
SABRE OFFER OPERATIONS IS LOOKING FOR A TALENTED LEAD SITE RELIABILITY ENGINEER (LEAD SRE)
Roles and Responsibilities
- Responsible for Operational Stability of critical ATSE suite of applications
- Responsible for defining Operational Requirements / arriving at Solutions / Estimating
- Provides requirements/monitors System Performance and Capacity Monitoring
- Leads Incident Management and is responsible for driving incidents to resolution
- Responsible for Production Readiness/Production & for Operational Documentation
- Defines Severity Definitions & Keeper of Sabre Compliance (MOS, Disaster Recovery)
- Advising and collaborating with our architecture and product development team on technical hardware and software issues.
- Primary contact for ATSE suite of applications
- Manages outage and emergency situations with datacenter and application staff
- Ensuring platform availability and addressing operational issues as they arise in accordance with internal and customer SLA's
- Knowledge resource on multiple projects/initiatives across the company for a variety of internal/external customers
- Defining, designing and implementing new and updated hardware and software solutions for production and non-production platforms.
- Coordinating infrastructure maintenance and upgrades with our Application teams
- Assist Change Management tasks, by coordinating them with Release Management and Application teams, as well as implementing changes with assistance from development team
- Maintain platform and applications documentation
- Define, design, and implement disaster recovery and business continuity plans
- Assist in developing and maintaining annual budget and capacity planning
What's in it for you
Working with a state-of-the-art shopping which spreads over multiple regions on 5000+ servers.
Opportunity to do something that has high impact and game changing in our industry
Be part of one of the world's largest Travel and Hospitality technology company
Qualifications And Education Requirements
EDUCATION: Bachelor's degree or equivalent.
Experience
- 8+ years of related experience.
- Excellent written and verbal communication skills.
- Ability to handle multiple projects simultaneously
Mandatory Skills
- Advanced understanding of Unix/Linux, Oracle/SQL
- Network Architecture
- Scripting/Automation
- Analytical capabilities
- DevOps/SRE Working Experience, CI/CD
- Load Balancing Technologies
- Public Cloud Technologies & working experience
- Very good Communication & Leadership skills
Nice To Have Skills
- Advanced understanding of web-based architecture and development techniques
- Experience on tools like JIRA, Service Now, puppet/ansible, GCO, Ansible,
- MQ series knowledge
- Microsoft Office – Excel, Outlook, PowerPoint, Visio & Word
- Experience in complex, high availability systems would be a plus
- Experience in capacity planning