- Ensure a Cloud Ready approach is taken to service availability, reliability and performance
- Implement advanced monitoring configurations, including advanced analytics (AI/ML) in modern Observability platforms.
- Make suggestions to define and implement the Golden Signals across the entire technology stack in Financial Services
- Define and monitor SLI s to ensure the service is running to NCR s expectations
- Participate in Problem Management to help drive root cause analysis via Observability analytics
- System Admin like tasks on all Observability platforms.
- On call support (Tier 4) to manage and execute deep analysis and forensics as/if needed by SRE Operations.
- Execute on Problem Management, using modern tools for forensics and validation of root cause
- Software Development in terms of automating repeatable Operations tasks (TOIL)
- Provide data and trending for Capacity management for the cloud environments to ensure maximum availability and performance of services
- SRE Metrics Monitoring Strategy (SLI, SLO, etc.)
- Responsible for SRE Ops Guidelines across all Clouds to ensure consistency in approach, execution and reporting.
- Participate in all continuous improvement activities including Incident reviews, Change implementation reviews, TOIL automation candidate areas etc.
This position works closely with NCR Voyixs Global - SRE team that guide the overall SRE strategy and direction for NCR Voyix.
Basic Requirements
- B.E/B.Tech/M.E/M.Tech, MCA in Computer Science
- 8+ years of IT experience
- 5+ Years experience in ITIL Service Management processes and associated domain technologies
- 5+ years in defining, implementing and use in modern Observability platforms across APM, Log Mining, Event Correlation (Dynatrace, Moogsoft, BigPanda, Splunk, AppDynamics etc)
- Exposure/experience with SRE as a discipline
- 3+ years experience with Google and/or Azure cloud platforms and technologies (IaaS, SaaS and PaaS)
- 3+ years experience in software development (DevOps scripting /or Object Oriented)
- 5+ years datacenter technology, architecture, and operational experience
- Demonstrated history of innovative thinking and delivery, including disruptive innovation
- Highly organized
- Working knowledge of CI/CD pipelines
- Understanding of and experience with cloud native databases such In both GCP and Azure
- Excellent communication, meeting facilitation and listening skills
- Strong negotiation, team working and interpersonal skills
- Proficient with PowerPoint, Word and Excel
- Ability to travel both domestically and internationality if needed (note: should be minimal travel for this position)
Preferred Requirements
- Working knowledge of Terraform scripting to develop infrastructure as code
- Working knowledge of other cloud automation tools like Ansible, Rundeck, and Chef
- System Admin level experience in ServiceNow / Dynatrace / AppDynamics et
Role: System Security Engineer
Industry Type: IT Services & Consulting
Department: IT & Information Security
Employment Type: Full Time, Permanent
Role Category: IT Security
Education
UG: B.Tech/B.E. in Any Specialization
PG: M.Tech in Any Specialization, MCA in Computers