
Search by job, company or skills
Job Title: Devops + Incident Management
Experience Range: 6 to 11 Years
Skills : Devops, Incident Management, L2 / L3 Support, Bash, CI/CD, Powershell, Fintech Exp.
Job Location: Bangalore
Notice - Immediate to 30 Days.
Email ID: [Confidential Information]
Your team
This role is part of our Service Operations function that ensures the stability, resilience,
and quality of IT services through strong operational governance and ITIL-aligned
practices. The team drives incident response, change control, and continuous
improvement to support reliable and scalable environments.
Your role in the Team's Success
Your primary responsibility will be to manage complex production and non-production
environments while ensuring alignment with operational standards and compliance
with internal controls. By identifying risks, addressing systemic issues, and enhancing
process efficiency, you will help maintain operational excellence and foster strong
collaboration across engineering, infrastructure, and governance teams. Your role is key
to driving service reliability, environment readiness, and continuous improvement.
What you'll do
Lead and coordinate the resolution of production incidents, including managing
major incident bridges and war rooms.
Ensure high availability and reliability across production and non-production
environments.
Execute deployments, monitor system health, and support release cutovers and
rollbacks.
Automate routine support tasks and environment checks using scripting and
DevOps tools.
Develop tools and scripts (e.g., Bash, Python, PowerShell) to improve system
resilience and reduce manual toil.
Collaborate with Dev, SRE, and infrastructure teams to enhance observability
and CI/CD workflows.
Partner with Change, Release, and Problem Management to drive governance
and cross-functional alignment.
Maintain documentation, produce incident trend reports, and lead knowledge-
sharing sessions.
What you'll need for this role
Key Qualification Requirements:
Strong understanding of automation, fault tolerance, self-healing systems
5+ years of experience in Devops.
Knowledge of Agile and Continuous Delivery practices
Ability to collaborate with developers, SREs, and infrastructure teams to shift-left
operational improvements
Experience in Incident, Problem, and Change Management workflows
Understanding of operational metrics, RCA/PIR processes, and governance
Ability to lead or support major incident calls and stability reviews
Job ID: 131892245