Site Reliability Engineer II – AWS, Incident Response, Automation, Observability

Jpmorgan & Co

Hyderabad, India

5-7 Years

This job is no longer accepting applications

Posted 14 days ago

Job Description

Job Description

Join a team where your SRE expertise drives critical application reliability and operational excellence. Grow your skills in a collaborative, innovative environment.

As a Site Reliability Engineer at JPMorgan Chase within the Chief technology Office team, you will manage and optimize production operations for critical applications. You will leverage your AWS and SRE skills to ensure service stability, performance, and resilience. You will collaborate with engineering and security teams to deliver secure, reliable solutions. Your contributions will help us maintain a robust and thriving operating environment.

Job Responsibilities

Manage and support production operations for critical applications, ensuring stability and predictable performance
Proactively monitor health signals, identify risks, and prevent incidents
Execute operational routines including release readiness, change coordination, and controlled rollouts
Lead or participate in incident triage, recovery, communications, and post-incident reviews with clear root cause analysis and follow-up actions
Drive problem management to eliminate repeat incidents
Build and maintain dashboards, alerts, and operational documentation for improved detection and diagnosis
Automate manual operational tasks and improve tooling using scripting or coding (Python, Bash, Go)
Define and track SLIs/SLOs, manage error budgets, and partner with development teams for reliability
Perform capacity planning, resilience testing, and performance tuning

Required Qualifications, Capabilities And Skills

Formal training or certification on security engineering concepts and 5+ years applied experience
Experience supporting critical application production environments with strong operational discipline
Strong troubleshooting skills across Linux, application behavior, and networking fundamentals
Hands-on experience operating and diagnosing issues in AWS environments
Solid working knowledge of AWS IAM and access control best practices
Experience with observability tools (monitoring, logging, alerting)
Automation mindset with scripting/coding capability (Python, Bash, Go) and familiarity with CI/CD practices
Clear communication during incidents and strong documentation habits

Preferred Qualifications, Capabilities And Skills

Experience with tracing tools for observability
Familiarity with resilience testing and performance tuning in cloud environments
Knowledge of operational security requirements and credential hygiene
Experience collaborating with platform and engineering teams

ABOUT US

More Info

Job Type:

Permanent Job

Industry:

Other

Function:

Site Reliability Engineering

Employment Type:

Full time

About Company

Jpmorgan & CoJob Source: www.linkedin.com

Job ID: 147611865

Jobs by Skill - IT

Jobs by Skill - Non IT

International Jobs

Last Updated: 23-05-2026 05:49:32 PM

Homejobs in Hyderabad / Secunderabad, TelanganaSite Reliability Engineer II – AWS, Incident Response, Automation, Observability

Similar Jobs

Site Reliability Engineer II – AWS, Incident Response, Automation, Observability

JP Morgan Chase & Co.

5-7 yrs

Hyderabad

Skills:

AWS, Bash, Python, Logging, Linux, Monitoring, alerting, observability tools, SRE, Go

QA Automation Engineer - Assistant Manager - Hyderabad

Deloitte

4-6 yrs

Hyderabad, India

Skills:

Rest Assured, Java, Defect Tracking, Test Management, Sql, Api Testing, automated test scripts, TestNG, JUnit, Selenium, Postman, Python, continuous integration tools

Automation Engineer (Data Technologies) - Emerging Lead

State Street Corporation

4-8 yrs

Hyderabad, India

Skills:

Ml, Java, Teamcity, Bash, Avro, Sql, Jenkins, Shell, Linux, Api Testing, Databricks, Python, GenAI, Parquet, delta lake, Ai

Automation Engineer (Data Technologies) - Emerging Lead

Currenex State Street Trust Company

4-8 yrs

Hyderabad, India

Skills:

Ml, Java, Teamcity, Bash, Avro, Sql, Jenkins, Shell, Linux, Api Testing, Databricks, Python, GenAI, Parquet, delta lake, Ai

SOC Analyst & Incident Response Lead

Avaya

5-7 yrs

Remote, India

Skills:

Security Controls, PowerShell, Operating Systems, Network Protocols, Python, forensic toolsets, Defender for Endpoint, Azure Sentinel, Microsoft Sentinel, network forensics, cloud environments

Do you want to see more relevant and perfect job for you?

Beware of Scammers

We don’t charge any money for job offers

What it feels like to have

48% more interview calls?

To get 5X more recruiter views on your profile

Real-time notifications

Discover new jobs, get recruiter notifications, track applications & more with the foundit App.

Scan to download foundit App