Search by job, company or skills

F

Senior Site Reliability Engineer

Save
new job description bg glownew job description bg glownew job description bg svg
  • Posted 17 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Job Description: Site Reliability Engineering (SRE) Manager

Role Overview:

We are looking for an experienced SRE Manager to lead our Site Reliability Engineering team. The ideal candidate will have a strong background in DevOps practices, system reliability, and team leadership.

Key Responsibilities:

- Lead, mentor, and manage a team of SRE/DevOps engineers

- Define and implement SRE best practices (SLIs, SLOs, error budgets)

- Ensure system reliability, scalability, and performance

- Drive automation initiatives

- Collaborate with cross-functional teams

- Own CI/CD pipelines and release management

- Lead incident response and RCA processes

- Establish monitoring and observability frameworks

- Manage cloud infrastructure (AWS/Azure/GCP)

- Implement disaster recovery plans

Required Skills & Qualifications:

- 7+ years of experience in SRE/DevOps roles

- 3+ years of team management experience

- Experience with cloud platforms (AWS/Azure/GCP)

- Knowledge of CI/CD tools (Jenkins, GitLab CI)

- Experience with Docker and Kubernetes

- Scripting skills (Python, Bash)

- Knowledge of Terraform/CloudFormation

- Monitoring tools (Prometheus, Grafana, ELK)

Preferred Qualifications:

- Experience with microservices

- Cloud certifications are a plus

- Strong problem-solving skills

Key Competencies:

- Leadership

- Communication

- Ownership

- Stakeholder management

Good to Have:

- Experience in e-commerce platforms

- Knowledge of chaos engineering

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 147206893

Similar Jobs

Gurugram

Skills:

DevopsCloud InfrastructureAWSGcpPythonSite Reliability Engineering

Gurugram, Gurugram, India

Skills:

ElkCloudformationPrometheusBashGrafanaJenkinsGcpDockerTerraformAzureKubernetesPythonAWSGitLab CI

Noida, India

Skills:

RustPrometheusCDKPulumiGrafanaDatadogNew RelicDevopsTypescriptJavascriptGcpTerraformAzurePythonKubernetesAWSGroundcoverZipkinGitOpsSREGoJaegerOpenTelemetry

Gurugram

Skills:

containerization configuration managementMonitoring ToolsScriptingAWSLinux/Unix administration

Noida, India

Skills:

Unix AdministrationCassandraPostgreSQLBashDevopsJenkinsGcpLinuxDockerECSMongoDBPuppetKubernetesPythonAWSNoSQL databasesChefEKSbasic networking conceptsCI CD pipelinesSite Reliability Engineering