Senior Site Reliability Engineer

Headout

Bengaluru, India

2-5 Years

Save

Posted a month ago
Be among the first 30 applicants

Early Applicant

Job Description

The Role

As a Senior Site Reliability Engineer, you will own and operate cloud-native infrastructure and Kubernetes platforms that power customer-facing services at scale. You will design and optimize CI/CD workflows, improve deployment reliability, and drive observability, incident management, and performance improvements across the organization. You will build platform tooling to improve developer velocity, enforce security guardrails, and standardize best practices. This role expects strong ownership, architectural thinking, and mentorship of junior engineers.

What makes this role special

Full Platform Exposure – Work across DevOps, infrastructure, observability, performance, and reliability
Architecture Ownership – Influence platform and tooling decisions using benchmarks and metrics
High Impact – Build systems that reduce deployment TAT, improve p99s, and scale across teams
Flexibility – Freedom to work across stacks, tools, and evolving platforms

What skills & experience do you nee

4-7 years of experience operating customer-facing services at scale
Strong hands-on experience with Kubernetes cluster operations and workload optimization
Experience with service mesh and distributed tracing tools (e.g., Istio, Jaeger)
Comfortable with at least one cloud provider (AWS preferred; GCP or Azure acceptable)
Hands-on experience with monitoring and alerting stacks (Prometheus, Grafana, Thanos, Datadog, New Relic)
Proven experience designing robust CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins)
Proficiency in Infrastructure as Code (Terraform or Pulumi)
Strong programming skills in Python, Go, or Java/Kotlin, plus shell scripting
Experience with databases such as MySQL and MongoDB, including application and query profiling
Solid understanding of security best practices and compliance
High-ownership mindset with the ability to proactively identify and resolve platform issues

More Info

Job Type:

Permanent Job

Industry:

Other

Function:

Site Reliability Engineering

Employment Type:

Full time

About Company

HeadoutJob Source: www.linkedin.com

Job ID: 144677675

Jobs by Skill - IT

Jobs by Skill - Non IT

International Jobs

Last Updated: 07-07-2026 07:49:44 AM

Homejobs in Bengaluru / BangaloreSenior Site Reliability Engineer

Similar Jobs

Senior Site Reliability Engineer- Observability

Okta

5-7 yrs

Bengaluru, India

Skills:

Cortex, Prometheus, Grafana, Gcp, Terraform, Splunk, Python, Kubernetes, AWS, Loki, Go, Mimir, OpenTelemetry

Senior Site Reliability Engineer

BlackDuck

6-8 yrs

Bengaluru, India

Skills:

Github, Elk, Prometheus, Grafana, Datadog, Shell, Terraform, Docker, Gitlab, Python, AWS, New Relic, Jenkins, Git, Gcp, Perl, Helm, Azure, Kubernetes, Go, Harness, GitHub Actions, Loki, GitLab CI, ArgoCD

Senior Site Reliability Engineer

josys

5-7 yrs

Bengaluru, India

Skills:

Elk, Prometheus, Slas, Networking, Dns, Grafana, Cdn, Graylog, Python, AWS, Performance Tuning, Bash, Devops, High Availability, Gcp, Load Balancing, Azure, Kubernetes, SLIs, Go, Disaster Recovery, observability tools, Security, OpenTelemetry, Infrastructure Engineering, Site Reliability Engineering, log management tools, reliability metrics, SLOs, container orchestration, incident management frameworks

Senior Site Reliability Engineer - SRE, Multi Cloud, Exp: 7-12 Yrs

Cisco DevNet

7-12 yrs

Bengaluru, India

Skills:

Networking, Prometheus, Memory Management, Grafana, Linux Internals, Gcp, Terraform, Ansible, Azure, Kubernetes, Python, AWS, GKE, Filesystems, Go, AKS, Terragrunt, EKS, Thanos

Senior Site Reliability Engineer - SRE, Multi Cloud, Exp: 7-12 Yrs

CISCO Credit

7-12 yrs

Bengaluru, India

Skills:

Networking, Prometheus, Grafana, Gcp, Memory Management, Terraform, Ansible, Linux Internals, Azure, Python, Kubernetes, AWS, GKE, Filesystems, Go, AKS, Terragrunt, EKS, Thanos