Search by job, company or skills

A

Senior Site Reliability Engineer

6-11 Years
new job description bg glownew job description bg glownew job description bg svg
  • Posted 4 hours ago
  • Be among the first 10 applicants
Early Applicant
Quick Apply

Job Description

Key Responsibilities

Kubernetes & EKS Platform Engineering

  • Architect, deploy, and operate production-grade Kubernetes clusters on AWS EKS
  • Implement and manage EKS automation using EKS Blueprints and lifecycle management of add-ons
  • Plan and execute Kubernetes and EKS version upgrades with minimal service disruption

Autoscaling & Compute Optimization

  • Design and implement Karpenter-based autoscaling solutions for dynamic workload scaling
  • Optimize compute resources for cost efficiency, performance, and high availability

Service Mesh & Traffic Management

  • Design and operate Istio service mesh (including sidecar and ambient mesh models)
  • Implement advanced traffic management policies such as mTLS, retries, circuit breaking, and timeouts

Security, Policy & Runtime Protection

  • Implement Kubernetes governance using Kyverno and OPA/Gatekeeper
  • Operate Falco for runtime threat detection and security incident investigation
  • Integrate security and compliance controls into GitOps workflows

Infrastructure as Code & Automation

  • Build and maintain reusable Terraform modules for AWS infrastructure (VPC, EKS, Transit Gateway, etc.)
  • Implement Terragrunt-based multi-account and multi-region infrastructure setups
  • Drive automation to reduce manual operations and improve scalability

GitOps & Platform Operations

  • Design and manage Argo CD for GitOps-based deployment and platform operations
  • Define Git-based promotion workflows and access control models across environments

Observability & SRE Practices

  • Design and maintain monitoring and alerting systems using Prometheus
  • Participate in incident response, root cause analysis, and reliability engineering improvements
  • Reduce operational toil through automation and self-service capabilities

Security & Compliance

  • Own remediation of security findings from tools such as Wiz across AWS and Kubernetes environments
  • Collaborate with security teams to implement preventive security guardrails and best practices

More Info

Job Type:
Industry:
Function:
Employment Type:
Open to candidates from:
Indian

About Company

Apptad offers strategic consulting, enterprise information management and digital transformation services. With globally connected offices in US and India along with a team of trained and certified IT resources, Apptad ensures quick and effective delivery to its customers. Apptad is relentlessly reinventing the outlook of how companies leverage data.

With an effort to enable our customers the ability to solve biggest problems within their organization. We perceive our clients’ problems and respond with custom solutions instead of handing over boilerplate responses.

Job ID: 145579929