Search by job, company or skills

S

Lead Digital Engineer

Save
new job description bg glownew job description bg glownew job description bg svg
  • Posted an hour ago
  • Be among the first 10 applicants
Early Applicant

Job Description

About Sonata Software

In today's market, there is a unique duality in technology adoption. On one side, extreme focus on cost containment by clients, and on the other, deep motivation to modernize their Digital storefronts to attract more consumers and B2B customers.

As a leading Modernization Engineering company, we aim to deliver modernization-driven hypergrowth for our clients based on the deep differentiation we have created in Modernization Engineering, powered by our Lightening suite and 16-step Platformation™ playbook. In addition, we bring agility and systems thinking to accelerate time to market for our clients.

Headquartered in Bengaluru, India, Sonata has a strong global presence, including key regions US, UK, Europe, APAC, and ANZ. We are a trusted partner of world-leading companies in TMT (Telecom, Media, and Technology), Retail & CPG, Manufacturing, BFSI (Banking, Financial Services, and Insurance), and HLS (Healthcare and Lifesciences) space. Our bouquet of Modernization Engineering Services cuts across Cloud, Data, Dynamics, Contact Centers, and around newer technologies like Generative AI, MS Fabric, and other modernization platforms.

Job Description

Senior DevOps Engineer – EKS & AWS Platform

Client: Healthcare client

Location: Remote / Hybrid

Job Title: Senior DevOps Engineer

Experience: 10+ Years

Employment Type: Full-Time / Contract

Department: Platform Engineering

Role Overview

We are seeking a highly skilled Senior DevOps Engineer to take full ownership of our Amazon EKS-based infrastructure on AWS. This is a hands-on, high-impact role where you will design, build, automate, and operate production-grade Kubernetes environments. You will work closely with development teams, SREs, and security stakeholders to deliver a reliable, scalable, and secure cloud platform. The ideal candidate is a proven EKS expert who brings deep AWS knowledge, strong CI/CD experience with GitHub Actions, and a passion for infrastructure-as-code and automation.

Roles & Responsibilities

EKS Infrastructure Ownership

  • Own end-to-end design, provisioning, and management of Amazon EKS clusters using Terraform
  • Define and maintain node group strategies including managed node groups, Fargate profiles, and spot/on-demand mix for cost optimization.
  • Manage EKS upgrades, control plane configurations, and Kubernetes version lifecycle.
  • Implement cluster autoscaler and Karpenter for dynamic workload scaling.
  • Design multi-environment (Dev/Staging/Prod) EKS architectures with strong environment isolation.

CI/CD Pipeline Engineering

  • Design, build, and maintain CI/CD pipelines using GitHub Actions or Jenkins for automated build, test, and deployment workflows.
  • Implement deployment strategies including blue-green, canary, and rolling deployments to ensure zero-downtime releases.
  • Integrate pipeline quality gates with security scanning (SAST/DAST), container image scanning, and policy compliance checks.
  • Develop automated rollback mechanisms and deployment validation frameworks.
  • Standardize pipeline templates and reusable workflow libraries across engineering teams.

Infrastructure as Code (IaC)

  • Author and maintain Terraform modules for all AWS infrastructure — VPCs, EKS, IAM, S3, ECR, RDS, and more.
  • Enforce IaC standards, module versioning, and Terraform state management using remote backends (S3 + DynamoDB).
  • Implement drift detection mechanisms to continuously validate live infrastructure against IaC definitions.
  • Manage Helm chart development and lifecycle for microservices deployments on EKS.

Security & Compliance

  • Design and enforce least-privilege IAM policies, IRSA (IAM Roles for Service Accounts), and service mesh security policies.
  • Manage secrets using AWS Secrets Manager and Parameter Store, integrated with Kubernetes workloads.
  • Implement network security using VPC security groups, NACLs, and Kubernetes Network Policies.
  • Drive infrastructure security compliance, vulnerability remediation, and audit readiness.

Observability & Incident Response

  • Build and maintain observability stacks using Prometheus, Grafana, and OpenTelemetry for metrics, logs, and distributed tracing.
  • Define SLIs, SLOs, and alerting thresholds for production Kubernetes workloads.
  • Lead incident response, root cause analysis (RCA), and post-mortem processes for infrastructure events.
  • Implement auto-remediation for common failure patterns to improve MTTR.

Cost Optimization & Capacity Planning

  • Continuously analyze AWS spend and implement right-sizing, reserved instance, and savings plan strategies.
  • Build cost attribution frameworks with tagging standards and chargeback models.
  • Forecast capacity requirements based on business growth and workload patterns.

Collaboration & Mentorship

  • Serve as the primary DevOps point of contact for product engineering teams, guiding infrastructure design decisions.
  • Mentor junior and mid-level DevOps engineers, establishing best practices and runbook documentation.
  • Collaborate with security, compliance, and product teams to align infrastructure with business objectives.

Required Skills & Qualifications

  • 8+ years of overall IT/infrastructure experience with 7+ years in a senior DevOps or platform engineering role.
  • Deep hands-on expertise with Amazon EKS — cluster provisioning, node management, upgrades, networking, and add-ons.
  • Strong AWS proficiency: VPC, EC2, IAM, S3, ECR, RDS, Lambda, CloudWatch, Route 53, ALB/NLB.
  • Expert-level Terraform skills including module design, remote state, workspaces, and Terragrunt.
  • Proven experience with GitHub Actions for CI/CD automation.
  • Hands-on experience with any of the tools like Jenkins or equivalent deployment orchestration tools.
  • Proficiency in Helm for Kubernetes application packaging and lifecycle management.
  • Solid understanding of container security, Kubernetes RBAC, and AWS IAM best practices.
  • Experience with Prometheus, Grafana, and OpenTelemetry for production observability.

More Info

Job Type:
Industry:
Function:
Employment Type:

About Company

Job ID: 147131575

Similar Jobs