Mandatory Skills
Python, Terraform, Kubernetes, Ci/Cd, Site Reliability Engineering, AWS
Required Qualifications
- -6+ years of relevant experience in SRE, DevOps, or Platform Engineering
- Strong Python skills with experience building production-grade automation and tooling
- Strong programming experience in Python
- - Production experience with Kubernetes
- - Strong observability fundamentals
- - Experience with Helm, ArgoCD, Argo Rollout and Docker, Ansible, Packer
- - Experience with AWS cloud
- - Strong Linux and networking fundamentals
- - Familiarity with the SDLC
Reliability & Operations
- - Design, build, and maintain scalable platform services and internal developer tooling using Python
- - Develop automation frameworks, orchestration systems, and backend infrastructure services
- - Improve platform reliability, observability, deployment automation, and operational excellence
- - Drive infrastructure-as-code and cloud automation initiatives
- - Implement best practices around security, monitoring, testing, and production operations
- - Design, implement, and maintain highly available and resilient systems in Kubernetes-based environments
- - Contribute to architectural decisions for distributed systems and cloud-native applications
- - Mentor engineers and promote engineering excellence across the organization
- - Lead incident response, RCA, and postmortems
- - Drive reliability improvements through automation
Cloud & Platform Engineering
- - Build and manage infrastructure on AWS.
- - Operate Kubernetes clusters (EKS preferred)
- - Deploy services using Helm, ArgoCD and Argo rollout
- - Manage containerized workloads using Docker and containerd
- - Terraform
- - Python Programming
- - Ansible
- - Packer
- - GitHub Actions / Jenkins / ArgoCD
- - Prometheus / Grafana / Datadog/ Splunk
Automation & Tooling
- Strong Python skills with emphasis on reliability, automation, and observability tooling
- - Develop automation and tooling using Python
- - Create internal reliability and monitoring tools
- - Integrate CI/CD pipelines with observability and reliability checks
Collaboration & Leadership
- - Mentor junior engineers
- - Influence architecture decisions
- - Collaborate across engineering teams
Preferred Qualifications
- - Experience with tool chain development using Python
- - AWS Experience
- - Ansible, Packer
- - Multi-cluster or multi-region Kubernetes experience
- - Experience with Kubernetes package manager (helm) and deployment (ArgoCD / Argo Rollout)
- - Service mesh (Istio) and API gateway (Kong) experience
- - Infrastructure-as-Code (Terraform preferred)
- - Cloud cost optimization experience