Experience: 8+ years
Work Mode: Hybrid
Work Location: Chennai, Mumbai, or Gurgaon
Job Description:
We are looking for an experienced Site Reliability Engineer (SRE) with strong expertise in Linux, Azure Cloud, Terraform, and Kubernetes. The ideal candidate has hands-on experience managing cloud infrastructure, automating deployments, and ensuring system reliability and scalability. Candidates with CKA, CKAD, or CKS certification are preferred.
Key Responsibilities:
- Manage, monitor, and maintain Linux-based systems in a cloud environment.
- Design, implement, and manage infrastructure on Microsoft Azure.
- Automate infrastructure provisioning using Terraform (Infrastructure as Code).
- Deploy, manage, and troubleshoot Kubernetes clusters and containerized applications.
- Ensure system reliability, performance, and scalability through effective monitoring and automation.
- Implement best practices for CI/CD pipelines, infrastructure automation, and cloud security.
- Troubleshoot production issues and perform root cause analysis.
- Collaborate with development teams to improve system reliability and deployment processes.