Position Title : Cloud Operations Engineer ( Azure & Kubernetes )
Location : Navi Mumbai (Airoli)
Experience
5+ years overall in Cloud Operations, including :
- Minimum 5 years of hands-on experience with Microsoft Azure
- Minimum 3 years of experience in Kubernetes administration
Certifications : Azure Certification (Azure Administrator Associate, Azure Solutions Architect Expert, or equivalent) - Mandatory
Required Skills & Qualifications
- Azure Certification (Azure Administrator, Architect, or equivalent).
- Strong working knowledge of Azure services (VMs, Azure Kubernetes Service, Storage Accounts, Networking, IAM, Azure Monitor).
- Proficiency in Kubernetes administration (setup, scaling, upgrades, securing workloads).
- Experience with Infrastructure as Code tools (ARM Templates, Terraform, Bicep).
- Familiarity with containerization concepts and tools (Docker).
- Proficiency in monitoring and observability (Azure Monitor, Prometheus, Grafana).
- Solid understanding of incident management, change management, and operational excellence.
- Ability to work in 24x7 support environment with rotating shifts.
- Strong analytical and problem-solving skills.
Key Responsibilities
- Manage and monitor Azure infrastructure resources ensuring performance, availability, and security compliance.
- Administer Azure Kubernetes Service (AKS) clusters : provisioning, scaling, upgrades, patching, and troubleshooting.
- Implement and maintain automation for provisioning, configuration management, and monitoring (using ARM templates, Terraform, Bicep).
- Respond to incidents, perform root cause analysis, and resolve issues within defined SLAs.
- Configure and maintain logging, monitoring, and alerting solutions (Azure Monitor, Log Analytics, Application Insights).
- Support CI/CD workflows integrating Azure and Kubernetes deployments.
- Maintain detailed operational documentation, including configurations and runbooks.
- Collaborate closely with Development, Security, and Architecture teams to ensure adherence to best practices and compliance.
- Participate in an on-call rotation for incident response and critical issue remediation.
Preferred Skills (Nice To Have)
- Exposure to multi-cloud or hybrid environments.
- Scripting experience (PowerShell, Bash, Python).
- Experience integrating AKS with DevOps pipelines (Azure DevOps, GitHub Actions).
- Familiarity with compliance and security standards (ISO, SOC, CIS benchmarks).
(ref:hirist.tech)