- Cluster Management:
- Oversee daily operations of existing Kubernetes clusters in Azure Kubernetes Service (AKS)
- Perform routine maintenance, upgrades, and patching of clusters for high availability and security
- Support:
- Provide tier-2/3 support for AKS cluster-related issues
- Collaborate with development and operations teams to troubleshoot and resolve performance and deployment issues
- Design and Architecture:
- Design and implement scalable and secure AKS cluster architectures
- Develop best practices and guidelines for AKS cluster design and operation
- Automation and CI/CD Integration:
- Implement and maintain CI/CD pipelines using GitLab or Azure DevOps
- Automate repetitive tasks to improve efficiency
- Monitoring and Performance Optimization:
- Set up monitoring and alerting using Prometheus, Grafana, and Alertmanager
- Optimize cluster performance via resource management, scaling strategies, and capacity planning
- Security and Compliance:
- Implement security best practices like network policies, RBAC, and encryption
- Ensure compliance with industry standards and organizational policies
- Documentation and Knowledge Sharing:
- Maintain documentation for AKS configurations, processes, and troubleshooting
- Conduct training sessions and workshops for team knowledge sharing
- Innovation and Continuous Improvement:
- Stay updated with Kubernetes and cloud-native technologies
- Evaluate new tools and technologies to enhance AKS workflows
Skills and Qualifications:
- Bachelor's degree in Computer Science, Information Technology, or related field
- 5+ years of experience in managing Kubernetes clusters, including 3+ years with AKS
- Proficiency in Kubernetes, Helm, Docker, and AKS
- Experience with CI/CD tools and practices
- Strong understanding of networking, security, and Azure cloud infrastructure
- Excellent problem-solving and communication skills
- Ability to collaborate effectively with cross-functional teams