- Oversee and manage cloud operations to ensure seamless service delivery and optimized performance.
- Expertise in managing cloud infrastructure across major platforms (AWS, Azure, GCP).
- Proven experience in cloud operations, service management, and delivering high-quality cloud services at scale.
- Coordinate and collaborate with cross-functional teams to implement best practices in cloud operations.
- Manage incident response and problem resolution, and ensure effective root cause analysis.
- Implement cloud automation and orchestration processes to streamline operations and improve efficiency.
- Monitor cloud performance, security, and compliance, ensuring that SLAs and KPIs are consistently met.
- Lead and mentor cloud operations teams, fostering a culture of continuous improvement and innovation.
- Develop and maintain operational documentation, including runbooks, incident reports, and operational procedures.
- Familiarity with ITIL, DevOps, and Agile methodologies.
- Strong knowledge of cloud-native technologies, microservices, and containers (e.g., Kubernetes, Docker).
- Proficiency in scripting languages (e.g., Python, Bash) for automation and orchestration.
SECONDARY RESPONSIBILITIES
- Ensure that capacity planning and disaster recovery procedures are in place for cloud infrastructure.
- Conduct regular backups and failover testing to ensure business continuity.
- Maintain detailed documentation for cloud operations, configurations, and processes.
- Report on cloud usage, incidents, and performance to senior management.
- Stay up to date with the latest cloud technologies and trends.
- Recommend and implement new tools and technologies to improve cloud infrastructure and operations.
EDUCATIONAL REQUIREMENTS & CERTIFICATIONS
- 8 to 12 years of IT experience, including at least 5 years in cloud operations and infrastructure management.
- Bachelor's or Master's degree in Computer Science, Information Technology, or a related field.