- Architect resilient and observable infrastructure on Azure using IAC principles while considering the tradeoffs between cost, performance, and reliability.
- Lead the design of platform-wide monitoring, alerting, and dashboard strategy.
- Oversee incident management and post-incident analysis at the org level.
- Guide DevSecOps strategy and secure automation across engineering teams.
- Implement FinOps practices by optimizing cloud spend and forecasting usage trends.
- Collaborate with global stakeholders to drive and document best practices, governance, and standards.
What will you need:
Required Qualifications:
- Bachelors or Masters degree in Computer Science, Software Engineering, or a related discipline along with 12+ years of professional experience with deep expertise in infrastructure automation and operations.
- Expertise with Infrastructure as Code (IaC) tools such as Terraform, Helm, and Ansible.
- Expertise in CI/CD using platforms such as Gitlab, GitHub, Azure DevOps, and containerization tools such as Docker and Kubernetes.
- Expertise with observability tools such as Grafana, Prometheus, and ELK Stack.
- Experience with FinOps and compliance frameworks such as SOC 2, ISO 27001, GDPR, HIPAA, and NIST 800-53.
Preferred Qualifications:
- Strong technical leadership skills to mentor and influence cross-functional teams.
- Strong ability to communicate across the team and with global stakeholders