Staff Senior Reliability/Automation Engineer - Infrastructure (Hyderabad, India)
The Staff Senior Reliability/Automation Engineer - Infrastructure is responsible for building and scaling the automation and infrastructure-as-code (IaC) foundations for global compute, storage and backup platforms in the data center. This role acts as a subject-matter expert (SME) and drives repeatable, scalable, version-controlled infrastructure provisioning that improves reliability, speed and consistency. This role collaborates with infrastructure engineering, operations and security teams to embed automation into provisioning, configuration and lifecycle management.
Responsibilities:
- Define and drive infrastructure automation strategy for compute, storage and backup platforms.
- Design and implement IaC frameworks to standardize provisioning and configuration.
- Lead the infrastructure integration automation strategy and plans for compute, storage and backup platforms in mergers and acquisitions (M&As).
- Provide technical leadership for and execute M&A automation to enable seamless integration and migration to data centers, driving cross-functional technical architectures in collaboration with security, DevOps and business stakeholders.
- Establish and govern version-controlled infrastructure code repositories and branching practices.
- Develop and maintain IaC templates and modules for compute, storage and backup.
- Automate provisioning, configuration and operational maintenance tasks to reduce manual work and errors, taking technical ownership and leadership of this work. Deliver on commitments (Say-Do).
- Partner with infrastructure engineering and operations teams to ensure architectural alignment and reusability while embedding reliability, resiliency and compliance standards.
- Drive adoption by partnering closely with operations teams, and measure and report automation outcomes.
- Lead and mentor automation engineers and act as the technical SME for automation.
Qualifications:
- BS in Computer Science or a related field, with strong professional experience in infrastructure engineering and automation engineering roles.
- Strong hands-on experience with infrastructure automation and IaC for data centers.
- Expertise with IaC and automation tools such as Terraform, Ansible, PowerShell, Python or similar.
- Hands-on experience establishing version control systems (Git/GitLab), CI/CD (such as GitLab CI or Jenkins), artifact and state management, secrets management (Vault/CyberArk) and orchestration tools (Spinnaker etc.).
- Good knowledge of Rundeck.
- Solid understanding of compute platforms (Windows and/or Linux OS, VMware, Nutanix and Citrix), storage platforms (Dell/EMC PowerStore, Pure, XtremIO, Unity, VNX etc.) and backup/recovery technologies (Veritas, Cohesity, Commvault, Veeam, Rubrik etc.).
- People management and project management experience are required.