Role Overview:
We are seeking a highly skilled and motivated Senior Compute Engineer to lead enterprise-scale infrastructure initiatives. This role is ideal for a seasoned professional with deep expertise in virtual server platforms, capacity management, workload migration from VMware, and automation development across large-scale environments (10,000+ servers). The ideal candidate will also guide junior engineers and drive operational excellence.
Key Responsibilities:
Virtual Server Infrastructure Leadership
- Design, deploy, and manage Virtual Server Clusters (Nutanix) across enterprise environments.
- Lead migration efforts from VMware to Nutanix, ensuring minimal disruption and optimal performance.
Capacity Management
- Monitor and forecast infrastructure capacity needs.
- Develop and maintain dashboards and reporting tools to track utilization and performance metrics.
Automation & Orchestration
- Build and maintain automation scripts and frameworks (e.g., using Terraform/Ansible) to streamline infrastructure provisioning, patching, and monitoring.
- Integrate Nutanix with enterprise automation platforms, observability toolsets, and CI/CD pipelines.
24x7 Operations & Incident Management
- Build and manage a team responsible for 24x7 operations including software patching, technology currency, and vulnerability management.
- Oversee incident response, root cause analysis, and resolution processes to ensure infrastructure resilience and compliance.
Proactive Observability
- Leverage enterprise observability toolsets to automate detection of performance issues and monitor the overall health of the compute environment.
Audit & Compliance Automation
- Support internal and external audits by developing and maintaining CIS benchmarks.
- Incorporate benchmark-driven changes into automation pipelines and deploy them at scale across the enterprise.
Mentorship & Collaboration
- Mentor junior engineers and foster a culture of continuous learning and technical excellence.
- Collaborate with cross-functional teams including network, security, and application teams to ensure seamless infrastructure operations.
Operational Excellence
- Ensure high availability, scalability, and security of Nutanix environments.
- Participate in incident response and root cause analysis for infrastructure-related issues.
Qualifications
- 7+ years of experience in IT infrastructure engineering, with at least 3 years focused on Virtual Server environments like VMware or Nutanix.
- Strong background in capacity planning and performance tuning.
- Proficiency in automation tools and scripting languages (Python, PowerShell, Ansible, Terraform).
- Experience working in a large-scale enterprise supporting environments with 10,000+ servers.
- Excellent communication and leadership skills.
- Ability to solve technical problems, guide junior teammates, and partner with US based engineering leads and management.
Preferred Certifications
- Proven experience migrating workloads from VMware to Nutanix.
- Nutanix Certified Professional (NCP) or higher
- VMware Certified Professional (VCP)
- Certified Kubernetes Administrator (CKA) or similar