Role Overview Looking for a DevOps Engineer to lead the technical deployment and operational management of high-value, project-specific on-premises environments. Unlike our central platform roles, this position is dedicated to successfully onboarding and maintaining complex, non-cloud-native installations—specifically those running on Nutanix Hyper-converged Infrastructure (HCI) and Open-Source/Vanilla Kubernetes distributions. You will be the primary technical owner for these environments, ensuring they meet enterprise stability and security standards while adhering to the core architectural guidelines.
Location & Reporting
- Location: Bangalore, India
- Reporting: Reports to the Project Delivery Head with a dotted line to the Head of CloudOps for architectural alignment.
- Scope: Dedicated project-level support for complex on-premises customer installations.
Key Responsibilities
- Environment Ownership: Lead the end-to-end setup, configuration, and day-to-day management of project-specific on-premises infrastructure.
- HCI & Virtualization Management: Manage production workloads on Nutanix (Prism/AHV), ensuring optimal resource allocation, hardware-level integration, and high availability.
- Open-Source Kubernetes: Design and maintain Vanilla Kubernetes clusters (e.g., Kubeadm, RKE, or K3s) in environments where managed cloud services are unavailable.
- On-Prem Networking & Storage: Configure and troubleshoot on-premise networking (e.g., Load Balancers like MetalLB) and persistent storage integration (CSI) within local data centers.
- Deployment & Maintenance: Execute platform upgrades, patches, and infrastructure changes specifically for these non-standard stacks, following strict risk assessment and rollback strategies.
- Architectural Alignment: Work closely with the central CloudOps team to ensure project-specific configurations remain as aligned as possible with global platform standards to prevent unmanaged technical debt.
- Security & Compliance: Enforce security controls aligned with ISO 27001 and SOC 2, including IAM, RBAC, and network security (WAF/NSG) tailored for on-premises constraints.
Technical Skills & Experience
- Kubernetes Mastery: Deep expertise in managing open-source/vanilla Kubernetes distributions on bare-metal or virtualized on-premises hardware.
- Infrastructure Management: Proven experience with Nutanix HCI and traditional virtualization technologies.
- On-Prem Stack: Proficiency in managing on-premises storage, local networking, and private registry management.
- Infrastructure as Code: Hands-on experience with Terraform, Ansible, or Bicep for automating local resource provisioning.
- Observability: Ability to set up and manage monitoring and logging (e.g., Prometheus, Grafana, ELK) within isolated or air-gapped environments.
- Databases: Experience managing on-premise PostgreSQL instances, including backup/restore and replication.
Behavioral Expectations
- Decisiveness Under Pressure: Ability to remain calm and effective while troubleshooting high-severity incidents in complex, non-standard environments.
- Solution-Oriented: A proactive approach to solving hardware-software integration challenges that are unique to on-premises data centers.
- Stakeholder Management: Ability to manage technical expectations for project managers and customer delivery heads.
Qualifications
- 5+ years of experience in DevOps, System Administration, or Platform Operations.
- Extensive experience with on-premises enterprise data center operations.
- Background in mission-critical sectors such as Banking or Insurance is highly preferred.