About The Opportunity
Industry: Enterprise Cloud & Platform Engineering (Hybrid/OnPrem & Public Cloud). We operate at the intersection of platform reliability, container orchestration, and infrastructure automation to deliver resilient production platforms for missioncritical applications. This role is for an onsite engineering team in India focused on running and scaling OpenShift-based platforms.
Standardized Title: Site Reliability Engineer - OpenShift
Role & Responsibilities
- Operate and maintain production Red Hat OpenShift clusters: perform installs, upgrades, capacity planning, and operator lifecycle management.
- Design, implement and maintain Infrastructure as Code for platform components using Terraform and automation playbooks with Ansible.
- Build and maintain CI/CD integrations for platform delivery, secure image pipelines, and automated deployment workflows.
- Troubleshoot platform incidents, lead on-call rotations, conduct RCA, and implement mitigations to improve availability and reliability.
- Implement observability and alerting practices; create runbooks, health checks, and automation to reduce manual toil.
- Collaborate with application teams and platform engineers to harden security, network policies, and access controls for multi-tenant clusters.
Skills & Qualifications
Must-Have
- Proven experience operating Red Hat OpenShift and Kubernetes in production (cluster lifecycle management, Operators).
- Strong Linux systems administration and troubleshooting skills.
- Container tooling and image lifecycle management with Docker.
- Infrastructure-as-Code using Terraform (module design, state management).
- Automation and configuration management using Ansible; experience integrating with CI/CD tools (Jenkins/GitLab CI).
- Hands-on incident management, scripting for automation (Bash or Python), and willingness to work on-site in India.
Preferred
- Experience with Prometheus and Grafana for metrics, alerting, and dashboards.
- Familiarity with Helm charts, Operators, and service mesh technologies (Istio/Linkerd).
- Relevant certifications (Red Hat OpenShift, CKA/CKS, or similar).
Benefits & Culture Highlights
- Onsite, highimpact role with ownership of platform reliability and the opportunity to influence core infrastructure.
- Support for professional development, certification backing, and handson mentoring from experienced platform engineers.
- Collaborative, automation-first culture focused on measurable SLAs, observability, and continuous improvement.
Location: India (Onsite). Recruitment partner: Viraaj HR Solutions.
To apply, bring demonstrable OpenShift/Kubernetes platform experience, strong Linux and IaC skills, and a bias for automation and reliability engineering.
Skills: kubernetes,terraform,reliability,sre,openshift,linux,ansible,docker