K&K Global Talents is an international recruiting agency that has been providing technical resources globally since 1993. This position is with one of our clients in India who is actively hiring candidates to expand their teams.
Title: Site Reliability Engineer
Locations: Bhopal, Bangalore, Chennai, Gurugram, Hyderabad, Jaipur, Pune (Hybrid)
Employment Type: Full -Time Employment (FTE)
Mode of Operation: Hybrid
Notice Period: Immediate to 10 Days
Required Experience: 8+ Years
Roles & Responsibilities :
- Lead Dynatrace / Observability onboarding and monitoring strategy for enterprise applications.
- Design and implement resiliency and failover strategies to ensure high availability and disaster recovery readiness.
- Drive cloud cost optimization initiatives (FinOps) and improve infrastructure efficiency.
- Manage and support large-scale distributed systems with a focus on uptime, scalability, and performance.
- Work extensively on AWS cloud (other cloud platforms such as Azure or GCP are also acceptable).
- Administer and optimize container platforms such as ROSA / OpenShift / Kubernetes.
- Build and maintain automation frameworks for infrastructure provisioning, deployments, and monitoring.
- Continuously identify and suggest technology and domain improvements to enhance platform reliability and efficiency.
- Collaborate with development, architecture, and operations teams to embed SRE best practices.
- Provide technical leadership and solution design for reliability engineering initiatives