Search by job, company or skills

datum technologies group

Site Reliability Engineer

new job description bg glownew job description bg glownew job description bg svg
  • Posted 12 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Experience - 8+ years

Work Mode - Hybrid (2 WFO)

Work Location - Chennai, Mumbai and Gurugram

Key Responsibilities

  • Manage, maintain, and troubleshoot Linux-based systems and environments.
  • Design, develop, and maintain Infrastructure as Code using Terraform, including writing Terraform modules from scratch.
  • Deploy, configure, and manage Azure cloud infrastructure to support scalable and reliable applications.
  • Administer and maintain Kubernetes clusters, particularly Azure Kubernetes Service (AKS).
  • Perform Kubernetes cluster lifecycle management, including upgrades, scaling, monitoring, and troubleshooting.
  • Implement and maintain CI/CD pipelines, preferably using GitHub Actions.
  • Ensure system reliability, availability, and performance by following SRE and DevOps best practices.
  • Collaborate with development teams to improve deployment automation and infrastructure efficiency.
  • Monitor infrastructure and applications using monitoring tools and proactively resolve issues.

Mandatory Skills

  • Strong experience with Linux OS administration.
  • Hands-on experience with Terraform, including module creation and infrastructure automation.
  • Solid experience working with Microsoft Azure Cloud.
  • Strong expertise in Kubernetes cluster management, particularly Azure Kubernetes Service (AKS).
  • Experience with CI/CD tools, preferably GitHub Actions.

More Info

Job Type:
Industry:
Employment Type:

Job ID: 145771011

Similar Jobs