5+ years of experience in Site Reliability Engineering (SRE), DevOps, or Infrastructure Engineering.
Strong experience with cloud platforms (AWS, Azure, or GCP).
Proficiency in Infrastructure as Code (IaC) using Terraform, CloudFormation, or Ansible.
Experience with containerization & orchestration using Docker & Kubernetes.
Hands-on experience with CI/CD tools (Jenkins, GitHub Actions, ArgoCD, GitLab CI) and DevOps practices.
Strong understanding of Linux systems administration & shell scripting.
Strong experience with monitoring and observability tools like New Relic, AppDynamics, Prometheus, Dynatrace, DataDog, Nagios. Familiarity with ELK, Instana, Lenses (observability & monitoring).
Strong troubleshooting & debugging skills for infrastructure and application issues.
Strong experience in scripting and automation (Python, Bash / Powershell / Shell, Go, Ruby, JSON, Java, PHP or Node.JS)
Knowledge of networking, security, and load balancing concepts.
Exposure to incident management & on-call rotations