Zenith Services Inc.

Site Reliability Engineer (SRE)

  • Posted a month ago

Job Description:

We are seeking an experienced Site Reliability Engineer (SRE) to ensure the reliability, scalability, and performance of our production systems. The role focuses on DevOps practices, automation, and cloud infrastructure to support highly available and resilient applications.

Key Responsibilities:

  • Design, build, and maintain CI/CD pipelines for reliable and frequent deployments
  • Manage containerized environments using Docker and Kubernetes
  • Monitor system health, performance, and availability using modern monitoring and alerting tools
  • Ensure high availability, scalability, and fault tolerance of cloud infrastructure
  • Automate operational tasks and improve system reliability through SRE best practices
  • Collaborate with development teams to improve deployment, observability, and incident response

Key Skills:

  • Strong experience in DevOps and SRE practices
  • Hands-on experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.)
  • Expertise in Docker and Kubernetes
  • Experience with monitoring/logging tools (Prometheus, Grafana, ELK, Datadog, etc.)
  • Solid knowledge of cloud platforms (AWS, Azure, or GCP)
  • Scripting skills (Python, Bash, or similar)

Job ID: 139971553