Search by job, company or skills

T

Site Reliability Engineer (SRE)

5-7 Years
SGD 1.08 - 1.56 LPA
Save
  • Posted 15 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Location: Central Region, Singapore
Salary: $9,000 - $13,000

About the Role

We are looking for an experienced Site Reliability Engineer (SRE) to help us build and run highly scalable, always-on cloud systems. You will be at the heart of ensuring our microservices platforms remain stable, secure, and high performing across multi-cloud environments.

This is a hands-on technical role where you will work closely with engineering and DevOps teams to improve system reliability, automate operations, and respond to production incidents.

Key Responsibilities

  • Run and maintain microservices on Kubernetes-based cloud platforms
  • Work with DevOps and development teams to deploy services across multi-cloud environments (AWS / Azure / GCP / OCI)
  • Build and improve system monitoring, alerting, and observability dashboards
  • Conduct load testing and chaos engineering to ensure system resilience
  • Define and track SLAs, SLOs, and SLIs to measure system performance
  • Troubleshoot production issues and perform root cause analysis
  • Design and implement disaster recovery plans
  • Write automation scripts (Python / Go / Bash / Java) to improve efficiency
  • Optimize system performance (CPU, memory, scaling, and scheduling issues)
  • Ensure systems meet security and compliance standards
  • Participate in on-call rotation for production support
  • Document architecture, processes, and operational runbooks
  • Mentor junior engineers and contribute to engineering best practices
  • Evaluate and implement new tools and technologies

Requirements

  • 5+ years of experience in SRE, DevOps, or Cloud Engineering roles
  • Strong experience with Kubernetes and containerized systems
  • Hands-on coding/scripting ability (Python, Go, Java, Bash, or PowerShell)
  • Experience with cloud platforms (AWS / Azure / GCP)
  • Strong understanding of cloud infrastructure, networking, and security
  • Experience with monitoring, logging, and observability tools
  • Strong troubleshooting and incident management skills
  • Comfortable working in fast-paced, high-availability environments

Interested applicants, please email your resume to [Confidential Information]

Tan Li Lian
EA 01C3135
Reg R1100465

More Info

Job Type:
Industry:
Employment Type:

Job ID: 149283809

Similar Jobs

Orchard Road, Singapore

Skills:

UnixElkPrometheusBashGrafanaDatadogIncident ResponseTerraformDockerLinuxAnsibleSplunkPythonKubernetesInfrastructure as CodeGocloud platformsMonitoringRoot Cause Analysisalertingautomation scriptsobservability solutions

Singapore, Alexandra Road

Skills:

ElkPrometheusBashGrafanaDistributed SystemsContainersScripting LanguagesautomationPythonKubernetesInfrastructure as Code toolsGocloud platformsmonitoring and observability toolsOpenTelemetryLinux systems

Singapore

Skills:

ElkPrometheusGrafanaDatadogJavascriptTerraformDockerPythonAWSSentryBashDevopsCloudwatchGcpLinuxAnsibleHelmAzureKubernetesGoOpenSearchInfrastructure EngineeringSite Reliability EngineeringPlatform Engineering

Remote, India

Skills:

GcpDatadogPrometheusAzureTerraformGrafanaJenkinsAnsibleGitHub ActionsAI-OpsGCP Operations SuiteAzure Monitor

Singapore

Skills:

Node.jsFastAPIPythonDockerFlaskGitReactGoogle Workspace Admin SDKGoZoom APIGAM