SquareOps

Site Reliability Engineer

  • Posted 9 hours ago

Job Description

Responsibilities

  • Monitor cloud infrastructure (AWS primary, GCP secondary) across multiple client environments simultaneously.
  • Respond to alerts across Slack, Google Chat, and MS Teams per defined SLAs.
  • Triage incidents following ITSM processes: log, classify, escalate, and close with documentation.
  • Execute runbooks for standard operational tasks and known issue patterns.
  • Perform clean shift handoffs with accurate status updates.
  • Coordinate with L2 engineers and client stakeholders during escalations.
  • Participate in change windows as an executor implementing pre-approved changes safely.
  • Handle client credentials, access, and operational data with strict infosec discipline.

Requirements

  • 3-5 years in cloud operations, infrastructure support, or managed services.
  • Linux fundamentals: process management, log navigation, and basic troubleshooting.
  • Networking basics: DNS, TCP/IP, HTTP/S, load balancers, VPCs.
  • Scripting in Bash and/or Python: operational scripts, not development-level.
  • Hands-on AWS: EC2, ECS, RDS, CloudWatch, IAM, and S3 at minimum.
  • Experience in an ITSM-governed environment: incident management, SLA adherence, and escalation paths.
  • Experience working in or alongside infosec-conscious environments, where credential hygiene, least-privilege access, and secure information handling are second nature.
  • Strong written and verbal English: direct client communication is part of the role.
  • Comfortable with rotational shifts, including nights and weekends.

Good To Have

  • Kubernetes or ECS hands-on exposure and familiarity with pods, services, and deployments.
  • GCP experience.
  • AWS Certified Cloud Practitioner or SysOps Administrator.
  • Observability tools: Grafana, Prometheus, Loki, and CloudWatch dashboards.
  • Prior experience in a multi-client MSP or shared services environment.
  • Basic CI/CD and containerisation awareness (Docker, EKS, ECS).
  • Exposure to fintech or healthcare client environments.

This job was posted by Nitin Yadav from SquareOps.

Job ID: 148355855
