NTT DATA North America

Site Reliability Engineer (SRE)

5-7 Years
  • Posted 3 days ago

Job Description

Title: Site-Reliability Engineer

Seniority Level: Mid Level

Essential Responsibilities:

The Site Reliability Engineer at Resideo (formerly Honeywell Homes, NYSE: REZI) will own the cloud infrastructure, build an infrastructure-as-code environment, and monitor overall system and infrastructure health to help the company continue advancing its innovation. The successful candidate will have cloud and DevOps culture embedded in their work habits and will help drive these practices in the overall transformation of operational support.

Your Duties

  • Maintain public cloud infrastructure on at least one cloud platform (Azure or AWS).
  • Build and maintain cloud infrastructure automation (IaC) using Terraform, ARM templates, or similar.
  • Build and maintain IT automation using tools like Ansible or Chef, or manage complex container-based applications with Helm for Kubernetes.
  • Handle build, delivery, and deployment using modern technologies like Git, GitHub Actions, Jenkins, Ansible, Docker, Kubernetes, or similar.
  • Create and manage monitoring and alerting systems to meet SLAs.
  • Be part of an SRE team that provides 24x5 support in troubleshooting platform issues, and join an on-call rotation for weekend support.
  • Oversee all planned outages and assist with major upgrades to ensure minimal downtime.

YOU MUST HAVE:

  • Minimum of 5 years of working experience with at least one public cloud platform (Azure preferred but not required).
  • Minimum of 5 years of Windows/Linux experience.
  • Minimum of 2 years of experience with Terraform or other IaC platforms.
  • Strong knowledge of Elastic, Grafana, Prometheus, or other observability platforms (Datadog, Dynatrace, etc.).
  • Proven experience running and/or managing large IT platform services across multiple availability regions.
  • Experience with a container orchestration platform such as Docker, Kubernetes, or similar.
  • Strong English communication skills (written and oral).

WE VALUE:

  • Public Cloud (Azure or AWS) Certifications – Professional level preferred
  • Comfort with both Linux and Windows administration
  • Background with scripting technologies: PowerShell, Bash, or Python
  • Knowledge of monitoring and logging systems (e.g. ELK stack, Grafana, Icinga2 or similar).
  • The right candidate for this role is passionate about technology, collaborates with product owners and technical stakeholders, thrives under pressure, and is laser-focused on delivering exceptional results.

More Info

Job ID: 148086033

India

Skills:

ARM Templates, Prometheus, Grafana, Datadog, Jenkins, Git, Terraform, Docker, Ansible, Dynatrace, Azure, Helm, Kubernetes, AWS, Chef, Elastic, GitHub Actions