
PwC India

Site Reliability Engineer

3-5 Years

Job Description

Opportunity

We are looking for SREs who want to define what reliability means for the next generation of industrial software. The role involves defining SLIs/SLOs, building observability platforms, and establishing incident management processes.


Responsibilities

  • Define and implement SLI/SLO frameworks for complex engineering systems across manufacturing and industrial clients
  • Design and deploy observability platforms using Prometheus, Grafana, and Datadog
  • Establish incident management processes and lead blameless post-mortems
  • Implement chaos engineering practices to proactively identify system weaknesses
  • Drive toil elimination through automation and platform improvements
  • Build reliability engineering capabilities within the practice and client organisations

Essential Skills

  • SLI/SLO definition and implementation at enterprise scale
  • Observability: Prometheus, Grafana, Datadog, New Relic
  • Incident management and post-mortem facilitation
  • Chaos engineering: Gremlin, Chaos Monkey, Litmus
  • Python testing for reliability validation and automated runbooks
  • Automation and scripting: Python, Go, Bash
  • Cloud platforms: AWS, Azure, GCP

Experience

3+ years in SRE or Production Engineering roles with experience in enterprise or industrial environments


Job ID: 149061923
