
Search by job, company or skills
Role Overview: We're seeking an experienced Site Reliability Engineer (SRE) to ensure our services are robust, scalable, secure, and maintainable. You will blend software engineering and systems operations to automate processes, monitor performance, lead incident response, and work closely with engineering teams to enhance service availability and reliability, while bringing efficiencies to operational processes. We're looking for proactive problem-solvers with strong technical and communication skills who can also effectively support and troubleshoot project operational issues.
Key Responsibilities
Required Qualifications
Desired Qualifications
#LI-NB1
Job ID: 145790273
Skills:
Containers, Devops, Kubernetes, Python, observability principles, SRE, FinOps, SRE best practices, Go, autoscaling Kubernetes clusters, securing systems in a public cloud environment, Production Engineering, deployment and monitoring of highly scalable products
Skills:
Scripting, Java, Prometheus, Node.js, Grafana, Datadog, Python, Kubernetes, AWS
Skills:
PowerShell, Bash, Jenkins, Gcp, Terraform, Docker, Azure, Kubernetes, Python, AWS, Go, Argo CD
Skills:
Prometheus, Zookeeper, Pulumi, Grafana, Kafka, Gitlab, Arm, Cloudformation, Cassandra, Terraform, Gcp, Elk Stack, Splunk, Artifactory, Bash Scripting, Elastic Search, AWS, Redis, Kubernetes, Python, Argo, Jenkins, MongoDB, Spinnaker, Google Deployment Manager, Packer
Skills:
Lambda, S3, Vpc, RDS, AWS, Cloudformation, Python, Bash, ECS, Iam, Terraform, Ec2, Devops Tools, Cloudwatch, EKS, SRE practices
We don’t charge any money for job offers