Search by job, company or skills

A

Site Reliability Engineer

new job description bg glownew job description bg glownew job description bg svg
  • Posted 11 hours ago
  • Be among the first 10 applicants
Early Applicant

Job Description

Summary

Imagine what we could do together. At Apple, new ideas have a way of becoming excellent products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. The people here at Apple don't just build products they craft the kind of wonder that's revolutionized entire industries. It's the diversity of those people and their ideas that encourages the innovation that runs through everything we do, from amazing technology to industry-leading environmental efforts.

Apple's B2B team manages critical integrations with Apple's supply chain partners such as manufacturers, logistics providers, banks, resellers and business customers. We are seeking a technically hands on individual with a real passion for programming and automation.

Join our dynamic team as a Software Reliability Engineer (SRE) and dive into innovative work culture fueled by machine learning, anomaly detection and threat detection. Collaborate with a highly motivated team of professionals who push boundaries and delivering exceptional results. This position offers an exciting opportunity to build your career as an SRE in a supportive environment, where continuous learning and professional development are prioritized.

Description

As an SRE at Apple, you will be part of a team who will implement and maintain best-in-class devops practices, work on complex technical challenges related to scalability, reliability and performance of Apple B2B systems. You will be managing the lifecycle of machine learning models in production and non-production environment. You will be responsible for continuously assessing and improving system processes, detecting anomalies, identify the areas of optimization and implementing solutions to enhance system reliability and performance. You should have a passion for programming and a good conceptual understanding of the operating environment - JVM, Operating System, File Systems, Network Protocols. Technical expertise, strong communication skills and teamwork are essential requirements for this role as it involves working with both technical and non-technical groups within Apple and externally with our supply chain partners.

We are looking for a Senior DevOps Engineer who can design, build, and scale a modern DevOps and SRE ecosystem from scratch.

This role requires deep hands-on expertise, strong architectural thinking, and the ability to establish GitOps-driven, cloud-native CI/CD platforms using the latest technologies.

The ideal candidate will act as a foundational engineer and technical leader, defining standards, tooling, automation, and reliability practices across the organization.

Minimum Qualifications

  • At least 5 years of prior demonstrated experience in a Site Reliability Engineering, DevOps(Must), or an Infrastructure-focused role.
  • Designing and Building DevOps platforms end-to-end alongwith SRE/Platform Engineering.
  • Proven experience in building DevOps platforms from scratch.
  • Applied Experience on GitOps-based deployment models (ArgoCD / Flux)
  • Establish Infrastructure as Code (IaC) practices.
  • Build and operate Kubernetes platforms (EKS / AKS / GKE / OpenShift)
  • Experience working in large-scale, distributed systems
  • Strong problem-solving and architectural skills
  • Proficiency in one or more programming languages (eg. Python)
  • Support of internet-facing production services and distributed systems via deployments, onCall and Incident Management. Lead incident response, RCA, and reliability improvements.
  • Proficiency in implementing and coordinating telemetry using monitoring and observability tools like Splunk, Grafana, and Prometheus, or similar.
  • Experience in solving and resolving issues in Kubernetes from both an operating system and application perspective.
  • Building and operating container orchestrating systems like Kubernetes or EKS.
  • Strong programming experience in Java building web, middleware or backend applications.
  • Deep understanding of Oracle or similar relational databases and NoSQL databases such as MongoDB.
  • Firsthand experience in performance tuning of applications and databases.
  • Knowledge of HTTP/S, TCP, DNS, web application load balancing.
  • Deep understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, PKI, X509 certificates and PGP.

Preferred Qualifications

  • Strong programming experience in Java for backend, middleware, or web applications
  • Experience with NoSQL databases (MongoDB, Cassandra, DynamoDB, etc.)
  • Deep understanding of relational databases (Oracle, PostgreSQL, MySQL, etc.)
  • Hands-on experience in performance tuning of applications and databases
  • Experience with advanced observability practices:
  • * Distributed tracing
  • * SLO/SLI design
  • * Error budgets
  • Prior experience in large-scale, highly distributed production environments
  • Experience with container orchestration internals (scheduler, CNI, CSI, etc.)
  • Knowledge of middleware platforms such as WebMethods Integration Server or similar.
  • Experience with multi-cloud or hybrid cloud environments
  • Familiarity with service mesh technologies (Istio, Linkerd, etc.)

More Info

Job Type:
Industry:
Employment Type:

About Company

Job ID: 142204917

Similar Jobs

(estd)