
Search by job, company or skills
Our Team - Building off our Cloud momentum, Oracle has formed a new organization - Oracle Health Data,Analytics Platform.This team will focus on product development and product strategy for Oracle Health, while building out a complete platform supporting modernized, automated healthcare. This is a net new line of business, constructed with an entrepreneurial spirit that promotes an energetic and creative environment. We are unencumbered and will need your contribution to make it a world class engineering center with the focus on excellence.
Oracle Health Data, Analytics Platform has a rare opportunity to play a critical role in how Oracle Health products impact and disrupt the healthcare industry by transforming how healthcare and technologyintersect.
You will have the opportunity to:
About The Job: A unique opportunity to join a rapidly growing world-class team to engineer cutting edge Oracle Cloud technologies and infrastructure that make up the Oracle Cloud solutions. As part of the SREteam, you will be continually challenged and have an opportunity to contribute to the Oracle Cloud success every day, working closely with the development partners.
As a Site Reliability Engineer, you will solve interesting technical challenges by defining, designing, deploying, and troubleshooting key Oracle Cloud services, platforms, and infrastructure, always thinking about reliability, scalability, resilience, security, and performance.
The ideal candidate for this engaging and visible technical leadership role would have the experience of a developer, the wits of a systems and infrastructure whiz, and the courage of a spirited closer. All these qualities bundled up in an affable communicator in order to make our Oracle Cloud customers successful.
Responsibilities: Our team works within the Oracle Health Data, AnalyticsPlatform with a core focus on cloud operations. We are responsible for building and maintaining the cloud computing services and solutions that help us operate with greater efficiency, security, and attention to detail.As a member of the team, you will be surrounded by bright and innovative minds thriving in a collaborative environment supporting infrastructure and applications. We operate the Big-Data infrastructure portion of one of our core products, Oracle Health Data Intelligence.We empower our team to make advancements to be more efficient and productive in their day-to-day operations leading to the delivery of superior external customer product availability and support experience.
Required Skills:
5+ years of experience building and managing large-scale distributed systems on Cloud platforms (Oracle Cloud, AWS, or Azure).
Experience in large-scale Big Data infrastructure, including hands-on experience managing and scaling Hadoop, HBase, Kafka, and distributed storage systems.
Advanced expertise in Kubernetes and container management technologies, including hands-on experience with orchestration, scaling, and container networking.
Proficiency in Infrastructure as Code (IaC) using Terraform and configuration management via Ansible.
Deep knowledge of Linux internals, including performance tuning, kernel behaviour, and system-level troubleshooting.
Strong automation mindset with proficiency in programming languages such as Python, Go, or Shell to eliminate manual toil.
Extensive experience with Networking and TCP/IP standards, including L7 routing, Load Balancers, DNS, HTTP, and CDN.
Advanced troubleshooting skills with a methodical approach to resolving complex, cross-stack issues and performing Root Cause Analysis (RCA).
Hands-on experience with Observability and Instrumentation tools such as Zabbix, Splunk, Prometheus, Grafana, or New Relic to monitor system health and performance.
Experience managing high-performance databases infrastructure including installation, performance tuning, and scaling in a cloud environment.
Understanding of Cloud Security and compliance, including IAM, secret management, and network security policies.
Understanding of Service Level Objectives (SLOs) and SLIs to measure and maintain service and infrastructure reliability.
Strong systems architecture skills, with the ability to define, document, and implement innovative solution methodologies for cloud migration at scale.
Excellent technical communication skills, including the ability to author design specifications and architecture diagrams.
Collaborative team player with the ability to work alongside development partners in fast-paced environments and a willingness to learn and improve system reliability.
Proven experience participating in 12/7 on-call rotations to maintain service availability and respond to production incidents.
Career Level - IC3
Oracle Corporation is an American multinational computer technology corporation headquartered in Austin, Texas.In 2020, Oracle was the second-largest software company in the world by revenue and market capitalization.The company sells database software and technology (particularly its own brands), cloud engineered systems, and enterprise software products, such as enterprise resource planning (ERP) software, human capital management (HCM) software, customer relationship management (CRM) software (also known as customer experience), enterprise performance management (EPM) software, and supply chain management (SCM) software.
Job ID: 138403443