United States
14 hours ago
Site Reliability Developer 5

Purpose: As a Site Reliability Engineer (SRE), you will be focused on improving service reliability, performance, and operability of Oracle Cloud SaaS Services. You will have your hand on the pulse of the service and will play a key role in operational excellence, contributing to future automation, tooling, and product improvements. Your role will drive improvements in availability and the customer experience, while reducing the cost of running SaaS.

Description: We are looking for a strong Site Reliability Engineer (SRE) Incident Commander who will help ensure the availability of our Cloud services 24x7x365. SRE Incident Commanders always have a pulse on Oracle’s SaaS services and hold themselves directly accountable for improving availability through timely mitigation, postmortem deep dives, and collaboration with engineering partners to improve telemetry and automation.

During Major Service Impacting Events, your goal is to reduce time to mitigate by ensuring the correct resources are brought to the discussion, evaluating and tracking the fastest path to resolution through deep technical troubleshooting, managing bridge participation across leadership levels so that communication remains efficient and progressive, and upholding a sense of urgency so that all participants remain cognizant of the impact to the customer and our commitment to service availability.

After Major Events, you will be expected to look for ways to optimize the service through automation, improved telemetry, and standard operating procedures, and to hold deep technical discussions with responsible service owners during postmortems to reduce the chance of recurrence and drive down incident rates.

You will leverage excellence in communication, technical/business analysis, problem solving, and attention to detail to methodically resolve issues. Technically, you will have advanced knowledge across the full stack of services (Network to Application), with expertise in specific subject matter areas where you can dig deep to mitigate the issue as quickly as possible.

You will help create a “Customer First” culture across all Oracle Cloud teams, focusing on “Up Time” and effective communications, both oral and written. SRE Incident Commanders will continually review and enhance systems, methods, and applications to enable the delivery of a positive customer experience for highly acclaimed hosted products such as Oracle Fusion, Oracle Service Cloud, Oracle CRM, and many more.

As a member of the Oracle Cloud Service Center team, you will be surrounded by “willing to help” individuals from a global team representing some of the brightest and most innovative minds in the industry. You will be a part of an organization that prides itself on providing training, empowerment, and career progression.

Career Level - IC5
