Reston, VA, United States
1 day ago
Site Reliability Developer 4

Customers rely on Oracle Cloud Infrastructure (OCI) to power their business as they tackle some of the world’s biggest challenges. We’re looking for Senior Principal Site Reliability Developers/Engineers who would be responsible for Advanced Operations (AO), critical issues of production environments, including systems and databases, supporting critical business operations, but most importantly be the Technical Advisor for Operations Strategy and direction. Will perform administration and analysis for multiple production environments and recommend new and novel solutions to improve availability, performance, and supportability. This is an opportunity to bring a combination of deep technical knowledge with administration/analysis knowledge of Oracle's Cloud Infrastructure to provide critical issue support to a wide range of complex production environment problems related to immense growth, scaling, using the cloud, extremely high performance, and high availability requirements. 

Responsibilities:

You will be working in the US Gov Ops organization as the senior most Technical Advisor overseeing both Advanced Operations (AO) and Point Operations (PO) in full stack ownership of a collection of services and/or technology areas, spanning three Realms and seven Regions.  Understand the end-to-end configuration, technical dependencies, and overall behavioral characteristics of production services. Responsible for the design and delivery of the critically important stack and strategy, with focus on security, resiliency, scale, and performance.  Authority for end-to-end performance and operability. Partner with service development teams, engage with senior leadership on vision/strategy/business objectives and finally defining and implementing improvements in service architecture.  Articulate technical characteristics of services and technology areas and guide Development Teams to engineer and add premier capabilities to the Oracle Cloud service portfolio. Understand and communicate the scale, capacity, security, performance attributes, and requirements of the service and technology stack. Demonstrate clear understanding of automation and orchestration principles. You will act as ultimate point for complex or critical issues that have not yet been documented as Standard Operating Procedures (SOPs). This role will allow you to apply a deep understanding of service topology and their dependencies required to solve issues and define mitigations. Understand and explain the effect of product architecture decisions on distributed systems. Professional curiosity and a desire to a develop deep understanding of services and technologies. 

Career Level - IC4

Confirm your E-mail: Send Email