HYDERABAD, TELANGANA, India
5 days ago
Principal Site Reliability Developer

SaaS Cloud CPQ is seeking a motivated Site Reliability Engineer that thrives in a fast-paced rapidly evolving technology environment. This individual will be a member of the CPQ System Administration team and focused on driving for those quality standards across all projects. The purpose of this position is to support build, operations, customer support, and DevOps within the organization. As part of the CPQ System Administration group, you will be instrumental in fostering a culture of SRE for horizontal activities and DevOps for products and tools across our global operations teams. The team you work in will have diverse expertise in systems, networking, and software development to provide the stability, performance and reliability our customers need. We work with multiple service development teams, identifying cross-team issues which create risk for operations across the organization and resolving those issues with a mixture of engineering, troubleshooting expertise, and general operational guidance. Your role also requires communication and organizational skills. You are an interface between DevOps Tools, application teams that implement OCI services. You will deliver the solutions that directly contribute to our customer's success. As a member of our global team, you will:

Deploy, operate and maintain large scale cloud build in a cloud native environment Improve our offerings through performance and reliability analysis Assist in building and maintaining Cl/CD pipeline Diagnose and resolve issues across cloud services such as database, network, compute, storage, and application services such as java, Weblogic, OHS (Apache), etc.. Participate in system design consulting, platform management, and capacity planning Anticipate the future and deliver those concepts to reality Participate in a global break-fix alert calls

Key qualifications of an ideal internal candidate:

Must Have:

Good Understanding of Cloud Infrastructure and Virtual Networking Experience working in closely held/confidential environments Experience in operating Cl/CD pipelines that build and deliver services on cloud A mind focused on systems reliability, automation, and improvement Experience with desktop support, VDI and troubleshoot issues with their workstations/laptops Motivation to collaborate with your local and global teams Experience with Linux 3-8 years' experience in Systems Engineering, DevOps or SRE roles supporting large scale infrastructure, cloud or web services

Nice to have:

Proficient with Git source code management (SCM) Oracle Database Administration experience OS image build for Linux, Windows and patch automation using Python, Terraform, Ansible, PowerShell Good understanding of Agile software development principles including using common tools such as JIRA Aptitude to be a good team player and the desire to learn and implement new Cloud technologies as needed Excellent organizational, verbal, and written communication skills Experience in compute, network, storage, database troubleshooting for improving capacity, reliability, scalability, availability Experience working with fault tolerant, highly available, high throughput, distributed, scalable systems A history of working with Cl/CD related systems (Kubernetes, Terraform, or similar)

Career Level - IC4

Confirm your E-mail: Send Email