Manila, Philippines
19 days ago
Staff (Lead) Site Reliability Engineer

Staff Site Reliability Engineer  

Join our SRE Team and Revolutionize Tricentis SaaS Products! 

  

The Site Reliability Engineer is a pivotal role in our SaaS strategy. You will work closely with our engineering team to ensure unrivaled observability, availability, and performance of Tricentis SaaS Products." 

  

As a Site Reliability Engineer (SRE), you'll be the driving force of our user-facing services and production systems. We're seeking individuals with pragmatic operational skills and software craftsmanship, applying engineering principles, operational discipline to elevate our operating environments and codebase to new heights. 

  

At the core of your responsibilities, you'll specialize in systems such as operating systems, storage subsystems, observability and networking while implementing best practices for availability, reliability, and scalability. But that's just the beginning of your thrilling journey with us!  

  

Your Impact as a Staff Lead SRE

Mentor and collaborate with team members and other staff to further develop a DevOps culture  

Proposes and drives platform architectural changes that affect our SaaS portfolio to solve scaling and performance problem 

Lead significant cross-team project work which has direct impact on company revenue  

Act as subject matter expert in observability, scalability and performance 

Runs RCAs and planning meetings to get meaningful work scheduled into the plan.  

  

As a valuable member of our SRE team, you'll have the opportunity to: 

 

Strive for automation either by coding it or by leading and influencing developers to build systems that are easy to run in production  

Contribute to the future roadmap of software development teams and establish strong operational readiness across teams 

Establish clear ongoing cloud efficiency metrics and multi-environment observability stack based on the existing SaaS system  

Plan for new service roll-outs, expansion and capacity management of existing services, and work with engineers to optimise their resource consumption 

Leading by example with positive and inclusive leadership and fostering constructive discussions between SRE and engineering 

  

Our Tech Stack

 

Terraform, Pulumi, GitHub Actions, Kubernetes, DataDog, Prometheus, Grafana, AWS, AZURE  

  

Our Culture 

We don't just preach our values; we embody them in everything we do. We are committed to creating an environment that empowers, supports, and includes individuals, where trust, transparency, creativity, curiosity, and continuous improvement thrive on a daily basis. 

 

About You

12+ years in SRE or similar roles, with a focus on tooling, automation and infrastructure on a major public cloud provider   

Led teams technically on architecture and system design  

Authoring and maintaining IaC with Terraform and using IaC to deploy resources in AWS, Azure  

Strong skills around observability, troubleshooting, and performance solutioning. 

Helm charts and deploying and maintaining Kubernetes clusters.  

Deep understanding/experience with: 

Cloud services (AWS or Microsoft Azure)  

Containers and Kubernetes  

SQL databases and designing schemas  

Distributed architecture and micro-service architecture 

 Able to create innovative solutions that push Tricentis technical abilities ahead of the curve  

Ability to develop other team members into senior levels 

Identify Service Level Indicators (SLIs) that align the team with availability and latency objectives. 

Be part of an on-call (PagerDuty) rotation to respond swiftly to incidents affecting availability, offering support to product engineers during customer incidents 

 

 

If you're ready to make a lasting impact as a Site Reliability Engineer and be at the forefront of revolutionizing Tricentis SaaS Products, don't miss this.  

At Tricentis, we strive for success while inspiring those around us by knowing what we need to achieve and how we’ll achieve it. Our core values serve as our guiding light to drive our every action and define our ways of working so that we can create and enjoy a successful journey and reach higher heights together.

Demonstrate Self-Awareness: Own your strengths and limitations.Finish What We Start: Do what we say we are going to do.Move Fast: Create momentum and efficiency.Run Towards Change: Challenge the status quo.Serve Our Customers & Communities: Create a positive experience with each interaction.Solve Problems Together: We win or lose as one team.Think Big & Believe: Set extraordinary goals and believe you can achieve them.

Tricentis is proud to be an equal opportunity workplace. Qualified applicants will receive consideration for employment without regard to race, colour, ethnicity, gender, religious affiliation, age, sexual orientation, socioeconomic status, or physical and mental disability and other statuses protected by law.    

Confirm your E-mail: Send Email