Boca Raton, FL, United States of America
13 hours ago
Site Reliability Engineer III

Are you looking for a Sr SRE role whereby you will be able to have a positive impact on society?

About the role: This position is responsible for ensuring the reliability, scalability, and performance of our cloud-based infrastructure, enabling seamless delivery of services to our customers. The ideal candidate will have strong expertise in Azure Cloud, Kubernetes, and modern DevOps practices, with a passion for automating processes and enhancing system resilience

About the team: We are seeking a highly skilled and motivated Site Reliability Engineer (SRE) to join our team and play a critical role in the success of a new project that will impact community safety and crime prevention. .

Key Responsibilities

Design, implement, and maintain highly available and scalable systems using Azure Cloud, Kubernetes, Docker, and Helm Charts.

Manage and optimize Infrastructure as Code (IaC) using Terraform and GitHub Actions with a focus on automation and operational efficiency.

Secure and manage secrets and sensitive data using HashiCorp Vault.

Implement and maintain CI/CD pipelines, enabling seamless deployment and rollback processes.

Monitor, troubleshoot, and improve system performance and reliability, utilizing best practices in observability and incident management.

Leverage Terraform Enterprise (TFE) to streamline IaC workflows and ensure compliance with organizational policies.

Collaborate with development teams to enhance tooling, reduce system bottlenecks, and align with SLOs, SLIs, and SLAs.

Create and manage GitOps workflows, leveraging tools like GitHub IaC and ArgoCD (preferred).

Monitor and analyze system health using the Grafana Stack (Prometheus, Loki, Mimir, Tempo) to ensure continuous improvement.

Required Qualifications

Strong experience with Azure Cloud infrastructure and services.

Proficiency in container orchestration using Kubernetes and Docker.

Expertise in managing infrastructure using Terraform, Helm Charts, and GitHub Actions.

Solid understanding of CI/CD pipelines and automation workflows.

Experience with HashiCorp Vault for secret management.

Familiarity with monitoring and logging tools, with hands-on experience in setting up dashboards and alerts.

Strong problem-solving skills and experience in incident management.

Why Join Us?

Work on a cutting-edge business case that leverages the latest technologies.

Collaborate with a talented and passionate team dedicated to innovation and excellence.

Enjoy opportunities for growth, learning, and career advancement in a supportive environment.

If you are a self-motivated engineer with a passion for reliability, automation, and scalability, we’d love to hear from you!

#LI-AK1

At LexisNexis Risk Solutions, having diverse employees with different perspectives is key to creating innovative new products for our global customers. We have 30 diversity employee networks globally and prioritize inclusive leadership and equitable processes as part of our culture. Our aim is for every employee to be the best version of themselves. We would actively welcome applications from candidates of diverse backgrounds and underrepresented groups. 

We are an equal opportunity employer: qualified applicants are considered for and treated during employment without regard to race, color, creed, religion, sex, national origin, citizenship status, disability status, protected veteran status, age, marital status, sexual orientation, gender identity, genetic information, or any other characteristic protected by law. We are committed to providing a fair and accessible hiring process. If you have a disability or other need that requires accommodation or adjustment, please let us know by completing our Applicant Request Support Form: https://forms.office.com/r/eVgFxjLmAK , or please contact 1-855-833-5120.

Please read our Candidate Privacy Policy.

Confirm your E-mail: Send Email