Remote US
106 days ago
Senior Staff Software Engineer (Reliability)

Join us to accelerate and ensure the resilience of Affirm’s ecosystem of honest financial products! We are looking for a Software Engineer to build and evangelize Reliability practices throughout Affirm’s Infrastructure team and beyond.

Reliability Engineering at Affirm is a small yet crucial team, building out the SRE Central Model that creates reliability best practices, tooling, frameworks, and then drives their adoption across Affirm. Our Vision is to enable all teams to “operate what they own” with excellence, while providing white-glove support for critical services that deliver our most impactful products to Affirm’s customers.

What You'll Do

In coordination with senior ICs and stakeholder teams, create and champion a long-term technical roadmap for the creation and adoption of reliability practices across Affirm Promote culture of ownership, curiosity, and data-driven decision making Elevate architecture, technical design, and code review with resiliency as a first class citizen, coaching team members on effective design and code reviews with the customer journey as the primary focus Influence and provide reliability guidance to Infrastructure teams to deliver solutions that improve customer experience Drive and simplify cross functional investigations around complex issues involving people, software, and systems Engage with product management to understand their needs, accelerate feature development velocity, and enable improved insights into their service while improving operational reliability Support the growth of the Infrastructure organization by hiring, coaching, and supporting senior
engineers in technical contributor roles Foster a culture of technical excellence, humility, constant improvement, and rigor within your team to enable them to undertake challenges across multiple technical domains Provide leadership in the implementation of incident management and reliability principles Focus on the human interaction with reliability systems to enable quicker incident resolution

What We Look For

10+ years of software development experience, including at least one of the following: Python,
Kotlin, Rust, Java, C++, GoLang Expertise in synthesizing complex technical requirements, designs, trade-offs, and capabilities
into clear decisions, and influence product direction Ability to communicate decisions and practices to the engineering organization effectively At least 5+ years of experience in at least two different SRE organizational structures At least 5+ years of experience of hands-on work in infrastructure and scaling distributed systems At least 5+ years of technical leadership on SWE and Reliability teams focussed on infrastructure,
reliability, and software engineering at scale Strong hands-on experience with k8s and AWS in a production environment Experience in encouraging a strong engineering culture and improving reliability in a growing
company, including the ability to work closely with other senior and staff engineers to drive
change Track record of successfully mentoring and developing technical leaders Deep knowledge of incident management, post-incident review, and incident analysis Expertise in developing and implementing effective Service Level Indicators (SLIs) and Service Level Objectives (SLOs)

 

Base Pay Grade - R

Equity Grade - 8

Employees new to Affirm typically come in at the start of the pay range. Affirm focuses on providing a simple and transparent pay structure which is based on a variety of factors, including location, experience and job-related skills.

Base pay is part of a total compensation package that may include equity rewards, monthly stipends for health, wellness and tech spending, and benefits (including 100% subsidized medical coverage, dental and vision for you and your dependents.)

CAN base pay range per year: $206,000 - $256,000 CAD

Location: Remote - Canada

#LI-Remote

Confirm your E-mail: Send Email