Site Reliability Engineering at Affirm is a small, yet crucial, team that helps our Engineering partners to “Operate What They Own” with excellence to protect their customers’ experience. SRE accomplishes this through defining frameworks and best practices for operating applications, building tooling, and providing training and consulting. Some of the many SRE responsibilities are:
Providing data and visibility to teams and leadership on application performance Guiding the development of SLOs Driving the Incident Management and Analysis process Steering the implementation of Change Management and Deployment practices Engaging in service and architectural conversations Recommending observability and alerting configurationsThe SRE team benefits from experience across many domains including:
infrastructure, platform, and distributed systems capacity management, load and chaos testing automation, observability, and configuration management development and product experienceThe SRE team is seeking motivated software engineers with the experience to build and expand reliability and resilience practices throughout Affirms Engineering organization and beyond.
What You'll Do You will be responsible for owning and delivering quarterly goals for your team, leading engineers on your team through ambiguity to solve open-ended problems, and ensuring that everyone is supported throughout delivery. You will support your peers and stakeholders in the product development lifecycle by collaborating with product management, design & analytics by participating in ideation, articulating technical constraints, and partnering on decisions that properly consider risks and trade-offs. You will proactively identify project, process, technology or business issues, advocate for them, and lead in solving them. You will support the operations and availability of your team’s artifacts by creating and monitoring metrics, escalating when needed, and supporting “keep the lights on” & on-call efforts. You will foster a culture of quality and ownership on your team by setting or improving code review and design standards for your team, and advocating for them beyond your team through your writing and tech talks. You will help develop talent on your team by providing feedback and guidance, and leading by example. What We Look For You have 4+ years of experience designing, developing and launching backend systems at scale using languages like Python or Kotlin. You have a track record of developing highly available distributed systems using technologies like AWS, MySQL and Kubernetes. You have 4+ years working in a Site Reliability or Production Engineering team You demonstrate curiosity with empathy, and strong opinions loosely held You have experience defining a technical plan for the delivery of a significant feature or system component with an elegant, simple and extensible design. You write high quality code that is easily understood and used by others. You are proficient at making significant changes in a large code base, and have developed a suite of tools and practices that enable you and your team to do so safely. Your experience demonstrates that you take ownership of your growth, proactively seeking feedback from your team, your manager, and your stakeholders. You have strong verbal and written communication skills that support effective collaboration with our global engineering team. This position requires either equivalent practical experience or a Bachelor’s degree in a related field.Base Pay Grade- N
Equity Grade- 8
Employees new to Affirm typically come in at the start of the pay range. Affirm focuses on providing a simple and transparent pay structure which is based on a variety of factors, including location, experience and job-related skills.
Base pay is part of a total compensation package that may include equity rewards, monthly stipends for health, wellness and tech spending, and benefits (including 100% subsidized medical coverage, dental and vision for you and your dependents.)
USA base pay range (CA, WA, NY, NJ, CT) per year: $190,000 - $240,000
USA base pay range (all other U.S. states) per year: $169,000 - $219,000
#LI-Remote