
Job Overview
Location
Spain
Job Type
Full-time
Category
DevOps & SysAdmin
Date Posted
February 24, 2026
Full Job Description
đź“‹ Description
- • As a Senior Site Reliability Engineer on the Cloud Compute team at Affirm, you will be at the forefront of ensuring the stability, scalability, and reliability of our entire platform. This is a fully remote role based in Spain, offering a unique opportunity to contribute to the core infrastructure that powers Affirm's innovative credit solutions. Our team is the engine room, managing all of Affirm's Kubernetes clusters, and our mission is to provide a highly available and robust cloud environment that empowers all engineering teams to build and deploy solutions seamlessly and efficiently.
- • You will play a pivotal role in shaping the technical strategy for your team, focusing on year-long initiatives that directly impact business objectives. This involves not only defining the strategy but also driving its execution through critical, business-impacting projects. Your contributions will be essential in ensuring the long-term technical sustainability of our systems.
- • Collaboration is key in this role. You will work closely with product management, design, and analytics teams throughout the product development lifecycle. Your input will be crucial in ensuring that technical considerations, potential risks, and trade-offs are thoroughly understood and effectively managed, leading to more robust and well-architected solutions.
- • You will act as a force multiplier for your team by defining, advocating for, and implementing best-in-class technical solutions and operational processes. This includes championing automation, improving observability, and enhancing the overall reliability of our critical infrastructure.
- • Taking ownership of your team’s operations and availability is paramount. This involves establishing and maintaining comprehensive monitoring systems, defining clear triage rotations, developing detailed playbooks, implementing effective policies, and ensuring rigorous testing and alerting mechanisms are in place to support both day-to-day operations and on-call responsibilities.
- • You will foster a strong culture of quality and ownership within your team. This includes setting high standards for code reviews and design processes, and actively promoting these standards beyond your immediate team through written documentation and technical presentations.
- • Developing talent is a significant aspect of this senior role. You will provide constructive feedback and guidance to team members, leading by example and helping them grow their skills and careers.
- • This role requires participation in an on-call rotation, ensuring the continuous availability and reliability of our production systems. You will be instrumental in responding to incidents, performing root cause analysis, and implementing preventative measures.
- • You will be responsible for the development and deployment of highly available distributed systems, leveraging technologies such as AWS and Kubernetes. Your expertise will be critical in maintaining and enhancing the performance and resilience of our cloud infrastructure.
- • You will apply your excellent troubleshooting skills to diagnose and resolve complex issues within our cloud and distributed systems, minimizing downtime and impact on users.
- • A significant part of your work will involve automating deployments and operational tasks using Kubernetes. You will identify opportunities for automation to improve efficiency, reduce manual effort, and enhance system reliability.
- • You will have the opportunity to deliver major features, refactor system components, or deprecate existing functionality by defining and executing comprehensive technical plans. Your ability to write high-quality, understandable, and reusable code will be essential.
- • We encourage you to thrive in ambiguous environments, demonstrating the ability to dive deep into system architecture, from low-level language idioms to the overarching design of large-scale systems, to fully understand their behavior and identify areas for improvement.
- • Your growth and impact trajectory should demonstrate a mastery of gathering and iterating on feedback from both engineering and cross-functional peers, ensuring continuous improvement and alignment across the organization.
- • Strong verbal and written communication skills are vital for effective collaboration with our global engineering team, enabling clear articulation of technical concepts and strategies.
Skills & Technologies
About Affirm Holdings, Inc.
Affirm Holdings operates a point-of-sale consumer lending platform that integrates with online and in-store checkout systems. Through its technology, shoppers can split purchases into fixed, transparent installment payments, while merchants gain conversion and larger order values. The company underwrites and services loans in the United States and Canada using alternative credit models and data partnerships with banks, avoiding deferred-interest structures. It earns revenue from merchant fees and interest on loans, and offers a mobile app for consumers to manage repayment schedules and access additional credit lines.
Similar Opportunities

Massachusetts Mutual Life Insurance Company
2 months ago

T5 Data Centers, LP
2 months ago

