
Job Overview
Location
United Kingdom
Job Type
Full-time
Category
DevOps & SysAdmin
Date Posted
May 8, 2026
Full Job Description
đź“‹ Description
- • As a Site Reliability Engineer at Arbor Education, you will play a critical role in ensuring the resilience, performance, and scalability of the platform that powers school management tools used by over 7,000 schools and trusts across the UK. Your work directly supports Arbor’s mission to transform school operations by reducing staff burnout and enabling data-driven, joyful working environments for educators.
- • Day to day, you will proactively monitor and analyse platform performance, collaborate with engineering teams to resolve bottlenecks, improve observability using tools like DataDog or Prometheus, develop runbooks, conduct disaster recovery drills, participate in incident response and blameless postmortems, and help define and track SLOs to ensure high availability and resilience.
- • You will join a mission-driven SRE team within a purpose-led organisation that values staff wellbeing, innovation, and inclusive culture. Arbor is committed to building technology that gives time and power back to school staff, and you’ll work closely with Platform Engineering, feature teams, and support stakeholders to embed SRE practices across the organisation.
- • In this role, you will deepen your expertise in cloud infrastructure, observability, and incident management while contributing to scalable, reliable systems. You’ll have the opportunity to shape SRE practices, influence architectural decisions, and grow professionally through access to a dedicated CPD budget and collaborative, blameless learning culture.
🎯 Requirements
- • Experience in performance monitoring and analysis
- • Capacity planning experience
- • Scripting and automation skills with experience in relevant technologies
- • Experience with Infrastructure as Code, particularly Terraform
- • Understanding of relational database technologies and their cloud versions (e.g. AWS Aurora)
- • Experience with messaging and distributed asynchronous workloads
- • Experience with nginx or similar technologies
- • Familiarity with SRE processes
- • Aware of DevOps principles like the 3 ways and 5 ideals
🏖️ Benefits
- • 32 days holiday (plus Bank Holidays), made up of 25 days annual leave plus 7 extra company wide days given over Easter, Summer & Christmas
- • Life Assurance paid out at 3x annual salary
- • Comprehensive wellness benefit provided by AIG Smart Health, including 24/7 virtual GP service, mental health support, counselling, and personalised health checks
- • Private Dental Insurance with Bupa
- • Salary sacrifice Pension provided by Scottish Widows
- • Enhanced maternity and adoption leave (20 weeks full pay) and paternity (6 weeks full pay) pay
- • Access to services such as Calm and Bippit (financial wellbeing coaching)
- • Dedicated professional development training budget (CPD courses, upskilling resources, professional memberships etc)
- • Volunteer with a charity of your choice for a day each year
- • Dog friendly offices!
Skills & Technologies
About Arbor Education 3
Arbor Education 3 is an educational organization focused on providing high-quality learning experiences. The company operates schools and educational programs designed to foster student growth and academic achievement. Their approach emphasizes innovative teaching methods and a supportive learning environment. Arbor Education 3 is committed to developing well-rounded individuals prepared for future success in higher education and their careers. The organization plays a role in the education sector by offering accessible and effective schooling options.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Pragmatike Soluciones TecnolĂłgicas S.L.
1 month ago
1 month ago

