DailyPay, Inc. logo

Staff Site Reliability Engineer

Job Overview

Location

Belfast

Job Type

Full-time

Category

Software Engineering

Date Posted

May 21, 2026

Full Job Description

đź“‹ Description

  • • Lead the Site Reliability Engineering (SRE) function across DailyPay’s global engineering organization, driving operational excellence and system reliability as the primary SRE subject matter expert.
  • • Champion and embed SRE principles aligned with the Google SRE Book across all engineering teams, influencing culture, processes, and technical decision-making to ensure services remain always up, always stable, and always available.
  • • Define and own the SRE roadmap, technical strategy, and reliability standards in close collaboration with leadership and principal engineers to align with business-critical objectives.
  • • Establish and evolve organization-wide observability standards, selecting and standardizing platforms such as DataDog, Loki, Grafana, Tempo, Mimir, and OpenTelemetry to enable proactive detection and resolution of system issues.
  • • Identify high-impact opportunities to improve system reliability, drive their implementation from concept to completion, and measure outcomes against key reliability metrics.
  • • Mentor and elevate the SRE team by setting a high bar for technical quality, fostering growth through direct coaching, and leading by example in incident response, automation, and documentation practices.
  • • Collaborate with engineering teams to integrate reliability best practices into the software development lifecycle, reducing toil through automation and infrastructure-as-code.
  • • Design, build, and scale highly available, fault-tolerant systems on AWS using Kubernetes and Terraform to support DailyPay’s on-demand pay platform serving millions of users.
  • • Ensure all infrastructure and service deployments follow secure, repeatable, and auditable practices, minimizing downtime and maximizing system resilience.
  • • Serve as a technical bridge between SRE, platform, and application engineering teams to align priorities, resolve cross-functional bottlenecks, and promote shared ownership of system health.
  • • Advocate for and implement AI-assisted development tools in daily workflows, critically evaluating AI-generated code before production deployment while maintaining code quality and security standards.
  • • Contribute to incident management protocols, postmortem culture, and blameless reviews to drive continuous learning and improvement across the organization.
  • • Promote a high-trust, inclusive engineering environment where team members are empowered to challenge norms, ask difficult questions, and share bold perspectives without fear of professional fallout.
  • • Represent DailyPay’s SRE function externally through best practice sharing, internal knowledge transfer, and alignment with industry standards in reliability engineering.
  • • Act as a technical anchor during critical incidents, providing rapid diagnosis, coordination, and resolution while ensuring communication remains clear and transparent to stakeholders.
  • • Evaluate and recommend new tools, technologies, and architectural patterns to enhance system scalability, observability, and operational efficiency.
  • • Maintain a strong focus on automation, eliminating manual toil through code and infrastructure changes, ensuring SRE time is spent on innovation rather than repetitive tasks.
  • • Support the scaling of DailyPay’s infrastructure to meet growing user demand while maintaining 99.9%+ service availability targets.
  • • Participate in on-call rotations as a senior member of the SRE team, ensuring timely response to production incidents with deep technical ownership.
  • • Collaborate with security and compliance teams to ensure infrastructure meets regulatory and data protection requirements for financial services.
  • • Document system architectures, runbooks, and operational procedures to ensure knowledge retention and team scalability.
  • • Influence product and engineering roadmaps by advocating for reliability as a non-negotiable feature, not an afterthought.
  • • Model accountability, intellectual honesty, and curiosity, encouraging the team to check assumptions and embrace diverse perspectives to arrive at better engineering outcomes.

🎯 Requirements

  • • 8+ years of experience designing, building, and scaling complex, highly available systems
  • • Deep SRE expertise aligned with the principles outlined in the Google SRE Book
  • • Proven technical leadership with a track record of delivery, cross-team collaboration, and influencing engineering culture
  • • Experience with Terraform, Kubernetes, AWS, and LGTM (Loki, Grafana, Tempo, Mimir)
  • • Comfort writing production-quality code in Go or Python
  • • Willingness to use and critically evaluate AI-assisted development tools in daily workflow

🏖️ Benefits

  • • Opportunity for equity ownership
  • • Private health insurance option
  • • Employee Resource Groups
  • • Fun company outings and events
  • • Generous PTO Allowance
  • • 5% Pension contribution

Skills & Technologies

Python
Go
AWS
Kubernetes
Terraform
Senior
Onsite

Ready to Apply?

You will be redirected to an external site to apply.

AI Job Fit Analysis
Pro

See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.

DailyPay, Inc. logo
DailyPay, Inc.
Visit Website

About DailyPay, Inc.

DailyPay provides an on-demand pay platform that integrates with employer payroll systems, allowing employees to access earned wages before the scheduled payday. Founded in 2015 and headquartered in New York City, the company partners with enterprises across retail, hospitality, healthcare and contact-center industries to offer real-time pay transfers, automated savings, financial counseling and analytics dashboards that reduce turnover and support workforce financial wellness.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
Austin
Full-time
Expired Jun 2, 2026
Python
Senior
Onsite
+1 more

3 months ago

Expired
Veeam Software Group GmbH logo

Veeam Software Group GmbH

Remote, Switzerland
Full-time
Expired Apr 25, 2026
Go
Remote

4 months ago

Expired
Hybrid - San Francisco
Full-time
Expired May 27, 2026
AWS
GCP
REST
+3 more

3 months ago

Brazil (Remote)
Contract
Expires Jul 18, 2026
Remote

1 month ago