Astreya Partners, Inc. logo

Sr. Systems Reliability Engineer

Job Overview

Location

Remote, WA

Job Type

Full-time

Category

DevOps & SysAdmin

Date Posted

May 7, 2026

Full Job Description

đź“‹ Description

  • • As a Senior Systems Reliability Engineer at Astreya Partners, Inc., you will play a pivotal role in establishing and scaling enterprise-wide reliability standards through the SRE Center of Excellence, driving the phased implementation of Dynatrace full-stack observability across all technology towers.
  • • Your day-to-day responsibilities include configuring and managing Dynatrace agents, APM, RUM/Synthetics, Log-in-Context, and Zenoss integrations; building dashboards aligned to the four Golden Signals for engineering and executive audiences; owning SLO-based alerting targeting MTTD under 5 minutes; conducting 2-3 week embedded engagements with tower teams to assess observability maturity, define SLIs/SLOs, and deliver runbooks and alert frameworks; implementing and extending Ansible Automation Platform for infrastructure provisioning and configuration management across 600+ Linux and 3,000-7,000 Windows servers; contributing to automated remediation workflows; establishing enterprise-wide SRE baseline standards; serving as a Domain Champion bridging the Center of Excellence to tower SRE teams; and contributing to OKR tracking across observability, MTTD/MTTI reduction, incident resolution, automation, and SRE readiness.
  • • You will join a Center of Excellence focused on enterprise reliability, collaborating with cross-functional technology towers to standardize observability practices, reduce toil, and improve incident response through automation and data-driven SRE principles.
  • • In this role, you will deepen your expertise in enterprise-scale observability, lead SRE transformations across complex hybrid environments, influence organizational standards, and gain visibility into strategic technology initiatives while developing leadership and stakeholder engagement skills in high-impact, cross-functional settings.

🎯 Requirements

  • • 5+ years of experience in Site Reliability Engineering, IT operations, or related fields
  • • Bachelor’s degree in Computer Science, Engineering, or equivalent (2 additional years of experience in lieu of degree)
  • • 1+ year hands-on experience with Dynatrace (required), including configuring agents, APM, dashboards, SLO alerting, and log integrations in production
  • • Demonstrated ability to define SLIs/SLOs in collaboration with product and engineering teams
  • • Proven ability to present and lead in front of stakeholders, explain SRE principles to technical and non-technical audiences, and guide teams through implementation
  • • Experience in enterprise environments with a mix of on-prem, cloud, and homegrown applications
  • • Must be authorized to work in the U.S. (W2 only; no sponsorship)
  • • Must be located near an Alaska Airlines hub city and available for on-site work approximately once per month
  • • Availability to work West Coast / PST hours

🏖️ Benefits

  • • Opportunity to own and scale enterprise-wide Dynatrace implementation from Phase 1 to full organization-wide rollout
  • • Lead embedded SRE engagements with tower teams to drive observability maturity and reduce toil
  • • Shape and establish enterprise-wide SRE baseline standards as a Domain Champion in the Center of Excellence
  • • Work with hybrid infrastructure including 600+ Linux and 3,000-7,000 Windows servers using Ansible Automation Platform
  • • Contribute to automated remediation workflows targeting zero human intervention for known issues
  • • Influence OKR tracking across key SRE metrics including MTTD/MTTI reduction, incident resolution, and automation
  • • Engage in blameless postmortems and RCA feedback loops to foster a culture of continuous improvement
  • • Develop leadership and stakeholder management skills in a high-visibility, cross-functional role

Skills & Technologies

Python
Kubernetes
Terraform
Linux
Grafana
Senior
Remote
Degree Required

Ready to Apply?

You will be redirected to an external site to apply.

AI Job Fit Analysis
Pro

See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.

Astreya Partners, Inc. logo
Astreya Partners, Inc.
Visit Website

About Astreya Partners, Inc.

IT services company providing global technology workforce solutions, managed services, and digital workplace support. Delivers end-user computing, service desk, logistics, and enterprise technology deployment for Fortune 500 clients, operating across North America, EMEA, and APAC with a distributed talent model and integrated platform for device lifecycle management.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
Yerevan, Armenia
Full-time
Expired Jun 4, 2026
Python
Java
Go
+6 more

2 months ago

Expired
Pragmatike Soluciones TecnolĂłgicas S.L. logo

Pragmatike Soluciones TecnolĂłgicas S.L.

Armenia
Full-time
Expired Jun 6, 2026
JavaScript
TypeScript
Rust
+4 more

2 months ago

Expired
Yerevan, Armenia
Full-time
Expired Jun 4, 2026
Python
Java
Go
+5 more

2 months ago

Expired
Argentina
Full-time
Expired May 31, 2026
Azure
Remote
$40k-45k

3 months ago