This job has expired

This position was posted on November 20, 2025 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Elligint Health Inc. logo

DevOps Site Reliability Engineer (SRE)

Job Overview

Location

Remote

Job Type

Full-time

Category

Data Science

Date Posted

November 20, 2025

Full Job Description

đź“‹ Description

  • • Architect and own the next-generation, cloud-native infrastructure that powers Elligint Health’s mission-critical healthcare platform, ensuring 99.99 % uptime for millions of patient records and real-time analytics streams.
  • • Design, build, and harden CI/CD pipelines (GitHub Actions, Argo CD, Terraform Cloud) that deploy micro-services, data pipelines, and ML models to AWS EKS in under 10 minutes with zero-downtime blue/green and canary releases.
  • • Implement Infrastructure-as-Code using Terraform, Pulumi, and CloudFormation to provision multi-account, multi-region AWS topologies that meet HIPAA, HITRUST, and SOC 2 Type II controls out of the gate.
  • • Instrument end-to-end observability with Prometheus, Grafana, Loki, and OpenTelemetry, defining SLIs/SLOs for latency, error budgets, and data freshness that directly impact patient-care decisions.
  • • Lead chaos-engineering game days and quarterly disaster-recovery drills to validate RPO < 15 min and RTO < 30 min for PHI workloads stored in encrypted S3, RDS Aurora, and DynamoDB Global Tables.
  • • Automate cost-optimization guardrails using AWS Cost Explorer, Kubecost, and custom Lambda functions that shaved 28 % off our monthly cloud bill while maintaining performance SLAs.
  • • Partner with Security & Compliance to embed guard-duty, security-hub, and Prisma Cloud scans into every build, ensuring vulnerabilities are remediated before containers reach production.
  • • Coach feature teams on SRE best practices—runbooks, blameless post-mortems, error budgets—turning every on-call rotation into a learning opportunity and reducing MTTR by 40 % in the last two quarters.
  • • Own the 24Ă—7 on-call rotation (follow-the-sun across US time zones) with a one-week-in-four schedule, supported by automated alert routing via PagerDuty and Opsgenie.
  • • Continuously evaluate emerging technologies—Kubernetes Gateway API, eBPF observability, serverless Spark on EKS—to keep Elligint Health at the forefront of scalable healthcare tech.
  • • Collaborate with Data Engineering to tune Kafka, Airflow, and dbt pipelines that process 2 TB of HL7/FHIR data daily, ensuring freshness and correctness for downstream predictive models.
  • • Contribute to internal tooling (Python/Go CLIs, Helm charts, GitHub templates) that accelerate developer productivity and standardize deployment patterns across 12 squads.
  • • Champion a culture of documentation: every runbook, architecture decision record (ADR), and incident review is stored in MkDocs and discoverable within 30 seconds.
  • • Present quarterly reliability reviews to the CTO and VP of Engineering, translating technical metrics into business impact—fewer patient portal outages, faster prior-authorization workflows, and higher payer satisfaction scores.

🎯 Requirements

  • • 5+ years production-grade AWS experience (EKS, RDS, MSK, IAM, KMS, CloudTrail) with deep networking, security, and cost-optimization expertise.
  • • Expert-level Terraform and Kubernetes administration, including writing custom operators, admission webhooks, and Helm charts for stateful workloads.
  • • Proficiency in at least one systems language (Go, Rust, or Python) for building CLI tools, operators, and automation scripts.
  • • Hands-on experience implementing HIPAA/HITRUST controls and SOC 2 evidence collection in a regulated environment.
  • • Nice-to-have: CKA/CKS or AWS Pro/Specialty certifications, experience with healthcare data formats (HL7, FHIR), and prior work in a remote-first, fast-growing startup.

🏖️ Benefits

  • • Fully remote-first culture with quarterly in-person summits in rotating US cities (all expenses paid).
  • • 100 % employer-paid medical, dental, and vision for you and 75 % for dependents, plus $2,000 annual HSA contribution.
  • • 20 days PTO, 12 paid holidays, and a 4-week paid sabbatical after 4 years.
  • • $3,000 annual professional-development stipend (conferences, certifications, courses) and a $1,500 home-office setup budget.

Skills & Technologies

DevOps
Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

Elligint Health Inc. logo
Elligint Health Inc.
Visit Website

About Elligint Health Inc.

Elligint Health Inc. provides cloud-based interoperability and analytics software that connects electronic health records, payers, and life-science organizations. Its platform normalizes disparate clinical and claims data into FHIR-compliant formats, enables real-time querying, and delivers population-level insights for care management, quality reporting, and research. The company focuses on removing data silos to accelerate value-based care, clinical trials, and public health initiatives while maintaining HIPAA compliance and patient privacy through advanced encryption and consent management.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Forward Financing, LLC logo

Forward Financing, LLC

Remote - United States
Full-time
Expires Jun 8, 2026
Remote
$137k-187k

13 days ago

Apply
BRAZIL
Full-time
Expires Jun 20, 2026
AWS
Azure
Docker
+4 more

10 hours ago

Apply
Vapi Technologies Inc. logo

Vapi Technologies Inc.

San Francisco
Full-time
Expires May 3, 2026
Onsite
Degree Required

2 months ago

Apply
CANADA
Full-time
Expires Jun 20, 2026
AWS
Senior
Remote

10 hours ago

Apply