hatch I.T., Inc logo

Site Reliability Engineer (SRE)

Job Overview

Location

Remote

Job Type

Full-time

Category

Product Management

Date Posted

March 4, 2026

Full Job Description

đź“‹ Description

  • • As a Site Reliability Engineer (SRE) at CardioOne, you will be at the forefront of ensuring the unwavering reliability, seamless scalability, robust security, and optimal performance of our critical production systems and services. This pivotal role acts as the essential bridge between our dynamic software development teams and our robust operational infrastructure, embodying the core principles of SRE to enable the rapid, secure, and dependable delivery of our cutting-edge applications.
  • • You will be instrumental in designing, implementing, and maintaining sophisticated automation strategies that streamline deployment pipelines, reduce manual toil, and enhance system resilience. This includes developing and managing CI/CD (Continuous Integration/Continuous Deployment) workflows, infrastructure as code (IaC) solutions, and automated testing frameworks to ensure consistency and efficiency across all environments.
  • • A significant aspect of your role will involve establishing and refining comprehensive monitoring and alerting systems. You will leverage advanced tools to gain deep visibility into system health, performance metrics, and potential issues, proactively identifying and addressing anomalies before they impact users. This includes defining Service Level Objectives (SLOs) and Service Level Indicators (SLIs) to quantitatively measure and manage system reliability.
  • • You will actively participate in the incident response process, acting as a key contributor during outages or performance degradations. This involves rapid diagnosis, effective troubleshooting, root cause analysis (RCA), and the implementation of preventative measures to avoid recurrence. Post-incident, you will lead efforts to document learnings and improve system robustness.
  • • Collaborating closely with software engineers, you will provide guidance on designing for reliability, scalability, and maintainability. This includes reviewing code for operational impact, advising on architectural decisions, and promoting best practices for cloud-native development and distributed systems.
  • • You will be responsible for managing and optimizing our cloud infrastructure, likely on platforms such as AWS, Azure, or GCP, ensuring cost-effectiveness, security, and high availability. This involves capacity planning, performance tuning, and implementing robust disaster recovery and business continuity strategies.
  • • Security will be a paramount concern. You will work to embed security best practices throughout the system lifecycle, from development to deployment and operation, ensuring compliance with relevant regulations and protecting sensitive patient data.
  • • You will contribute to the development and maintenance of system documentation, runbooks, and operational playbooks, ensuring that knowledge is shared effectively across the engineering and operations teams.
  • • This role offers a unique opportunity to shape the future of CardioOne's technical infrastructure, directly impacting the company's mission to revolutionize cardiology through innovative technology. You will work in a collaborative environment, reporting directly to the Senior Director of Engineering, and have a significant voice in technical direction and operational excellence.
  • • Your work will directly support CardioOne's mission to partner with independent cardiologists, improve patient outcomes, and reduce healthcare costs. By ensuring the stability and performance of their platform, you enable physicians to focus on patient care and thrive in evolving healthcare models.
  • • You will be part of a growing company that has recently partnered with WindRose Health Investors, indicating a strong commitment to growth and investment in its technological capabilities. This presents an exciting opportunity to contribute to a company in a significant phase of expansion and innovation.
  • • The role demands a proactive, problem-solving mindset, a deep understanding of distributed systems, and a passion for automation and operational excellence. You will be empowered to make impactful decisions and drive continuous improvement in our technology stack.
  • • You will engage in capacity planning, performance tuning, and the implementation of robust disaster recovery and business continuity strategies, ensuring that CardioOne's services are always available and performant.
  • • You will also be responsible for managing and optimizing our cloud infrastructure, ensuring cost-effectiveness, security, and high availability, leveraging your expertise to maintain a resilient and efficient environment.
  • • This position is fully remote, offering flexibility and the opportunity to work from anywhere within the US, contributing to a healthy work-life balance.
  • • You will be a key player in fostering a culture of reliability and operational excellence, directly contributing to the company's success and its ability to positively impact US cardiology.

Skills & Technologies

DevOps
Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

hatch I.T., Inc logo
hatch I.T., Inc
Visit Website

About hatch I.T., Inc

hatch I.T., Inc. is a recruiting and talent‑scaling firm that specializes in sourcing engineering, product, and data teams for startups and high‑growth technology companies. The company offers subscription and bespoke hiring programs including full‑cycle recruitment, employer branding, candidate community building, and integrated outreach to accelerate hires across software engineering, product, QA, and technical leadership roles. hatch I.T. also provides recruitment-as-a-service for scale‑ups and VC portfolios, integrating with clients’ ATS and communication tools to deliver retained search, talent pipelines, and hiring operations support. The firm primarily serves early‑stage and mid‑market tech companies in the U.S. DMV region and beyond.

Similar Opportunities

Washington, District of Columbia, USA
Full-time
Expires Apr 28, 2026
Junior
Remote

11 days ago

Apply
Berlin, Germany
Full-time
Expires Apr 25, 2026
Remote

14 days ago

Apply
⏰ EXPIRES SOON
USA
Full-time
Expires Mar 10, 2026 (Soon)
Remote

2 months ago

Apply
❌ EXPIRED
Santa Monica, California, USA
Full-time
Expired Mar 1, 2026
Senior
Onsite
Remote

2 months ago

Apply