This job has expired

This position was posted on October 3, 2025 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

ConductorOne Inc. logo

Site Reliability Engineer

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Software Engineering

Date Posted

October 3, 2025

Full Job Description

đź“‹ Description

  • • Own the backbone of ConductorOne’s identity governance platform: you’ll architect, deploy, and continuously refine the cloud-native infrastructure that keeps DigitalOcean, Instacart, Ramp, and other forward-thinking enterprises secure and compliant 24/7.
  • • Design and operate multi-region, multi-cloud environments (AWS primary, GCP/Azure secondaries) that scale elastically to handle millions of permission checks and access requests per day while maintaining sub-second latency and five-nines availability.
  • • Build and maintain Infrastructure-as-Code stacks using Terraform and Pulumi, codifying every network segment, IAM policy, and Kubernetes manifest so that environments are reproducible, peer-reviewed, and version-controlled from day one.
  • • Eliminate toil through relentless automation: craft self-healing deployment pipelines, auto-remediation playbooks, and chaos-engineering experiments that surface weaknesses before customers ever feel them.
  • • Establish and track rigorous SLIs/SLOs—think error budgets, p99 latency, and MTTR—then embed these reliability contracts into CI/CD gates so every new feature ships with measurable confidence.
  • • Develop world-class observability: deploy Prometheus, Grafana, Loki, and OpenTelemetry tracing to give engineers real-time, actionable insight into microservices, databases, queues, and third-party integrations.
  • • Lead incident response from detection to post-mortem: run blameless retrospectives, translate root causes into code fixes, and share lessons learned across engineering, product, and customer success teams.
  • • Continuously optimize cost and performance: right-size clusters, leverage spot fleets, and tune data stores so we deliver enterprise-grade reliability without enterprise-grade cloud bills.
  • • Partner with security and compliance teams to harden every layer—network segmentation, secrets management, policy-as-code—ensuring we exceed SOC 2, ISO 27001, and FedRAMP controls without slowing delivery.
  • • Contribute to open-source tooling and internal libraries that the broader SRE community can adopt, amplifying ConductorOne’s reputation as a thought leader in secure, scalable infrastructure.

🎯 Requirements

  • • 3+ years in an SRE, DevOps, or Production Engineering role supporting high-traffic, cloud-native applications
  • • Expert-level proficiency with at least one major cloud platform (AWS preferred) and Infrastructure-as-Code tools such as Terraform, Pulumi, or CloudFormation
  • • Hands-on experience running Kubernetes in production, including cluster bootstrapping, service meshes, and multi-tenancy security
  • • Strong coding or scripting skills in Go, Python, or similar, with a habit of writing tests and peer-reviewed automation
  • • Demonstrated track record of defining SLIs/SLOs, error budgets, and incident-response processes that measurably improved reliability

🏖️ Benefits

  • • Competitive salary plus meaningful equity in a fast-growing, mission-driven company
  • • Flexible PTO and a remote-first culture that trusts you to manage your own schedule
  • • 100% employer-paid medical, dental, and vision coverage for you and your dependents
  • • Annual learning & development stipend, conference budget, and open-source contribution time

Skills & Technologies

Python
AWS
Azure
GCP
Kubernetes
Remote

Ready to Apply?

You will be redirected to an external site to apply.

ConductorOne Inc. logo
ConductorOne Inc.
Visit Website

About ConductorOne Inc.

ConductorOne provides an identity security platform that automates access reviews, provisioning, and de-provisioning across cloud applications and infrastructure. The software integrates with directories, SaaS tools, and cloud providers to enforce least-privilege policies, track entitlements, and monitor usage. Continuous compliance reporting and self-service access requests reduce manual IT workload while improving security posture for mid-market and enterprise organizations.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

❌ EXPIRED
Brazil
Full-time
Expired Dec 2, 2025
Python
Express
PostgreSQL
+4 more

7 months ago

Apply
❌ EXPIRED
Remote
Full-time
Expired Dec 2, 2025
Remote

7 months ago

Apply
❌ EXPIRED
WeTravel, Inc. logo

WeTravel, Inc.

United States
Full-time
Expired Dec 2, 2025
Python
JavaScript
REST
+1 more

7 months ago

Apply
❌ EXPIRED
France
Full-time
Expired Dec 2, 2025
JavaScript
TypeScript
React
+5 more

7 months ago

Apply