This job has expired
This position was posted on January 3, 2026 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Job Overview
Location
London, Oregon, USA
Job Type
Full-time
Category
Software Engineering
Date Posted
January 3, 2026
Full Job Description
đź“‹ Description
- • Own the beating heart of incident.io’s infrastructure. You will architect, build, and continuously evolve the cloud platform that powers 500,000+ incidents a month for Netflix, Airbnb, Block, and 1,500 other demanding customers. Every millisecond you shave off a deploy, every extra nine of reliability you add, directly translates into faster incident resolution and happier end-users.
- • Design and maintain Terraform-driven infrastructure as code that is versioned, peer-reviewed, and reproducible from dev to prod. You will codify best practices so that any engineer can spin up or tear down environments safely, while you keep an eye on cost, security, and compliance.
- • Build and harden CI/CD pipelines that let product engineers ship dozens of times per day with confidence. You’ll integrate automated testing, security scanning, blue-green and canary deployments, and rollback strategies that work at scale. Expect to push the boundaries of Google Cloud Build, Cloud Deploy, and custom tooling to achieve sub-minute deploys.
- • Champion reliability and observability. You will define SLIs/SLOs, instrument services, and build dashboards and alerts that make sense to both engineers and on-call responders. When things break, you lead blameless post-mortems and turn lessons learned into resilient systems.
- • Scale for tomorrow, not just today. Our customer base is growing 3× year-over-year. You will forecast capacity, automate horizontal and vertical scaling, and design multi-region architectures that keep latency low and availability high—even when half the internet is on fire.
- • Elevate developer experience. You’ll create golden-path templates, self-service tooling, and documentation that empower product engineers to own their services end-to-end. Think internal developer portals, ephemeral preview environments, and chat-ops commands that feel like magic.
- • Collaborate across the company. You’ll pair daily with product engineers, join customer calls to understand pain points, and work with Customer Success to translate feedback into infrastructure improvements. Your code will be reviewed—and you’ll review theirs—in a culture that prizes kindness and rigor.
- • Join a two-person platform team poised to double in the next 12 months. You’ll help hire, mentor, and set technical direction for the next wave of platform engineers. Your architectural decisions today will echo for years.
- • Stay curious and keep us sharp. You’ll run internal tech talks, experiment with bleeding-edge GCP services, and prototype new approaches to chaos engineering, progressive delivery, and AI-driven capacity planning. If it makes us faster, safer, or happier, you’ll champion it.
Skills & Technologies
About Incident Technologies, Inc.
Incident Technologies provides a cloud platform that consolidates alerts, on-call scheduling, incident response workflows, and status communication into one tool. Teams connect monitoring sources, define escalation rules, and automate runbooks to reduce mean time to resolution. The service integrates with Slack, PagerDuty, GitHub, and observability suites, offering real-time collaboration, stakeholder updates, and post-mortem analytics. Founded by ex-Uber and Stripe engineers, the company targets DevOps and SRE groups seeking faster incident handling without juggling multiple tools.
Similar Opportunities

SHI International Corp.
15 days ago

