
Job Overview
Location: Remote
Job Type: Full-time
Category: DevOps & SysAdmin
Date Posted: March 28, 2026
Full Job Description
Description
- As an SRE - Infra at PostHog Inc., you will play a critical role in transforming a rapidly growing, stateful infrastructure into a predictable, automated platform that supports the company's mission to ship every product companies need to run their business, from early-stage startups to IPO and beyond. This role is not about maintaining the status quo; it's about reducing operational toil through deep ownership, automation, and systems thinking, enabling engineering teams to move faster and more reliably.
- Day to day, you will:
  - Operate and evolve EKS clusters across multiple environments, using Karpenter for autoscaling, Cilium for networking, and ArgoCD for GitOps-driven deployments
  - Manage and refine a multi-account AWS organization, including networking, IAM, and cross-account connectivity
  - Maintain and enhance the Terraform/Terragrunt-based IaC platform with automated plan-on-PR and apply-on-merge pipelines
  - Improve operational tooling for deploys, schema changes, backups, restores, and incident response
  - Identify recurring pain points and eliminate them via code and self-healing automation
  - Optimize cloud spend continuously
  - Participate in on-call rotations, with a focus on reducing incident frequency over time through proactive improvements
- You'll join a remote-first, product-led company where transparency, autonomy, and shipping fast are core values. PostHog operates with public roadmaps, open financials, and a culture that trusts engineers to make product decisions. Teams are small, highly autonomous, and encouraged to be "weird", building ambitious, user-focused products even when they seem unnecessary at first glance. This environment rewards initiative, learning, and shipping impact over process perfection.
- In this role, you will gain deep expertise in scaling stateful systems at petabyte scale, master modern platform engineering practices including GitOps and infrastructure as code, and develop the ability to design systems that scale without scaling human effort. You'll have the autonomy to experiment, the support to learn, and the satisfaction of knowing your work directly enables hundreds of engineers to ship faster and more reliably, making this one of the most impactful infrastructure roles in a high-growth, mission-driven tech company.
Requirements
- Deep hands-on experience with Kubernetes in production, preferably EKS, including debugging node pressure, networking issues, and deployment failures at scale (thousands of nodes)
- Strong experience operating production infrastructure on AWS across multiple accounts, with understanding of organizational boundaries, IAM, and networking between accounts
- Experience automating infrastructure using Terraform or Terragrunt at scale, including module design and state management
- Solid understanding of Linux systems (disk, memory, networking, and failure modes)
- Experience supporting stateful systems such as databases, queues, and storage systems
- Ability to debug and reason about performance and reliability issues in production
- Comfort with owning systems end-to-end, including participating in on-call responsibilities
Benefits
- Fully remote work with flexible hours and asynchronous-first culture
- Equity compensation as part of a well-funded, high-growth startup backed by top-tier investors
- Generous learning and development budget to support continuous skill growth
- Unlimited time off and meeting-free Tuesdays and Thursdays to protect focus time
- Access to PostHog's full product suite, including AI-powered analytics and data warehouse tools
- Transparent culture with open access to company strategy, financials, and roadmap
About PostHog Inc.
PostHog provides an open-source product analytics platform that lets teams track user behavior, run A/B tests, and gather feedback without sending data to third parties. The self-hosted or cloud service captures events, pageviews, feature flags, and session recordings, then surfaces insights through dashboards, funnels, retention, and cohort analysis. Engineers can instrument code once and non-technical teammates can query results using SQL or visual builders. The company maintains the core project under an MIT license and offers paid tiers for enterprise support, higher volumes, and advanced features such as correlation analysis, data pipelines, and team collaboration tools.