
Job Overview
Location: Remote (EMEA)
Job Type: Full-time
Category: DevOps & SysAdmin
Date Posted: April 3, 2026
Full Job Description
📋 Description
- As an SRE specializing in ClickHouse at PostHog, you will play a pivotal role in scaling one of the largest self-managed ClickHouse deployments on AWS, currently operating at petabyte scale and preparing for 10–50× growth. This role is critical to ensuring the reliability, performance, and automation of PostHog’s core data infrastructure, which powers product analytics, customer data platforms, and AI-driven insights for over 100,000 companies worldwide. Your work will directly enable the company’s mission to provide an all-in-one operating system for software builders, from startup to IPO and beyond.
- You will manage and optimize large fleets of EC2-based virtual machines, storage, and networking infrastructure tailored for data-intensive workloads, focusing on provisioning, scaling, rebalancing, and recovery of ClickHouse clusters. Your day-to-day will involve designing and implementing automation to reduce manual toil, improving tooling for schema changes, backups, restores, and incident response, and collaborating closely with ClickHouse engineers to translate database requirements into robust infrastructure solutions. You will also participate in on-call rotations with a strong emphasis on preventing incidents through proactive system design and self-healing automation.
- You’ll join a remote-first, async-driven engineering culture rooted in transparency, autonomy, and shipping fast. PostHog operates with radical openness, sharing revenue, board notes, and fundraising plans publicly, and empowers individuals to choose high-impact work based on customer value and personal motivation. Teams are small and highly autonomous and own their work end to end, allowing you to drive meaningful change without bureaucratic overhead. Tuesdays and Thursdays are meeting-free, protecting deep focus time for building and problem-solving.
- This role offers the opportunity to grow into a leading expert in large-scale stateful systems on AWS, particularly around analytical databases like ClickHouse. You’ll gain deep experience in infrastructure automation, performance tuning at petabyte scale, and building resilient, self-healing platforms. These skills are rare and highly valuable in today’s infrastructure landscape. You’ll also have the chance to shape PostHog’s long-term data platform strategy as the company expands into CRM, workflow, revenue analytics, and support products.
🎯 Requirements
- Strong experience operating production infrastructure on AWS, including EC2, networking, and storage services
- Hands-on experience managing VM-based systems (not just managed PaaS) for stateful workloads
- Proven ability to automate infrastructure using tools like Terraform, Ansible, or similar
- Solid understanding of Linux systems, including disk, memory, networking, and failure modes
- Experience supporting stateful systems such as databases, queues, or storage systems
- Ability to debug and reason about performance and reliability issues in production environments
- Comfort with end-to-end ownership, including participation in on-call and incident response
🏖️ Benefits
- Fully remote role with flexibility to work from anywhere within EMEA time zones
- Access to a transparent, high-trust culture where company strategy, finances, and roadmaps are openly shared
- Generous learning and development budget to support growth in infrastructure, databases, and cloud technologies
- Unlimited time off and meeting-free days (Tuesdays and Thursdays) to protect focus and prevent burnout
- Opportunity to work on cutting-edge, petabyte-scale ClickHouse infrastructure with real-world impact
- Equity participation in a well-funded, fast-growing startup backed by top-tier investors
About PostHog Inc.
PostHog provides an open-source product analytics platform that lets teams track user behavior, run A/B tests, and gather feedback without sending data to third parties. The self-hosted or cloud service captures events, pageviews, feature flags, and session recordings, then surfaces insights through dashboards, funnels, retention, and cohort analysis. Engineers can instrument code once and non-technical teammates can query results using SQL or visual builders. The company maintains the core project under an MIT license and offers paid tiers for enterprise support, higher volumes, and advanced features such as correlation analysis, data pipelines, and team collaboration tools.