This job has expired
This position was posted on September 16, 2025 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Job Overview
Location
Indiana, USA
Job Type
Full-time
Category
DevOps
Date Posted
September 16, 2025
Full Job Description
đź“‹ Description
- • Shape the infrastructure that powers the world’s most comprehensive AI-driven DevSecOps platform, serving 100,000+ organizations and millions of users every day. As a Site Reliability Engineer at GitLab you will architect, build, and automate the systems that keep GitLab.com and our self-managed offerings running at planet-scale with five-nines reliability.
- • Own the end-to-end lifecycle of critical production services—from design and capacity planning through deployment, monitoring, and incident response—using Infrastructure-as-Code (Terraform, Ansible, Chef) and GitOps workflows. You will treat operations as a software problem, writing clean, testable code to eliminate toil and enable self-healing infrastructure.
- • Drive continuous improvement of our observability stack (Prometheus, Grafana, Thanos, OpenTelemetry) to surface actionable insights in real time. You will define SLIs/SLOs that align with customer experience, build error budgets, and partner with product engineering to balance velocity with reliability.
- • Lead blameless post-incident reviews and turn operational surprises into systemic fixes. Your postmortems will feed directly into the GitLab product itself, ensuring every outage makes the platform more resilient for every user.
- • Champion a culture of reliability across the company by mentoring engineers on best practices for distributed systems, chaos engineering, and performance optimization. You will contribute to GitLab’s public runbooks, architecture docs, and open-source projects, embodying our value of transparency.
- • Leverage AI as a force multiplier in your daily workflow—whether that’s using Duo Enterprise to auto-generate infrastructure code, predicting capacity needs with ML models, or automating root-cause analysis. You will experiment with cutting-edge tooling and share what you learn in our internal AI guild.
- • Participate in a follow-the-sun on-call rotation that respects work-life balance. We keep pager load low through aggressive automation and fair scheduling, and we compensate every on-call hour with additional PTO.
- • Collaborate with a globally distributed, fully remote team across 65+ countries. You will default to asynchronous communication in GitLab issues and merge requests, while also jumping on live calls when synchronous collaboration yields better outcomes.
- • Contribute to GitLab’s open-source codebase, turning operational learnings into product features. Your merge requests may improve everything from Kubernetes integrations to CI/CD templates used by thousands of companies.
- • Influence the roadmap for GitLab’s cloud-native architecture, including multi-region Kubernetes clusters, edge caching, and data-tier resiliency. You will work with product managers and executives to prioritize reliability investments that deliver outsized customer impact.
- • Enjoy the freedom to experiment: every team member receives an annual learning and development budget to attend conferences, purchase books, or spin up personal cloud sandboxes. We encourage 10% “investment time” for exploratory projects that could become tomorrow’s standard tooling.
🎯 Requirements
- • 3+ years of experience running large-scale, customer-facing production systems on Kubernetes, GCP, or AWS
- • Proficiency in at least one programming language (Go, Ruby, or Python preferred) and solid shell scripting skills
- • Deep understanding of Linux internals, networking (TCP/IP, HTTP/2, gRPC), and distributed systems concepts
- • Hands-on experience with Infrastructure-as-Code tools (Terraform, Ansible, or similar) and GitOps workflows
- • Familiarity with observability best practices—defining SLIs/SLOs, building dashboards, and configuring alerts
- • Nice-to-have: contributions to open-source projects, chaos engineering experience, or advanced Kubernetes certifications
🏖️ Benefits
- • Fully remote work with flexible hours and asynchronous culture—work from anywhere with reliable internet
- • Competitive global compensation plus equity in a high-growth, profitable company valued at $11B
- • Annual remote-work stipend, home-office setup budget, and monthly internet reimbursement
- • Self-managed PTO policy with a minimum of 20 days encouraged and paid volunteer time off
Skills & Technologies
GitLab
Remote
About GitLab, Inc.
GitLab Inc. is the company behind GitLab, a web-based DevOps platform that provides Git repository management, continuous integration and deployment, security scanning, and issue tracking in a single application. Founded in 2014 and headquartered in San Francisco, the company offers both open-source and commercial editions, serving software teams globally.


