
Senior Site Reliability Engineer, Platform & Cloud FinOps (100% Remote - USA Central & EST)
Job Overview
Location
Boston
Job Type
Full-time
Category
Software Engineering
Date Posted
March 7, 2026
Full Job Description
đź“‹ Description
- • Join Hopper's innovative Cloud FinOps team as a Senior Site Reliability Engineer, a pivotal role focused on optimizing our expansive Google Cloud infrastructure. This position is 100% remote within the USA (Central & EST time zones), offering flexibility and the opportunity to make a significant impact on a platform serving millions of users globally.
- • As a Senior SRE, you will be instrumental in driving higher cost efficiency across our cloud operations. This involves tackling complex challenges such as reducing network egress costs by identifying and eliminating unnecessary data transmission, optimizing data storage solutions by ensuring warehouse data is appropriately utilized and leveraging cost-effective storage tiers like cold storage for infrequently accessed data.
- • A key responsibility will be the meticulous optimization of autoscaling for both databases and compute resources, ensuring that our infrastructure scales dynamically and efficiently to meet demand while minimizing unnecessary expenditure.
- • You will play a crucial role in enhancing our cost attribution systems, providing all engineering teams with clear, actionable visibility into their cloud spending. This transparency is vital for fostering a culture of cost awareness and accountability across the organization.
- • This role also involves active participation in incident response, including being part of an on-call rotation for platform incidents. Given the distributed nature of our engineering teams across America and Europe, this rotation is designed to allow for restful sleep during off-hours.
- • You will contribute to resolving technical queries and challenges faced by other engineers regarding our infrastructure, and you will be responsible for reviewing and approving Pull Requests (PRs) that require Platform team oversight, ensuring adherence to best practices and standards.
- • You will be an integral part of a small, highly collaborative, and efficient team of SREs, working together to maintain and improve the reliability, scalability, security, and cost-effectiveness of Hopper's cloud infrastructure.
- • The ideal candidate possesses a strong foundation in Site Reliability Engineering, DevOps, Software Engineering, or Systems Engineering, coupled with exceptional troubleshooting skills and a systematic approach to problem-solving.
- • You will be involved in system design, applying strong analytical capabilities to architect robust and efficient solutions. Excellent communication skills are essential for collaborating with cross-functional teams and articulating technical concepts clearly.
- • Familiarity with major cloud providers, with a preference for Google Cloud, is highly desirable. Experience with SQL for data analysis and management is also important.
- • Proficiency in containerization technologies like Kubernetes and associated tooling such as Kustomize and Helm is expected.
- • Experience with Service Mesh technologies, particularly Istio, will be a significant advantage.
- • A solid understanding of networking concepts, including DNS, TLS, certificates, and ingresses, is crucial for optimizing traffic flow and security.
- • Expertise in observability tools for log collection, metrics, and Application Performance Monitoring (APM), preferably Datadog, is required to ensure system health and performance.
- • Knowledge of security best practices, including IAM, RBAC, and network security principles, is essential for protecting our infrastructure.
- • Familiarity with authentication and authorization technologies will be beneficial.
- • Experience with CI/CD pipelines for automated software delivery is a must.
- • A good understanding of various database technologies and their operational aspects is necessary.
- • Competency in scripting languages such as Bash and Python, or other relevant scripting languages, is required for automation tasks.
- • This role offers the opportunity to work on cutting-edge technology within a fast-paced, entrepreneurial environment, contributing directly to the success of a leading travel platform.
Skills & Technologies
Python
GCP
Kubernetes
Datadog
SSL
Senior
Remote
About Hopper Inc.
Hopper is a travel technology company that uses predictive analytics and machine learning to forecast flight and hotel prices, allowing consumers to book travel at optimal times. Founded in 2007 and headquartered in Montréal, Canada, it operates mobile-first booking platforms and provides fintech products like price freeze, cancel-for-any-reason, and rebooking guarantees to reduce travel risk.


