
Job Overview
Location
Toronto
Job Type
Full-time
Category
Software Engineering
Date Posted
March 7, 2026
Full Job Description
đź“‹ Description
- • Join Hopper's dynamic Cloud FinOps team as a Senior Site Reliability Engineer, a pivotal role focused on optimizing our expansive Google Cloud infrastructure that serves millions of users and hundreds of engineers.
- • This position is 100% remote, offering flexibility and the opportunity to contribute to a leading travel technology company from anywhere.
- • You will be instrumental in driving significant cost efficiencies across our cloud operations, tackling complex challenges with innovative and practical solutions.
- • Key responsibilities include identifying and implementing strategies to reduce network egress costs, such as optimizing header transmissions.
- • You will analyze and optimize data storage solutions, ensuring that warehouse data is appropriately utilized and stored in the most cost-effective tiers, like cold storage for infrequently accessed buckets.
- • A core part of your role will involve fine-tuning autoscaling configurations for both databases and compute resources to maximize performance and minimize waste.
- • You will play a crucial role in enhancing our cost attribution systems, providing all engineering teams with clear, actionable visibility into their cloud spending.
- • This role requires active participation in incident response, including being part of an on-call rotation to ensure the reliability and availability of our platform.
- • The team is globally distributed across America and Europe, allowing for a comfortable on-call schedule that respects your sleep.
- • You will also provide essential support to other engineers, troubleshooting infrastructure-related issues and offering guidance.
- • Approving Pull Requests (PRs) that require Platform supervision will be a regular duty, ensuring adherence to best practices and standards.
- • You will be an integral part of a small, highly efficient team of SREs, contributing to a collaborative and results-oriented environment.
- • The ideal candidate possesses a strong foundation in Site Reliability Engineering (SRE), DevOps, Software Engineering, or Systems Engineering principles.
- • Exceptional troubleshooting skills are paramount for diagnosing and resolving complex infrastructure issues.
- • You will be involved in system design, leveraging strong analytical capabilities to architect scalable, reliable, and cost-effective solutions.
- • Excellent communication skills are essential for collaborating with cross-functional teams and articulating technical concepts clearly.
- • Familiarity with major cloud providers, with a strong preference for Google Cloud, is a key requirement.
- • Proficiency in SQL is necessary for data analysis and cost attribution efforts.
- • Experience with containerization technologies like Kubernetes and associated tooling such as Kustomize and Helm is expected.
- • Knowledge of Service Mesh technologies, particularly Istio, will be highly beneficial.
- • A solid understanding of networking concepts, including DNS, TLS, certificates, and ingresses, is crucial.
- • Expertise in observability tools for log collection, metrics, and Application Performance Monitoring (APM), with a preference for Datadog, is required.
- • Security best practices, including IAM, RBAC, and network security, are vital for maintaining a secure infrastructure.
- • Familiarity with authentication and authorization technologies is important.
- • Experience with Continuous Integration and Continuous Deployment (CI/CD) pipelines is necessary.
- • A good understanding of various database technologies is expected.
- • Competency in scripting languages such as Bash and Python, or other relevant scripting languages, is required for automation tasks.
- • This role offers the opportunity to make a substantial impact on a rapidly growing, innovative travel technology company, contributing directly to its mission of transforming the travel industry through technology and fintech solutions.
- • You will work with cutting-edge technologies and contribute to a platform that powers millions of travelers worldwide, making your work both challenging and rewarding.
Skills & Technologies
Python
GCP
Kubernetes
Datadog
SSL
Senior
Remote
About Hopper Inc.
Hopper is a travel technology company that uses predictive analytics and machine learning to forecast flight and hotel prices, allowing consumers to book travel at optimal times. Founded in 2007 and headquartered in Montréal, Canada, it operates mobile-first booking platforms and provides fintech products like price freeze, cancel-for-any-reason, and rebooking guarantees to reduce travel risk.


