
Job Overview
Location
Boston
Job Type
Full-time
Category
DevOps & SysAdmin
Date Posted
April 1, 2026
Full Job Description
đź“‹ Description
- • As a Senior CloudOps Engineer at CloudZero, you will be a force multiplier for the engineering organization by owning the performance, reliability, and observability of the infrastructure that powers a platform processing billions of cloud cost events daily across AWS, Azure, and GCP, directly enabling customers to make business-critical decisions about their cloud spend.
- • You will design and maintain Pulumi-based Infrastructure as Code to provision reliable, cost-efficient cloud resources without relying on manual console interactions, ensuring infrastructure scales gracefully, fails predictably, and recovers automatically within CloudZero’s unique serverless architecture.
- • You will instrument systems with observability tools like Prometheus or Datadog to surface failures quickly, enabling data-driven debugging and proactive issue detection before customers are impacted.
- • You will automate deployments, scaling, backups, and repetitive operational tasks, balancing automation with practicality to eliminate toil while avoiding unnecessary complexity.
- • You will partner closely with product engineering teams to review architectures, design resilient services, and build deployment pipelines that support safe, fast shipping while optimizing for cost and performance — aligning with CloudZero’s mission of exemplifying efficient cloud usage.
- • You will debug production issues under pressure using thoughtful, reliable system design principles rather than reactive heroics, and document systems clearly to support long-term team clarity and stability.
- • You will communicate complex technical concepts effectively to non-technical stakeholders, ensuring alignment across teams on infrastructure decisions and operational trade-offs.
- • You will contribute to a mission-driven culture at CloudZero, where efficient innovation is pursued through proven reliability engineering principles applied to financial efficiency in cloud cost management.
- • You will grow your expertise in operating distributed systems at real scale, working with frontier AI models like Claude, Codex, or Gemini, and shaping the operational foundation of a platform that helps organizations optimize cloud spend in an increasingly complex economic landscape.
🎯 Requirements
- • 3 to 5+ years of experience building and operating distributed systems in AWS
- • Strong skills in Python and Infrastructure as Code using Pulumi or Terraform
- • Hands-on experience with monitoring tools such as Prometheus or Datadog
- • Proven ability to debug production issues under pressure
- • Experience with frontier AI models such as Claude, Codex, or Gemini
- • Ability to clearly explain complex technical issues to non-technical stakeholders
🏖️ Benefits
- • Opportunity to work on real infrastructure at massive scale — processing billions of events daily across multi-cloud environments
- • Ownership of end-to-end infrastructure with no console-clicking or ticket-closing tasks — pure engineering impact
- • Collaboration with product engineering teams to shape resilient, cost-optimized systems that exemplify CloudZero’s own FinOps principles
- • Exposure to cutting-edge AI integration in cloud cost analytics, working with models like Claude, Codex, and Gemini
- • Culture that values thoughtful design over heroics, promoting sustainable reliability and long-term system stability
- • Mission-driven work helping organizations prove cloud efficiency and make data-driven business decisions amid rising economic pressures
Skills & Technologies
About CloudZero, Inc.
CloudZero provides a cloud cost intelligence platform that helps engineering and finance teams understand, manage, and optimize their cloud spending. Its solution offers real-time visibility into cloud costs, breaking them down by application, team, feature, or any business dimension. This allows organizations to identify cost anomalies, allocate costs accurately, and make data-driven decisions to reduce waste and improve efficiency. By connecting engineering metrics with financial data, CloudZero enables proactive cost management, fostering a culture of cost accountability across the organization and ensuring that cloud investments deliver maximum business value. It integrates with major cloud providers like AWS, Azure, and GCP.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities
27 days ago

Pragmatike Soluciones TecnolĂłgicas S.L.
25 days ago

