Staff Software Engineer - Infrastructure (remote, UTC-3 to UTC+3)

Job Overview

Location

Indiana, USA

Job Type

Full-time

Category

Software Engineering

Date Posted

March 4, 2026

Full Job Description

📋 Description

  • As a Staff Software Engineer - Infrastructure at Checkly, you will be at the forefront of building and maintaining the robust, scalable, and cost-efficient infrastructure that powers our leading synthetic monitoring platform. You will play a pivotal role in ensuring the reliability, performance, and security of our hybrid cloud environment, which spans both AWS and bare metal. This is an opportunity to work with a highly skilled team of engineers, contribute to a fast-growing, remote-first startup, and directly impact the developer experience for over 1,000 companies worldwide, including industry giants like LinkedIn and Citibank.
  • Your primary responsibility will be to evolve and optimize our complex infrastructure. This involves ensuring the secure execution of sandboxed code and AI Agents, a critical function for our platform, while maintaining industry-leading cost efficiency. You will delve deep into the intricacies of our systems, investigating customer and infrastructure problems with a meticulous approach, potentially down to the packet capture and process memory level. This hands-on problem-solving ensures that our customers can place unwavering trust in Checkly's reliability.
  • You will be instrumental in enhancing the overall infrastructure reliability. This means proactively identifying and addressing potential bottlenecks, optimizing performance for both ad hoc and scheduled workloads, and preventing unexpected cost escalations. Your work will directly contribute to systems that remain responsive and stable, even under heavy load.
  • Collaboration is key in this role. You will work closely with product engineers to enhance the developer experience within Checkly. This includes supporting our strong shipping culture by providing the necessary tools, infrastructure, and observability that enable rapid and confident deployment of new features. You will be a bridge between infrastructure and product development, ensuring seamless integration and efficient workflows.
  • The role demands a deep understanding of Linux administration and hands-on experience with building and maintaining production-grade infrastructure in both cloud (AWS) and bare metal environments. Proficiency in managing automated infrastructure using tools like Terraform and Ansible is essential. A solid grasp of Kubernetes is also crucial for orchestrating and managing our containerized workloads.
  • You will be part of an async-first, remote-first culture that prioritizes productivity and clear communication. This means embracing asynchronous workflows, minimizing unnecessary meetings, and contributing to excellent documentation. You'll have the autonomy to manage your work effectively while also actively engaging with colleagues and contributing to a supportive team environment.
  • Checkly is at an exciting stage of growth, having recently secured $20M in Series B funding. This investment will fuel our expansion and allow us to further innovate and scale our platform. As a Staff Software Engineer, you will be a key player in this growth phase, helping to shape the future of synthetic monitoring and developer observability.
  • You will contribute to the continuous improvement of our CI/CD pipelines, ensuring efficient and reliable code deployments. This involves working with infrastructure-as-code principles to manage and provision resources dynamically.
  • You will be involved in capacity planning and performance tuning, ensuring our infrastructure can scale to meet the growing demands of our customer base.
  • Participating in on-call rotations will be part of the role, requiring you to respond to and resolve critical incidents to maintain service uptime and performance.
  • You will have the opportunity to mentor junior engineers, sharing your expertise and fostering a culture of learning and technical excellence within the team.
  • The role requires a proactive approach to security, ensuring that all infrastructure components and data are protected against threats and vulnerabilities.
  • You will contribute to defining and implementing best practices for infrastructure management, monitoring, and incident response.
  • Your work will directly influence the technical direction of Checkly's infrastructure, providing opportunities to research and implement new technologies that can enhance our platform's capabilities and efficiency.
  • You will be a champion for operational excellence, driving initiatives that improve system stability, reduce downtime, and enhance overall system resilience.
  • The ability to translate complex technical challenges into actionable plans and communicate them effectively to both technical and non-technical stakeholders is paramount.
  • You will be empowered to make significant technical decisions and drive their implementation, taking ownership of critical infrastructure projects from conception to completion.
  • This role offers a unique chance to work with cutting-edge technologies and contribute to a product that is fundamentally changing how developers build and maintain reliable applications.

Skills & Technologies

JavaScript
TypeScript
Go
Vue.js
PostgreSQL
DevOps
Senior
Remote
€124k-152k

About Checkly Group Inc.

Checkly offers an application reliability platform that unifies testing, monitoring, and observability into a developer-friendly workflow. They provide uptime and end-to-end monitoring, empowering engineering teams to detect, communicate, and resolve performance issues. Using a Monitoring as Code approach, Checkly allows users to automate their entire monitoring process with tools like Playwright and OpenTelemetry, integrating seamlessly into CI/CD workflows. World-class engineering and SRE teams depend on Checkly to deliver reliable digital experiences, and they provide integrations for Slack, SMS, and more to alert teams when issues arise.
