
Job Overview
Location
Indiana, USA
Job Type
Full-time
Category
Engineering Manager
Date Posted
March 10, 2026
Full Job Description
Description
- As an Engineering Manager for the Infrastructure team at Checkly, you will play a pivotal role in shaping and scaling the foundational systems that power our cutting-edge DevTool SaaS platform. This is a unique opportunity to lead a talented group of Staff and Senior Engineers, fostering a culture of innovation, reliability, and efficiency. You will be instrumental in evolving and optimizing our hybrid cloud infrastructure, encompassing both AWS and bare metal environments, to securely and cost-effectively run sandboxed code and AI Agents. Your leadership will directly impact the performance, scalability, and security of Checkly's services, ensuring our customers can build reliable products with confidence.
- Your core responsibility will be to guide and mentor your team, empowering them to tackle complex infrastructure challenges. This includes deep dives into system performance, investigating issues from the packet capture level to process memory, and ensuring our systems remain robust and responsive for both ad hoc and scheduled workloads. You will champion a proactive approach to reliability, anticipating potential bottlenecks and implementing solutions before they impact our users. This hands-on management style ensures you remain connected to the technical realities of the infrastructure while providing strategic direction.
- A significant aspect of this role involves collaborating closely with product engineers. You will work hand-in-hand to enhance the developer experience, support our strong shipping culture, and provide the essential observability tools they need to monitor and manage their applications effectively. This cross-functional collaboration is key to maintaining Checkly's reputation for delivering high-quality, reliable tools to our extensive customer base, which includes industry leaders like LinkedIn, Citibank, and Vercel.
- You will be at the forefront of optimizing our hybrid infrastructure, focusing on industry-leading cost efficiency. This involves strategic decision-making regarding resource allocation, performance tuning, and the adoption of new technologies to ensure our infrastructure scales seamlessly with our growth. The ability to balance performance, security, and cost is paramount in this role, especially given the demands of running sandboxed code and AI Agents.
- The ideal candidate will embrace Checkly's remote-first, async-first, and lean operational philosophy. You will thrive in an environment with low meeting overhead and high productivity, contributing to a culture that values clear documentation, well-crafted products, and genuine customer focus. Your ability to work autonomously while also fostering strong team collaboration will be essential for success in this distributed work environment.
- This role offers the chance to be part of a fast-growing, international startup that has recently secured significant Series B funding. You will contribute to a product that is foundational to how over 1000 companies ensure application performance and reliability, leveraging technologies like OpenTelemetry, Playwright, and Monitoring as Code. Your impact will be direct and measurable, helping to shape the future of developer tooling and observability.
- You will be responsible for the strategic planning and execution of infrastructure projects, ensuring alignment with company goals and technical roadmaps. This includes managing the lifecycle of infrastructure components, from design and implementation to ongoing maintenance and optimization. The team's focus on 'Monitoring as Code' and leveraging open-source technologies like OpenTelemetry and Playwright means you'll be working with modern, scalable solutions.
- Furthermore, you will foster a culture of continuous learning and improvement within your team. Encouraging knowledge sharing, professional development, and the adoption of best practices in areas such as infrastructure as code, CI/CD, and cloud-native technologies will be a key part of your leadership. You will also be responsible for performance management, career development, and ensuring a positive and productive team dynamic.
- The role demands a deep understanding of production environments, including troubleshooting complex issues, implementing robust monitoring and alerting strategies, and ensuring high availability. You will be a champion for operational excellence, driving initiatives to improve system resilience, security posture, and overall reliability. Your leadership will ensure that Checkly's infrastructure is not just a supporting element, but a competitive advantage, enabling rapid innovation and dependable service delivery.
- You will also contribute to the broader engineering strategy, providing insights and recommendations on technology choices, architectural decisions, and operational best practices. This includes staying abreast of industry trends and emerging technologies that could benefit Checkly's infrastructure and overall product offering. Your ability to translate technical concepts into actionable plans and communicate them effectively to both technical and non-technical stakeholders will be crucial.
Skills & Technologies
JavaScript
TypeScript
Go
Vue.js
PostgreSQL
DevOps
Remote
€99k-121k
About Checkly Group Inc.
Checkly offers an application reliability platform that unifies testing, monitoring, and observability into a developer-friendly workflow. It provides uptime and end-to-end monitoring, empowering engineering teams to detect, communicate, and resolve performance issues. Using a Monitoring as Code approach, Checkly allows users to automate their entire monitoring process with tools like Playwright and OpenTelemetry, integrating seamlessly into CI/CD workflows. World-class engineering and SRE teams depend on Checkly to deliver reliable digital experiences, and the platform integrates with Slack, SMS, and more to alert teams when issues arise.



