
Job Overview
Location: USA
Job Type: Full-time
Category: Software Engineering
Date Posted: February 26, 2026
Full Job Description
📋 Description
- Solace Corporation is on a mission to fundamentally transform the U.S. healthcare system, which is plagued by complexity and low health literacy, leaving 88% of adults struggling to navigate it. As a Series C startup backed by prominent investors including Inspired Capital, Craft Ventures, Torch Capital, Menlo Ventures, SignalFire, and IVP, we are building solutions that empower patients. Our lean, mission-driven, and rapidly growing U.S.-based team is dedicated to redefining healthcare through urgency, precision, and heart. We foster an environment where individuals can push their boundaries, sharpen their skills, and do their best work alongside a deeply committed team. If you thrive in an intense, high-impact setting, Solace is the place for you.
- As a Senior Platform Engineer at Solace, you will play a pivotal role in building the foundational infrastructure that powers the future of healthcare technology. You will be an integral part of our nascent platform engineering team, focusing on creating robust, scalable, and reliable systems. Your primary objective will be to enable product engineers to deploy code rapidly and with confidence, while building safeguards that allow systems to recover from failures autonomously. When an issue cannot be self-healed, you will be instrumental in diagnosing and resolving it.
- This role demands a versatile generalist who can work effectively both independently and collaboratively. You will engage with high-level architectural design as well as the details of low-level infrastructure. Deep curiosity and a commitment to continuous learning are essential. You will be expected to look beyond the surface: understanding how systems work, anticipating potential failure points, and designing mechanisms for self-healing. Pride in your craft, excellent communication skills, and an exceptional ability to absorb and act on feedback are crucial. We treat mistakes as learning opportunities and expect you to take ownership of your work. We value individuals who are energized by impact and unhindered by the bureaucracy often found in larger organizations.
- Your responsibilities will encompass a broad range of critical platform engineering tasks. You will actively contribute to building out our cloud infrastructure and application platform, ensuring it is scalable, secure, and cost-effective. A key aspect of your role will be assessing the risk and impact of changes to our live production environment, implementing rigorous testing and validation processes to maintain stability. You will participate in our on-call rotation, providing timely and effective responses to incidents.
- Troubleshooting complex problems that emerge from the interactions between technology, governance policies, and human factors will be a significant part of your day-to-day work. You will design and implement autonomous self-healing processes within our systems, minimizing downtime and manual intervention. You will also serve as a subject matter expert and a resource for our product and data engineers, sharing your knowledge while expanding your own expertise into new areas. Effective communication with stakeholders and internal platform customers will be vital to ensure alignment and successful adoption of platform services.
- We are seeking individuals with a diverse, wide-ranging skill set. Experience working in a startup environment is highly valued, as is a proven track record of building scalable infrastructure for hosting web applications or data workloads. You should have deep troubleshooting capabilities, digging through architectural and abstraction layers to identify root causes, even under pressure. This includes experience troubleshooting both solo and as part of a team, generating and testing hypotheses, and implementing effective solutions. Fluency on the command line and comfort working with Linux are essential. Strong communication skills are a must, enabling you to explain complex technical concepts clearly.
- Expertise in cloud infrastructure is required, with experience working with GCP, AWS, or Azure. GCP experience is preferred, and familiarity with cloud landing zones and GitOps technologies like Terraform is a significant plus. A solid understanding of networking concepts, including VPCs, subnets, routes, peering, DNS, load balancers, L4/L7 routing, CDNs, and TLS, and how they interact, is crucial. You should also have experience with observability tools such as Datadog and OpenTelemetry for infrastructure monitoring, application performance monitoring, and database performance monitoring. Comfort in instrumenting application code, building dashboards, and collaborating with application engineers is expected.
- We are particularly interested in candidates who bring expertise in one or more of the following focus areas: Site Reliability Engineering, with a proven ability to improve performance, reliability, and cost across the entire stack; Kubernetes, with an understanding of its self-healing and extensibility features for scalable, resilient, and secure workload execution; Polyglot programming, with comfort working in different languages and learning new ones, and proficiency in reading code and understanding its operational characteristics; DevOps, with experience building, optimizing, and maintaining CI/CD systems; Developer Product Experience, creating tools and low-friction experiences for product engineers with a product mindset; or Security, with experience in cloud infrastructure and in highly regulated industries like healthcare.
🎯 Requirements
- Proven experience building and managing scalable cloud infrastructure (GCP preferred) for web applications or data workloads, including familiarity with GitOps technologies like Terraform.
- Deep troubleshooting expertise, with a demonstrated ability to identify root causes in complex systems under pressure, and strong command-line proficiency with Linux.
- Solid understanding of networking concepts (VPCs, subnets, DNS, load balancers, TLS) and experience with observability tools (e.g., Datadog, OpenTelemetry) for monitoring and performance analysis.
- Experience working in a startup environment and strong communication skills, with the ability to collaborate effectively with cross-functional teams and stakeholders.
- Experience in one or more focus areas such as Site Reliability Engineering, Kubernetes, Polyglot programming, DevOps, Developer Product Experience, or Security in regulated industries.
🏖️ Benefits
- Competitive salary and equity package commensurate with experience.
- Comprehensive health, dental, and vision insurance.
- Generous paid time off and holidays.
- Opportunities for professional development and continuous learning.
- A mission-driven culture focused on making a significant impact in healthcare.
About Solace Corporation
Solace Corporation provides event-driven middleware and streaming data movement solutions that let enterprises connect cloud, on-premises, and IoT environments. Its PubSub+ event brokers, APIs, and managed services enable real-time information flow across applications, devices, and users for financial services, retail, transportation, and government markets. The company supports open protocols like AMQP, MQTT, JMS, and REST, integrates with Kubernetes, Kafka, and serverless platforms, and offers hybrid and multi-cloud deployment options. Headquartered in Ottawa, Canada, Solace was founded in 2001 and serves global customers requiring low-latency, high-throughput, and secure event streaming.


