This job has expired
This position was posted on October 9, 2025 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Job Overview
Location
Remote
Job Type
Full-time
Category
Software Engineering
Date Posted
October 9, 2025
Full Job Description
đź“‹ Description
- • Own the reliability, scalability, and security of Ada’s cloud-native infrastructure, serving millions of AI-powered customer conversations every month. You will architect and automate systems that keep our platform running 24/7 while enabling product teams to ship new features daily.
- • Design, build, and maintain Kubernetes clusters across multiple AWS regions, ensuring zero-downtime deployments, auto-scaling, and cost optimization. You will define Infrastructure-as-Code patterns using Terraform and Helm that the entire engineering organization can reuse with confidence.
- • Establish and continuously improve CI/CD pipelines that cut release cycles from hours to minutes. You will integrate security scanning, automated testing, and progressive rollout strategies so every commit reaches production safely and quickly.
- • Champion observability best practices by implementing Prometheus, Grafana, Loki, and distributed tracing. You will set SLOs, error budgets, and on-call rotations that balance velocity with rock-solid reliability, and you will lead blameless post-mortems that turn incidents into learning opportunities.
- • Partner closely with machine-learning engineers to optimize GPU workloads and model-serving infrastructure. You will fine-tune autoscaling policies, reduce cold-start latency, and keep inference costs predictable as demand spikes during product launches.
- • Automate operational toil away—think database migrations, certificate renewals, and secrets rotation—so the team can focus on high-impact work. You will script in Python or Go, build custom operators, and contribute reusable modules back to the open-source community.
- • Strengthen our security posture by embedding guardrails into every layer of the stack. You will manage IAM policies, network segmentation, container image scanning, and compliance evidence collection for SOC 2 and ISO 27001 audits.
- • Mentor junior engineers and create runbooks, architectural decision records (ADRs), and internal tech talks that level up the entire organization. You will foster a culture where operational excellence is everyone’s responsibility.
- • Collaborate with customer success and support teams to triage and resolve high-severity incidents, translating customer pain into systemic fixes. You will join a lightweight on-call rotation that respects work-life balance while keeping the platform resilient.
- • Experiment with emerging technologies—service mesh, eBPF, chaos engineering—and run proof-of-concepts that inform Ada’s long-term infrastructure roadmap. You will present findings in weekly engineering demos and quarterly planning sessions.
- • Drive cost-optimization initiatives that save six-figure cloud spend without sacrificing performance. You will analyze usage patterns, negotiate reserved capacity, and implement intelligent auto-scaling that aligns spend with business growth.
- • Contribute to Ada’s remote-first culture by participating in virtual coffee chats, cross-team guilds, and annual off-sites. You will help shape engineering rituals that keep us connected, inclusive, and inspired across time zones.
Skills & Technologies
About Ada Support Inc.
Ada Support is a Canadian company that provides an AI-driven customer service automation platform. Its technology enables businesses to deploy AI agents across chat, voice, email, and messaging channels to autonomously resolve many customer inquiries. The platform integrates with existing tools, maintains security and compliance standards (e.g. SOC 2, GDPR), and offers analytics for feedback and continuous improvement. Ada’s systems are built to reduce support costs and scale customer interactions without expanding human support teams.



