
Job Overview
Location: San Francisco
Job Type: Full-time
Category: Software Engineering
Date Posted: April 27, 2026
Full Job Description
Description
- Software Engineer - Realtime Systems at BaseTen Inc. is a high-impact, high-ownership role focused on building and leading the company's in-house Voice AI inference stack, which powers real-time speech-to-text, text-to-speech, and voice agent workloads for mission-critical customer deployments.
- Day-to-day responsibilities include owning Voice AI product areas end-to-end, from architecture and system design through implementation, rollout, and long-term production operations; designing and operating real-time, large-scale, high-performance model serving systems with clear SLOs; driving cross-team collaboration with Forward Deployed Engineers, Model Performance Engineers, and sister teams; and mentoring teammates through code reviews, design docs, and technical leadership.
- BaseTen powers mission-critical AI inference for leading companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer, uniting applied AI research, flexible infrastructure, and seamless developer tooling to enable cutting-edge models in production. The company is backed by a $300M Series E from top-tier investors including BOND, IVP, Spark Capital, Greylock, and Conviction.
- In this role, you will make a meaningful impact on industries such as productivity, customer service, clinical conversation, creator tools, and education by bringing state-of-the-art open-source Voice AI models into production, pushing the boundaries of real-time voice systems, and shaping the platform that engineers rely on to ship AI products.
About BaseTen Inc.
BaseTen provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow and Hugging Face artifacts, and offers built-in observability, A/B testing and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, BaseTen targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.