BaseTen Inc. logo

Senior Software Engineer - New Products

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Software Engineer

Date Posted

February 26, 2026

Full Job Description

šŸ“‹ Description

  • • Join BaseTen Inc. as a Senior Software Engineer on our New Products team, a pivotal role where you will shape the future of AI infrastructure. BaseTen is at the forefront of powering mission-critical inference for the world's most dynamic AI companies, including industry leaders like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer. We are a rapidly growing company, recently securing a significant $300M Series E funding round, underscoring our strong market position and future potential. Our mission is to unite applied AI research, flexible infrastructure, and seamless developer tooling, enabling companies at the cutting edge of AI to bring their most advanced models into production. This is your opportunity to join a team that is building the platform engineers rely on to ship groundbreaking AI products.
  • • As a Senior Software Engineer on the New Products team, you will be instrumental in developing core platform capabilities that empower researchers, developers, and partners to ship and operate AI products at an unprecedented scale. This role is designed for an infrastructure-leaning, product-minded engineer who thrives on owning ambiguous problems from inception to completion. You will be responsible for shaping APIs and system designs, and ensuring their robust operation in production with clearly defined Service Level Objectives (SLOs). Your work will directly impact the efficiency, reliability, and scalability of AI deployments for our diverse clientele.
  • • Your responsibilities will span the entire product lifecycle. You will own and lead projects and product areas end-to-end, encompassing architecture, implementation, rollout, and long-term operational excellence. A key aspect of this role involves designing highly ergonomic, developer-friendly APIs and abstractions that simplify complex infrastructure capabilities. You will be building and operating reliable backend services that are critical to our platform's success, including systems for rate limiting, authentication, quotas, metering, and seamless migrations. Maintaining clear SLOs for these services will be paramount to ensuring high availability and performance.
  • • Furthermore, you will be a driving force behind performance and reliability improvements. This will involve deep dives into profiling, tracing, load testing, and meticulous capacity planning to ensure our platform can handle the demands of our rapidly growing user base and the evolving landscape of AI. Collaboration and knowledge sharing are core to our culture. You will mentor teammates through constructive code reviews, insightful design documentation, and by providing technical leadership. Your contributions will not only build essential infrastructure but also elevate the technical capabilities of the entire team.
  • • This role offers the chance to work on exciting initiatives such as developing Model APIs for frontier models, which are essential for deploying cutting-edge AI applications. You will also contribute to our innovative Model training platform, built specifically for production inference, enabling our users to train and deploy models efficiently and effectively. By joining BaseTen, you will be at the heart of AI innovation, working with a talented team dedicated to building the essential tools that power the next generation of artificial intelligence. If you are passionate about building scalable, reliable, and user-centric infrastructure, and eager to make a significant impact in the rapidly evolving field of AI, we encourage you to apply and help us build the future of AI products.
  • • This is a unique opportunity to influence the direction of a fast-growing company and contribute to a product that is defining the infrastructure for AI development. You will have the autonomy to make significant technical decisions and see your work directly translate into value for leading AI companies. We are looking for engineers who are not afraid to tackle complex challenges and who are excited by the prospect of building foundational technology for a transformative industry. Your work will be critical in enabling our customers to innovate faster and deploy their AI solutions with confidence and efficiency. We believe in empowering our engineers and providing them with the resources and support needed to succeed in a dynamic and fast-paced environment. Come, be a part of the BaseTen journey and help us build the future of AI inference.

Skills & Technologies

TypeScript
React
Kubernetes
Apache Spark
Senior
Onsite

Ready to Apply?

You will be redirected to an external site to apply.

BaseTen Inc. logo
BaseTen Inc.
Visit Website

About BaseTen Inc.

BaseTen provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow and Hugging Face artifacts, and offers built-in observability, A/B testing and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, BaseTen targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Argentina - Remote
Full-time
Expires May 4, 2026
Python
PHP
Ruby
+5 more

2 months ago

Apply
ā° EXPIRES SOON
Argentina
Full-time
Expires Apr 25, 2026 (Soon)
Python
JavaScript
TypeScript
+4 more

2 months ago

Apply
Colombia - Fully Remote
Full-time
Expires May 24, 2026
Python
JavaScript
TypeScript
+3 more

27 days ago

Apply
Colombia - Fully Remote
Part-time
Expires May 24, 2026
Python
JavaScript
TypeScript
+3 more

27 days ago

Apply