BaseTen Inc. logo

Applied AI Inference Engineer

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Software Engineering

Date Posted

April 22, 2026

Full Job Description

đź“‹ Description

  • • As an Applied AI Inference Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten’s platform, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes.
  • • You will develop and maintain software systems using general-purpose programming languages (with preference for Python), drive customer impact by designing and deploying end-to-end solutions, optimize AI/ML projects, and own products and customer projects from problem framing to production deployment and monitoring.
  • • Baseten powers mission-critical inference for leading AI companies like Cursor, Notion, and OpenEvidence, uniting applied AI research, flexible infrastructure, and seamless developer tooling to enable cutting-edge models in production, backed by a $300M Series E from top-tier investors.
  • • You will gain hands-on experience across product, software development, performance engineering, and customer-facing implementations, functioning as an engineer, project manager, and product manager while working with frontier AI companies and contributing to a rapidly growing platform shaping the future of AI deployment.

Skills & Technologies

Python
Docker
Apache Spark
Onsite
Degree Required

Ready to Apply?

You will be redirected to an external site to apply.

BaseTen Inc. logo
BaseTen Inc.
Visit Website

About BaseTen Inc.

BaseTen provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow and Hugging Face artifacts, and offers built-in observability, A/B testing and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, BaseTen targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Work From Home – PH
Full-time
Expires May 30, 2026
Spring
Onsite
Degree Required

22 days ago

Apply
⏰ EXPIRES SOON
Brazil
Full-time
Expires Apr 27, 2026 (Soon)
Python
JavaScript
TypeScript
+3 more

2 months ago

Apply
US - Seattle
Full-time
Expires Jun 8, 2026
Python
JavaScript
Java
+5 more

13 days ago

Apply
❌ EXPIRED
Haast Technologies Inc. logo

Haast Technologies Inc.

Manilla
Full-time
Expired Apr 9, 2026
Remote
Degree Required

2 months ago

Apply