BaseTen Inc. logo

AI Solutions Engineer

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Software Engineering

Date Posted

April 22, 2026

Full Job Description

đź“‹ Description

  • • As an AI Solutions Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten’s platform, owning the journey from initial exploration to production deployment by translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes.
  • • Day to day, you will develop and maintain software systems using general-purpose programming languages (with preference for Python), drive customer impact by designing and deploying end-to-end Baseten solutions, deliver with velocity by turning vague objectives into clear specs and PoCs, optimize AI/ML projects, own products and customer projects end-to-end as engineer/project/product manager, navigate ambiguity with sound judgment on tradeoffs, and demonstrate pride and accountability in your work.
  • • Baseten powers mission-critical inference for dynamic AI companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer by uniting applied AI research, flexible infrastructure, and seamless developer tooling to enable cutting-edge models in production. The company recently raised a $300M Series E backed by top-tier investors and is growing rapidly.
  • • In this role, you will gain a front-row view into how modern companies adopt AI at scale, develop deep expertise in AI/ML deployment and optimization, strengthen cross-functional collaboration skills across product and engineering, and grow into a trusted technical advisor who shapes both customer outcomes and Baseten’s platform evolution.

Skills & Technologies

Python
Docker
Apache Spark
Onsite
Degree Required

Ready to Apply?

You will be redirected to an external site to apply.

BaseTen Inc. logo
BaseTen Inc.
Visit Website

About BaseTen Inc.

BaseTen provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow and Hugging Face artifacts, and offers built-in observability, A/B testing and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, BaseTen targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Promise Holdings, Inc. logo

Promise Holdings, Inc.

San Francisco
Full-time
Expires May 22, 2026
Go
Hybrid

1 month ago

Apply
Denver
Full-time
Expires May 12, 2026
Onsite

1 month ago

Apply
Horizon Industries, Limited logo

Horizon Industries, Limited

REMOTE
Full-time
Expires May 15, 2026
PostgreSQL
MySQL
Remote
+1 more

1 month ago

Apply
❌ EXPIRED
Foxglove Technologies, Inc. logo

Foxglove Technologies, Inc.

San Francisco
Full-time
Expired Dec 9, 2025
JavaScript
TypeScript
Go
+4 more

6 months ago

Apply