
Job Overview
Location: San Francisco
Job Type: Full-time
Category: Software Engineering
Date Posted: April 22, 2026
Full Job Description
Description
- As an AI Solutions Engineer at Baseten, you will partner directly with customers to architect, build, and deploy high-scale production AI applications on Baseten's platform. You will own the journey from initial exploration to production deployment, translating ambiguous business goals into reliable, observable services with clear quality, latency, and cost outcomes.
- Day to day, you will develop and maintain software systems in general-purpose programming languages (Python preferred), design and deploy end-to-end Baseten solutions that drive customer impact, and deliver with velocity by turning vague objectives into clear specs and proofs of concept (PoCs). You will also optimize AI/ML projects, own products and customer projects end to end as engineer, project manager, and product manager, navigate ambiguity with sound judgment on tradeoffs, and take pride in and accountability for your work.
- Baseten powers mission-critical inference for dynamic AI companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer by uniting applied AI research, flexible infrastructure, and seamless developer tooling to enable cutting-edge models in production. The company recently raised a $300M Series E backed by top-tier investors and is growing rapidly.
- In this role, you will gain a front-row view into how modern companies adopt AI at scale, develop deep expertise in AI/ML deployment and optimization, strengthen cross-functional collaboration skills across product and engineering, and grow into a trusted technical advisor who shapes both customer outcomes and Baseten's platform evolution.
About Baseten Inc.
Baseten provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale, and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow, and Hugging Face artifacts, and offers built-in observability, A/B testing, and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, Baseten targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.

