
Job Overview
Location
San Francisco
Job Type
Full-time
Category
Engineering Manager
Date Posted
May 10, 2026
Full Job Description
đź“‹ Description
- • As an Engineering Manager (Player & Coach) at BaseTen Inc., you will lead and mentor a team of Forward Deployed Engineers focused on building, scaling, and optimizing LLM inference workloads for mission-critical AI customers, directly contributing to the platform that powers companies like Cursor, Notion, and Writer.
- • Day to day, you will lead technical and managerial efforts including setting team goals, mentoring engineers, driving end-to-end customer engagements from problem framing to production deployment, and collaborating with product and infrastructure teams to ensure high-performance, low-latency AI applications.
- • You will join a rapidly growing AI infrastructure company that recently raised a $300M Series E, backed by top-tier investors, and operates at the frontier of applied AI research and developer tooling for generative AI systems.
- • In this role, you will develop deep expertise in LLM serving, inference optimization, and production ML systems while growing as a technical leader who shapes both team outcomes and product roadmap through hands-on engineering and strategic customer partnerships.
🎯 Requirements
- • Bachelor’s, Master’s, or Ph.D. in Computer Science, Engineering, or related field
- • 4+ years of professional software engineering experience, including 1+ year in a leadership or mentorship capacity
- • Strong programming skills in Python, with production experience in building or optimizing ML inference systems
- • Proven experience with LLMs, inference optimization, or serving frameworks (e.g., vLLM, TensorRT, Triton, Hugging Face, Ray Serve)
- • Familiarity with observability, profiling, and cost/performance tradeoffs in production ML systems
- • Excellent communication and collaboration skills—able to lead cross-functional efforts and drive outcomes in ambiguous, fast-paced environments
🏖️ Benefits
- • Competitive compensation, including meaningful equity
- • 100% coverage of medical, dental, and vision insurance for employee and dependents
- • Flexible PTO policy including company wide Winter Break (our offices are closed from Christmas Eve to New Year's Day!)
- • Paid parental leave
- • Fertility and family-building stipend through Carrot
- • Company-facilitated 401(k)
- • Exposure to a variety of ML startups, offering unparalleled learning and networking opportunities
Skills & Technologies
About BaseTen Inc.
BaseTen provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow and Hugging Face artifacts, and offers built-in observability, A/B testing and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, BaseTen targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

dLocal Limited
9 months ago

Coderio LLC
2 months ago

