BaseTen Inc. logo

Capacity and Infrastructure Lead

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Data & Analytics

Date Posted

May 12, 2026

Full Job Description

📋 Description

  • As a Capacity and Infrastructure Analytics Lead at BaseTen Inc., you will build the analytics foundation for tracking infrastructure usage, capacity, and cloud spend across the AI inference platform, enabling data-driven decisions that optimize cost and performance for mission-critical AI workloads.
  • You will build and maintain dashboards tracking cloud cost, usage, capacity, and utilization; ingest and model billing and usage data from multiple cloud providers; create canonical data models; partner with Infrastructure Engineering and Finance to define core metrics; support forecasting and planning; develop alerting for anomalies; and ensure data reliability through testing, documentation, and clear ownership.
  • You will join a fast-growing AI infrastructure company powering mission-critical inference for dynamic AI companies like Cursor, Notion, OpenEvidence, Abridge, Clay, Gamma, and Writer, backed by top-tier investors including BOND, IVP, Spark Capital, Greylock, and Conviction.
  • You will develop deep expertise in cloud cost analytics, infrastructure efficiency, and AI infrastructure economics while influencing cross-functional strategy and gaining visibility into how leading AI companies operate at scale.

🎯 Requirements

  • 5+ years of experience in analytics, BI, infrastructure analytics, cloud cost management, or a related role
  • Strong SQL skills, including experience writing complex transformations across disparate datasets
  • Experience building clean, reusable data models and semantic layers in dbt
  • Working knowledge of AWS Cost and Usage Reports, Google Cloud Billing Export, committed use discounts, savings plans, reservations, usage-based pricing, credits, and cloud cost allocation
  • Experience integrating raw data from APIs, cloud exports, vendor invoices, billing systems, or observability platforms with Python/SQL
  • Experience building dashboards and self-serve analytics in BI tools such as Sigma and Hex

🏖️ Benefits

  • Competitive compensation, including meaningful equity
  • 100% coverage of medical, dental, and vision insurance for employee and dependents
  • Flexible PTO policy including company wide Winter Break (offices closed from Christmas Eve to New Year's Day)
  • Paid parental leave
  • Fertility and family-building stipend through Carrot
  • Company-facilitated 401(k)

Skills & Technologies

Python
AWS
GCP
Apache Spark
DevOps
Senior
Onsite

Ready to Apply?

You will be redirected to an external site to apply.

BaseTen Inc. logo
BaseTen Inc.
Visit Website

About BaseTen Inc.

BaseTen provides a serverless, GPU-accelerated platform that lets machine-learning teams deploy, scale and monitor custom models behind autoscaling inference endpoints. The service abstracts infrastructure management, supports PyTorch, TensorFlow and Hugging Face artifacts, and offers built-in observability, A/B testing and fine-tuning. Customers integrate via REST or GraphQL APIs and pay only for compute used. Founded in 2019 and headquartered in San Francisco, BaseTen targets data scientists and product teams seeking production-grade ML serving without Kubernetes complexity.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Dallas, TX
Full-time
Expires Jul 6, 2026
Azure
Senior
Onsite

5 days ago

Apply
Argentina
Full-time
Expires Jul 5, 2026
Python
AWS
Terraform
+3 more

7 days ago

Apply
Caylent, Inc. logo

Caylent, Inc.

ARGENTINA
Full-time
Expires Jun 20, 2026
AWS
Senior
Remote

22 days ago

Apply
ARGENTINA
Full-time
Expires Jun 20, 2026
AWS
Senior
Remote
+1 more

22 days ago

Apply