OpenAI, Inc. logo

Head of Measurement & Performance Intelligence

Job Overview

Location

Remote - US

Job Type

Full-time

Category

Engineering Manager

Date Posted

May 16, 2026

Full Job Description

đź“‹ Description

  • • Lead the creation and growth of OpenAI’s Measurement & Performance Intelligence function to evaluate and optimize the performance of industrial compute platforms powering AI research and products.
  • • Define and standardize performance metrics, benchmarking methodologies, and acceptance criteria for both next-generation platform development (NPI) and production environments across hardware, software, networking, and storage systems.
  • • Partner with training, inference, infrastructure, and model-serving teams to analyze real-world workload behavior and identify opportunities for platform optimization at scale.
  • • Translate complex performance data into actionable, strategic recommendations for system architecture, platform quality, workload placement, and capacity allocation decisions.
  • • Drive continuous performance validation throughout the lifecycle of compute infrastructure, ensuring systems meet evolving efficiency and reliability targets under real AI workloads.
  • • Collaborate with external hardware and platform vendors to assess system performance, influence roadmaps, and improve the reliability and efficiency of third-party components integrated into OpenAI’s infrastructure.
  • • Build scalable processes, automated tooling, and reporting frameworks to enable consistent, data-driven performance analysis across OpenAI’s global infrastructure fleet.
  • • Establish cross-functional governance for performance evaluation, ensuring alignment between engineering teams, product leaders, and executive stakeholders on platform priorities and trade-offs.
  • • Own the end-to-end performance validation process for new compute hardware and software releases, from initial lab testing through full-scale production deployment.
  • • Advocate for performance-centric design principles across infrastructure teams, embedding measurement and optimization as core requirements in platform development cycles.
  • • Communicate technical performance insights clearly to both engineering and non-technical leadership to inform strategic decisions on capacity planning, vendor selection, and infrastructure investment.
  • • Monitor and respond to performance anomalies in production systems, leading root-cause analyses and coordinating cross-team remediation efforts to minimize impact on AI training and inference workloads.
  • • Design and maintain performance baselines that reflect real-world AI model behavior, ensuring benchmarks are representative of actual usage patterns rather than synthetic workloads.
  • • Influence long-term infrastructure strategy by identifying systemic performance bottlenecks and proposing architectural changes that improve efficiency, scalability, and cost-effectiveness.
  • • Mentor and grow a high-performing team of performance engineers and analysts, setting clear goals, fostering technical excellence, and enabling career development within the function.
  • • Serve as the primary authority on platform performance within OpenAI, acting as the trusted advisor to senior engineering and operational leadership on compute infrastructure optimization.
  • • Ensure all measurement practices comply with OpenAI’s standards for data integrity, reproducibility, and transparency in performance evaluation.
  • • Represent OpenAI in industry discussions and partnerships related to AI infrastructure performance, benchmarking standards, and next-generation compute evaluation.

🎯 Requirements

  • • Experience leading performance engineering, benchmarking, or infrastructure optimization teams at large scale
  • • Strong technical depth across distributed systems, AI/ML infrastructure, HPC, datacenter systems, or large-scale compute platforms
  • • Proven ability to define performance metrics and validation frameworks for complex hardware or infrastructure systems
  • • Comfort operating across hardware, software, and infrastructure boundaries to drive measurable platform improvements
  • • Ability to translate highly technical performance data into clear business, operational, and strategic recommendations
  • • Strong communication and leadership skills with experience influencing senior technical and business stakeholders internally and externally

🏖️ Benefits

  • • Hybrid work model of 3 days in the office per week based in San Francisco, CA
  • • Relocation assistance for new employees
  • • Opportunity to shape the future of next-generation AI infrastructure at OpenAI
  • • Access to cutting-edge compute systems and AI research environments

Skills & Technologies

Remote

Ready to Apply?

You will be redirected to an external site to apply.

OpenAI, Inc. logo
OpenAI, Inc.
Visit Website

About OpenAI, Inc.

OpenAI is a San Francisco-based artificial intelligence research and deployment company founded in 2015. It develops large-scale AI models such as GPT, DALL-E, and Codex, providing cloud APIs and consumer applications like ChatGPT. Originally established as a non-profit, it later created a capped-profit subsidiary to attract capital while maintaining its mission to ensure artificial general intelligence benefits all of humanity.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
Dubai
Full-time
Expired Apr 28, 2026
Onsite

3 months ago

Apply
Expired
Argentina (Remote)
Full-time
Expired Nov 22, 2025
JavaScript
TypeScript
Java
+4 more

9 months ago

Apply
Expired
Buenos Aires
Full-time
Expired Jun 6, 2026
Python
JavaScript
Java
+4 more

2 months ago

Apply
Expired
Berlin, Germany
Full-time
Expired May 10, 2026
Go
Senior
Remote

3 months ago

Apply