This job has expired

This position was posted on May 16, 2026 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Head of Measurement & Performance Intelligence

OpenAI, Inc.

Job Overview

Location

Remote - US

Job Type

Full-time

Full Job Description

📋 Description

• Lead the creation and growth of OpenAI’s Measurement & Performance Intelligence function to evaluate and optimize the performance of industrial compute platforms powering AI research and products.
• Define and standardize performance metrics, benchmarking methodologies, and acceptance criteria for both next-generation platform development (NPI) and production environments across hardware, software, networking, and storage systems.
• Partner with training, inference, infrastructure, and model-serving teams to analyze real-world workload behavior and identify opportunities for platform optimization at scale.
• Translate complex performance data into actionable, strategic recommendations for system architecture, platform quality, workload placement, and capacity allocation decisions.
• Drive continuous performance validation throughout the lifecycle of compute infrastructure, ensuring systems meet evolving efficiency and reliability targets under real AI workloads.
• Collaborate with external hardware and platform vendors to assess system performance, influence roadmaps, and improve the reliability and efficiency of third-party components integrated into OpenAI’s infrastructure.
• Build scalable processes, automated tooling, and reporting frameworks to enable consistent, data-driven performance analysis across OpenAI’s global infrastructure fleet.
• Establish cross-functional governance for performance evaluation, ensuring alignment between engineering teams, product leaders, and executive stakeholders on platform priorities and trade-offs.
• Own the end-to-end performance validation process for new compute hardware and software releases, from initial lab testing through full-scale production deployment.
• Advocate for performance-centric design principles across infrastructure teams, embedding measurement and optimization as core requirements in platform development cycles.
• Communicate technical performance insights clearly to both engineering and non-technical leadership to inform strategic decisions on capacity planning, vendor selection, and infrastructure investment.
• Monitor and respond to performance anomalies in production systems, leading root-cause analyses and coordinating cross-team remediation efforts to minimize impact on AI training and inference workloads.
• Design and maintain performance baselines that reflect real-world AI model behavior, ensuring benchmarks are representative of actual usage patterns rather than synthetic workloads.
• Influence long-term infrastructure strategy by identifying systemic performance bottlenecks and proposing architectural changes that improve efficiency, scalability, and cost-effectiveness.
• Mentor and grow a high-performing team of performance engineers and analysts, setting clear goals, fostering technical excellence, and enabling career development within the function.
• Serve as the primary authority on platform performance within OpenAI, acting as the trusted advisor to senior engineering and operational leadership on compute infrastructure optimization.
• Ensure all measurement practices comply with OpenAI’s standards for data integrity, reproducibility, and transparency in performance evaluation.
• Represent OpenAI in industry discussions and partnerships related to AI infrastructure performance, benchmarking standards, and next-generation compute evaluation.

🎯 Requirements

• Experience leading performance engineering, benchmarking, or infrastructure optimization teams at large scale
• Strong technical depth across distributed systems, AI/ML infrastructure, HPC, datacenter systems, or large-scale compute platforms
• Proven ability to define performance metrics and validation frameworks for complex hardware or infrastructure systems
• Comfort operating across hardware, software, and infrastructure boundaries to drive measurable platform improvements
• Ability to translate highly technical performance data into clear business, operational, and strategic recommendations
• Strong communication and leadership skills with experience influencing senior technical and business stakeholders internally and externally

🏖️ Benefits

• Hybrid work model of 3 days in the office per week based in San Francisco, CA
• Relocation assistance for new employees
• Opportunity to shape the future of next-generation AI infrastructure at OpenAI
• Access to cutting-edge compute systems and AI research environments

Skills & Technologies

Remote

Ready to Apply?

Apply Externally

You will be redirected to an external site to apply.

AI Job Fit Analysis

Pro

See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.

OpenAI, Inc.

Visit Website

About OpenAI, Inc.

OpenAI is a San Francisco-based artificial intelligence research and deployment company founded in 2015. It develops large-scale AI models such as GPT, DALL-E, and Codex, providing cloud APIs and consumer applications like ChatGPT. Originally established as a non-profit, it later created a capped-profit subsidiary to attract capital while maintaining its mission to ensure artificial general intelligence benefits all of humanity.

View Company Profile

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.