OpenAI, Inc. logo

Research Infrastructure Engineer, Training Systems

Job Overview

Location

San Francisco

Job Type

Full-time

Category

Data Engineer

Date Posted

April 28, 2026

Full Job Description

📋 Description

  • Research Infrastructure Engineer, Training Systems at OpenAI, Inc. in San Francisco focuses on building and maintaining the systems layer that turns novel machine learning research ideas into runnable, measurable training workloads for large models, directly enabling model releases and research progress.
  • Day-to-day responsibilities include building and maintaining infrastructure for large-scale model training and experimentation, designing APIs and interfaces to simplify complex training workflows, improving reliability, debuggability, and performance across training and data pipelines, debugging issues across Python, PyTorch, distributed systems, GPUs, networking, and storage, and writing tests, benchmarks, and diagnostics to catch meaningful regressions.
  • The team works on research and systems that advance frontier models, often going beyond standard training recipes by building infrastructure to make new training approaches practical at scale, where systems work is directly tied to research progress through better tools, abstractions, and runtimes.
  • In this role, you can learn and achieve impact by enabling new model training approaches through systems innovation, deepening expertise in ML infrastructure and distributed systems, contributing to reliable and performant training pipelines, and working at the intersection of research and production engineering on cutting-edge AI systems.

Skills & Technologies

Python
Express
PyTorch
DevOps
Onsite

Ready to Apply?

You will be redirected to an external site to apply.

OpenAI, Inc. logo
OpenAI, Inc.
Visit Website

About OpenAI, Inc.

OpenAI is a San Francisco-based artificial intelligence research and deployment company founded in 2015. It develops large-scale AI models such as GPT, DALL-E, and Codex, providing cloud APIs and consumer applications like ChatGPT. Originally established as a non-profit, it later created a capped-profit subsidiary to attract capital while maintaining its mission to ensure artificial general intelligence benefits all of humanity.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
Dallas, TX
Full-time
Expired May 18, 2026
Python
Azure
Onsite

3 months ago

Apply
Remote Nationwide
Full-time
Expires Jul 26, 2026
Python
Senior
Remote
+2 more

11 days ago

Apply
Expired
Remote
Full-time
Expired Apr 13, 2026
Senior
Remote

4 months ago

Apply
Expired
Scale Army Careers logo

Scale Army Careers

Remote
Full-time
Expired Apr 13, 2026
Python
Pandas
Remote

4 months ago

Apply