
Job Overview
Location
Seoul, South Korea
Job Type
Full-time
Category
Machine Learning Engineer
Date Posted
April 13, 2026
Full Job Description
đź“‹ Description
- • As a Staff Machine Learning Engineer on the Pegasus team at Twelve Labs Inc., you will drive technical direction for ML engineering while remaining deeply hands-on in critical system design and implementation for the company’s Video Analysis product, which is central to its mission of advancing multimodal foundation models for human-like video understanding.
- • You will own the design and evolution of production ML systems focused on scalability, reliability, performance, and fast iteration; lead technical decisions across model deployment, inference architecture, metadata systems, and ML infrastructure for Video Language Models (VLMs); improve and automate the end-to-end ML lifecycle to accelerate research-to-product translation; mentor engineers; and explore AI-assisted development tools to boost team productivity.
- • The Pegasus team is a core, goal-oriented, cross-functional unit at Twelve Labs responsible for shipping real-world video analysis products, not isolated research, with access to cutting-edge hardware like NVIDIA B300s GPUs and a global presence spanning San Francisco and Seoul, backed by over $110M in funding from top-tier investors including NVIDIA’s NVentures, NEA, and Index Ventures.
- • In this role, you will deepen your expertise in large-scale multimodal ML systems, production ML infrastructure, and AI-assisted development practices while mentoring teammates and shaping the technical foundation of a high-impact AI product used by global B2B customers.
Skills & Technologies
About Twelve Labs Inc.
Twelve Labs builds multimodal video understanding AI. Its cloud platform transforms long-form video into vector embeddings that capture visual, audio, speech and contextual information, enabling semantic search, summarization, chaptering, moderation and analytics through a single API. Developers upload video, index it, then query in natural language or image to retrieve exact moments, generate highlights or detect unwanted content. Models are pretrained on large-scale web video, continually fine-tuned for accuracy and latency, and deployable on dedicated GPU clusters for enterprise security. Founded in 2021, the San Francisco company serves media, ed-tech, safety and e-commerce customers worldwide.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Heidi Health Pty Ltd
2 months ago

Heidi Health Pty Ltd
2 months ago

