
Job Overview
Location
San Francisco
Job Type
Full-time
Category
Product Management
Date Posted
April 3, 2026
Full Job Description
📋 Description
- As a Senior ML Data Project Manager at Twelve Labs Inc., you will play a critical role in advancing the company’s mission to revolutionize video understanding through multimodal foundation models by leading end-to-end data operations for video-language datasets that directly train and evaluate state-of-the-art AI models.
- Your work will ensure the quality, scalability, and efficiency of data pipelines that power cutting-edge research and product development, making you a key enabler of Twelve Labs’ technological leadership in multimodal AI.
- You will design and execute video-language data collection and labeling projects from scoping to delivery, defining dataset requirements in collaboration with research and product teams while balancing speed, cost, and annotation quality to meet aggressive model training timelines.
- You will build and optimize data pipelines using Python-based tools (pandas, Jupyter) and cloud storage systems (Amazon S3, Oracle Cloud), automating repetitive tasks such as data validation, format conversion, and metadata tagging to reduce manual effort and increase throughput.
- You will establish and refine labeling guidelines, monitor data quality through statistical analysis and error tracking, and partner with vendor teams to resolve ambiguities, ensuring consistency and reliability across large-scale multimodal datasets.
- You will collaborate closely with external annotation vendors and outsourcing partners, managing contracts, SLAs, and feedback loops to maintain high standards while scaling operations globally, including time-zone-coordinated workflows with APAC teams.
- You will partner with internal Engineering and AI Model teams to align on evolving data needs, translate model performance gaps into actionable data improvement plans, and co-design analytical dashboards and reporting tools in Notion and Linear to visualize project health, labeler performance, and data drift.
- You will manage resource allocation, timelines, and risk across multiple concurrent data streams, adjusting priorities dynamically based on research milestones, model iteration cycles, and emerging data quality issues.
- You will drive continuous improvement in data tooling and infrastructure by evaluating and implementing new labeling systems, automation scripts, and quality control frameworks, reducing reliance on manual processes and increasing reproducibility.
- You will mentor junior team members and foster a culture of data excellence, advocating for best practices in annotation design, bias mitigation, and ethical data sourcing in alignment with Twelve Labs’ inclusive and innovative values.
- You will gain deep expertise in multimodal AI data lifecycles, from raw video ingestion to model-ready datasets, positioning yourself at the forefront of a rapidly evolving field where data quality directly determines model breakthroughs.
- You will have the opportunity to shape the foundational data strategy for a well-funded, high-impact AI startup backed by NVIDIA, NEA, and industry pioneers, working on problems that push the boundaries of how machines perceive and understand video content.
🎯 Requirements
- 5+ years of experience in AI-focused data operations, including designing and executing large-scale data collection, labeling, and post-processing projects for machine learning applications.
- Proficiency with Python for data automation and analysis, particularly using pandas and Jupyter notebooks, to build scalable data pipelines and validation workflows.
- Strong project management and communication skills, with proven ability to coordinate cross-functional teams (research, engineering, vendors) and manage multiple concurrent projects under tight deadlines.
- Foundational understanding of large language models (LLMs), vision-language models (VLMs), and multimodal AI, with interest in how data quality impacts model performance.
- Experience working with third-party SDKs, cloud storage platforms (e.g., Amazon S3, Oracle Cloud), and data visualization tools to support tooling advancement and infrastructure improvements.
- Background in video-centric domains (e.g., sports, advertising, content creation) or multimodal language model data collection is a strong plus.
🏖️ Benefits
- Full health, dental, and vision benefits to support your well-being and that of your dependents.
- Flexible PTO and parental leave policy, including office closure during the week of Christmas and New Year's for extended rest and recharge.
- Visa sponsorship support (such as H-1B and OPT transfers) for eligible U.S.-based employees, enabling global talent to contribute to the mission.
- Opportunity to work closely with a mission-driven, collaborative team on cutting-edge multimodal AI technology backed by top-tier investors and AI pioneers.
- Open and inclusive culture that values diverse backgrounds, experiences, and perspectives as drivers of innovation and continuous learning.
About Twelve Labs Inc.
Twelve Labs builds multimodal video understanding AI. Its cloud platform transforms long-form video into vector embeddings that capture visual, audio, speech, and contextual information, enabling semantic search, summarization, chaptering, moderation, and analytics through a single API. Developers upload video, index it, then query it in natural language or by image to retrieve exact moments, generate highlights, or detect unwanted content. Models are pretrained on large-scale web video, continually fine-tuned for accuracy and latency, and deployable on dedicated GPU clusters for enterprise security. Founded in 2021, the San Francisco company serves media, ed-tech, safety, and e-commerce customers worldwide.