
Principal AI Research Scientist Post-Training · Alignment · Reinforcement Learning Autodesk AI Lab: London · San Francisco · Toronto · Remote (US/CA/EU
Job Overview
Location
7 Locations
Job Type
Full-time
Category
Data Science
Date Posted
May 27, 2026
Full Job Description
📋 Description
- • Lead post-training research for foundation models with a focus on reinforcement learning, preference optimization, RLHF, RLAIF, DPO, and PPO methodologies
- • Develop novel algorithms to enhance model reliability, controllability, and alignment using domain-specific verifiers grounded in physics simulation and engineering constraints
- • Design and execute experiments that evaluate and shape model behavior, robustness, and long-horizon reasoning capabilities in professional workflows
- • Create evaluation frameworks for agentic behavior, tool use, safety, and real-world workflow completion tied to Autodesk’s architecture, engineering, construction, manufacturing, and media & entertainment domains
- • Lead rigorous model analysis and interpretability efforts to understand and mitigate alignment failures and unintended behaviors
- • Partner with infrastructure teams to build scalable, reproducible post-training workflows that integrate with existing computational design and simulation tools
- • Establish model readiness criteria and provide go/no-go recommendations for production releases based on empirical evaluation and risk assessment
- • Drive human-in-the-loop evaluation with high-quality annotation protocols and scientifically rigorous methodologies
- • Contribute to peer-reviewed publications at top venues including NeurIPS, ICML, ICLR, CVPR, and SIGGRAPH, and pursue patents to extend Autodesk’s research visibility
- • Communicate technical risks, limitations, and trade-offs clearly to engineering, product, and executive leadership
- • Mentor and lead technical research teams in academic, industry, or lab settings to advance alignment and post-training research
- • Make principled architectural decisions on whether to address model challenges at the pre-training, post-training, or system level
- • Maintain deep fluency in alignment research, preference learning, and agentic AI systems
- • Deploy and support production AI systems with awareness of large-scale training infrastructure and compute trade-offs
🎯 Requirements
- • Deep hands-on expertise in reinforcement learning for foundation models and fluency with post-training methods (RLHF, RLAIF, DPO, PPO, or adjacent approaches)
- • PhD or equivalent depth of industry research experience in ML, RL, AI, or a related field
- • Proven experience leading or mentoring technical research teams in academic, industry, or AI lab settings
- • Strong publication record at leading ML or AI venues (e.g., NeurIPS, ICML, ICLR, CVPR, SIGGRAPH)
- • Experience in alignment research, preference learning, or agentic AI
- • Experience deploying or supporting production AI systems
🏖️ Benefits
- • Opportunity to publish at top-tier AI/ML conferences and pursue patents
- • Direct line from research to product impact at scale within Autodesk’s core domains
- • Collaboration with leading academic and industry research labs
- • Work in a domain with unique assets: high-fidelity physics simulation engines and CAD kernels as reward signals
Skills & Technologies
About Autodesk, Inc.
Autodesk, Inc. develops professional design, engineering, and entertainment software. Its flagship products include AutoCAD, Revit, Fusion 360, Maya, and 3ds Max, serving architecture, engineering, construction, manufacturing, media, and education sectors worldwide. Founded in 1982, the company provides cloud-based subscription services enabling digital design, simulation, visualization, and collaboration across project lifecycles. Headquartered in San Rafael, California, Autodesk operates globally, empowering customers to create sustainable infrastructure, products, and digital content through integrated software platforms and emerging technologies such as generative design, additive manufacturing, and building information modeling.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Analytic Partners, Inc.
2 months ago

Beghou Consulting Group LLC
1 month ago

FundraiseUp Inc.
3 months ago
