
Job Overview
Location: Remote - Europe
Job Type: Full-time
Category: Software Engineering
Date Posted: June 22, 2026
Full Job Description
📋 Description
- Own the end-to-end production journey for customers transitioning from proof-of-concept to scalable AI deployments on Nebius infrastructure.
- Ensure customer AI workloads are deployed, stable, performant, and cost-efficient by monitoring latency, throughput, reliability, and resource utilization.
- Shorten time-to-production and time-to-value by coordinating with customer engineering teams to resolve technical blockers and optimize workflows.
- Act as the primary technical contact for production issues, coordinating cross-functionally with Solution Architects, Product, and Infrastructure teams to resolve incidents under pressure.
- Understand customer architectures and use cases to provide tailored guidance on best practices for inference optimization, model deployment, and infrastructure scaling.
- Proactively identify performance bottlenecks and cost inefficiencies in AI workloads and implement actionable solutions to improve outcomes.
- Translate complex technical challenges into clear, practical recommendations for customer teams and internal stakeholders.
- Provide structured feedback to Product and Infrastructure teams based on real-world customer usage to shape future platform improvements.
- Support customer expansion by demonstrating value through technical success, without engaging in direct sales activities.
- Maintain ownership of multiple customer accounts simultaneously, balancing priorities with a structured, proactive, and solution-oriented approach.
- Serve as a trusted technical partner to customer engineering and ML teams, fostering long-term relationships built on reliability and expertise.
- This role does not design systems from scratch; it optimizes and operationalizes existing customer architectures on the Nebius platform.
- This role does not handle routine support tickets; it focuses on strategic, high-impact technical enablement and production stability.
🎯 Requirements
- Practical knowledge of inference frameworks such as vLLM, TensorRT, or similar
- Solid understanding of cloud or infrastructure systems, distributed systems, or high-load applications
- Solid understanding of AI/ML workloads, including LLMs and inference
- Ability to troubleshoot and reason about system performance
- Experience working directly with technical customers (e.g., engineers, ML teams)
- Ability to communicate complex technical topics clearly and effectively
- Strong sense of ownership: driving outcomes, not just tasks
- Ability to manage multiple customers and priorities
- Structured, proactive, and solution-oriented mindset
🏖️ Benefits
- Competitive compensation
- Career growth and learning opportunities
- Flexibility and ownership
- Collaborative and innovative culture
- Opportunity to work on impactful AI projects
- International environment and talented teams
About Nebius Group N.V.
Nebius Group N.V. is a Netherlands-based technology company that operates a full-stack cloud platform designed for AI and machine learning workloads. It provides scalable GPU and CPU infrastructure, managed Kubernetes, object storage, and specialized AI services to enterprises and research organizations worldwide. The company was formed from the restructuring of Yandex N.V. and continues to serve global markets with data centers across Europe and North America.