Nebius Group N.V. logo

Senior ML Solutions Architect - Token Factory

Job Overview

Location

Remote - Europe

Job Type

Full-time

Category

Machine Learning Engineer

Date Posted

March 17, 2026

Full Job Description

đź“‹ Description

  • • Join Nebius, a pioneering force in cloud computing dedicated to empowering the global AI economy. We provide cutting-edge tools and infrastructure that enable our clients to tackle complex real-world challenges and drive industry transformation, all without the burden of massive infrastructure costs or the necessity of building extensive in-house AI/ML teams. At Nebius, you will be at the forefront of AI cloud infrastructure, collaborating with some of the most experienced and innovative leaders and engineers in the field.
  • • As a Senior ML Solutions Architect, you will play a pivotal role in supporting our customers as they leverage the Nebius Token Factory's advanced serverless inference platform. This platform is designed to handle open-source Large Language Models (LLMs) across multiple modalities, including text, vision, and audio. Your expertise will be crucial in guiding clients through the design and implementation of bespoke LLM-based solutions and architecting scalable AI applications that utilize our served models.
  • • A key aspect of this role involves close collaboration with our backend engineering team. You will provide invaluable feedback and insights derived from customer interactions to help refine and enhance our platform, ensuring it continuously meets and exceeds evolving client needs and industry demands.
  • • This is a fully remote position, offering you the flexibility to work from anywhere within Europe, allowing for a healthy work-life balance and global collaboration.
  • • Your core responsibilities will encompass the end-to-end design and implementation of sophisticated LLM-based solutions. You will utilize Nebius Token Factory’s state-of-the-art inference services to directly drive tangible business value and achieve critical customer objectives.
  • • You will be instrumental in building robust, production-ready applications. This involves expertly leveraging our serverless LLM APIs, which support a diverse range of models, including advanced multimodal capabilities and specialized domain-specific models tailored for unique industry applications.
  • • A significant part of your contribution will be providing deep technical expertise in critical areas such as prompt engineering, designing effective Retrieval-Augmented Generation (RAG) architectures, selecting the most appropriate models for specific use cases, and optimizing inference performance for maximum efficiency and speed.
  • • You will act as a crucial bridge between our customers and our internal teams. By actively collaborating with product and engineering departments, you will effectively surface valuable customer feedback, insights, and emerging trends, directly influencing and shaping the future roadmap of our platform and its capabilities.
  • • A vital function of this role is guiding our customers through the entire lifecycle of their AI projects, from initial Proof of Concept (POC) stages through to full-scale production deployment. Your focus will be on ensuring solutions are not only effective but also highly scalable, reliable, and cost-efficient, maximizing the return on investment for our clients.
  • • You will be empowered to architect and deploy cutting-edge AI solutions, pushing the boundaries of what's possible with LLMs and generative AI. This includes exploring and implementing novel approaches to model interaction, data processing, and application integration.
  • • By staying abreast of the latest advancements in the AI and ML landscape, you will proactively identify opportunities to integrate new technologies and methodologies into our offerings, ensuring Nebius remains at the forefront of cloud AI innovation.
  • • You will contribute to a culture of continuous learning and knowledge sharing, mentoring junior team members and contributing to internal best practices for ML solution architecture and deployment.

Skills & Technologies

Python
Flask
FastAPI
AWS
Azure
Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

Nebius Group N.V. logo
Nebius Group N.V.
Visit Website

About Nebius Group N.V.

Nebius Group N.V. is a Netherlands-based technology company that operates a full-stack cloud platform designed for AI and machine learning workloads. It provides scalable GPU and CPU infrastructure, managed Kubernetes, object storage, and specialized AI services to enterprises and research organizations worldwide. The company was formed from the restructuring of Yandex N.V. and continues to serve global markets with data centers across Europe and North America.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

ARGENTINA
Full-time
Expires Jun 20, 2026
AWS
Terraform
TensorFlow
+4 more

12 hours ago

Apply
Melbourne
Full-time
Expires May 15, 2026
Python
Kubernetes
PyTorch
+4 more

1 month ago

Apply
Heidi Health Pty Ltd logo

Heidi Health Pty Ltd

Melbourne
Full-time
Expires May 15, 2026
Python
Go
TensorFlow
+4 more

1 month ago

Apply
FundraiseUp Inc. logo

FundraiseUp Inc.

Portugal - Remote
Full-time
Expires May 23, 2026
Python
FastAPI
MongoDB
+4 more

29 days ago

Apply