This job has expired

This position was posted on February 22, 2026 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

AI Platform Engineer

Fastino AI Inc.

Job Overview

Location

Remote

Job Type

Full-time

Full Job Description

📋 Description

• Join Fastino AI, a rapidly growing and well-funded startup at the forefront of developing the next generation of Large Language Models (LLMs). Our mission is to create specialized, efficient AI, building upon the success of our GLiNER family of open-source models, which have garnered over 5 million downloads and are utilized by industry giants like NVIDIA, Meta, and Airbnb. With a recent $25M seed funding round backed by prominent investors including Microsoft, Khosla Ventures, Insight Partners, and key tech leaders, we are poised for significant expansion and innovation.
• As an AI Platform Engineer, you will play a pivotal role in owning and evolving Fastino's end-to-end model platform. This is a foundational, systems-level engineering position, distinct from feature development, where you will be instrumental in shaping the infrastructure that powers our AI advancements. Your responsibilities will span the entire lifecycle of model development and deployment, ensuring robustness, scalability, and efficiency.
• You will be responsible for designing and building robust training pipelines, enabling the efficient development of our cutting-edge AI models. This includes architecting distributed fine-tuning workflows tailored for both small encoder and decoder models, incorporating advanced techniques such as LoRA, adapters, distillation, and compression to maximize performance and resource utilization.
• A key aspect of your role will involve establishing comprehensive experiment tracking, ensuring reproducibility, and implementing sophisticated dataset versioning systems. This will provide a solid foundation for iterative development and rigorous scientific inquiry, allowing us to manage and understand the vast complexity of our AI research.
• You will focus on optimizing training efficiency across multiple dimensions, including GPU utilization, memory management, throughput, and cost-effectiveness. This optimization is critical for maintaining a competitive edge and enabling rapid iteration in a resource-intensive field.
• Furthermore, you will design and implement scalable Reinforcement Learning (RL) training workflows, encompassing policy optimization and reward modeling. This includes integrating RL methodologies seamlessly with supervised fine-tuning and distillation processes, unlocking new capabilities and performance enhancements for our models.
• Building robust evaluation loops and automated regression detection mechanisms will be a core responsibility, ensuring the quality and reliability of our models throughout the development cycle. This proactive approach to quality assurance is paramount for production-ready AI systems.
• You will architect and build scalable data ingestion pipelines, capable of handling both structured and unstructured data sources. This involves designing efficient data curation, filtering, and quality enforcement systems to ensure the integrity and utility of the datasets used for training.
• Implementing reproducible data workflows that are tightly coupled with training runs will be essential for maintaining consistency and enabling seamless collaboration within the team.
• On the deployment front, you will architect low-latency inference services, ensuring our models can be served efficiently and effectively in production environments. This includes designing secure and reliable production deployment workflows, adhering to best practices for operational stability and performance.
• This role offers a unique opportunity to shape the core infrastructure of a leading AI research company, working with a world-class team and leveraging state-of-the-art technologies. You will have a direct impact on the development and deployment of advanced AI models that are already making waves in the industry.

Skills & Technologies

AWS

GCP

Docker

GitHub

PyTorch

Remote

Ready to Apply?

Apply Externally

You will be redirected to an external site to apply.

Fastino AI Inc.

Visit Website

About Fastino AI Inc.

Fastino AI is a technology company focused on developing advanced artificial intelligence solutions. Their core business revolves around creating and deploying AI-powered tools and platforms designed to automate complex processes and enhance decision-making across various industries. They specialize in areas such as machine learning, natural language processing, and computer vision, aiming to provide businesses with innovative ways to leverage data and improve operational efficiency. Fastino AI's offerings cater to clients seeking to integrate cutting-edge AI capabilities into their existing workflows, driving digital transformation and competitive advantage.

View Company Profile

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.