Software Engineer - API Gateway

Job Overview

Location: Toronto
Job Type: Full-time
Category: Software Engineer
Date Posted: January 16, 2026

Full Job Description

📋 Description

  • Own and evolve the beating heart of Featherless.ai – the API gateway that routes every inference request from thousands of AI builders, startups, and Fortune-500 enterprises. Your code will authenticate users, enforce subscription entitlements, and expose a clean, lightning-fast surface for text, image, audio, and multi-modal models.
  • Ship daily improvements that directly impact GPU utilization and end-user latency. You’ll add new endpoints as fast as open-source models drop, patch regressions before customers notice, and instrument every hop so we can prove “done” with real usage data.
  • Triage and crush gnarly cross-stack bugs that span DNS, Kubernetes ingress, Node.js, Redis, MongoDB, Python microservices, and Cloudflare edge rules. When a 3 a.m. pager fires because a new 70B-parameter model suddenly spikes tail latency, you’ll be the calm engineer who traces it to a misconfigured context-length limiter and rolls out a fix within the hour.
  • Design and harden subscription management logic that turns feature flags into real-world guardrails: context windows, rate limits, and concurrency caps must adapt in real time as customers upgrade, downgrade, or burst during product launches.
  • Build rock-solid observability: wire OpenTelemetry traces from Fastify handlers through Redis queues to GPU nodes, surface golden signals in Elastic Cloud, and create Sentry alerts that page only when humans need to act.
  • Right-size infrastructure so that autoscaling GPU pools spin up fast enough for viral launches but never waste precious H100 hours. You’ll pair with the infra team to tune Kubernetes HPA thresholds, warm-pool sizes, and Cloudflare cache rules.
  • Collaborate in a tight-knit, remote-first Platform Team that swarms pull requests, debates design in Notion, and celebrates weekly demos. We bias to action: if a fix unblocks a customer today, we ship it and measure tomorrow.
  • Champion the developer experience: write clear OpenAPI specs, maintain Postman collections, and cut SDK snippets so external builders can integrate a new LLM in under five minutes.
  • Influence product direction by synthesizing support tickets, Discord chatter, and usage dashboards into crisp proposals. When the community begs for vision-language model support, you’ll scope the gateway changes, break them into two-week bets, and deliver.
  • Mentor junior engineers through pair programming and thoughtful code reviews, raising the bar for testing, documentation, and operational rigor across the entire stack.
  • Stay curious about the AI ecosystem: run open LLMs locally, experiment with LangChain, replicate a trending tweet-to-video pipeline, then bring that empathy back to the team so we never lose sight of the builders we serve.

Skills & Technologies

Python
Node.js
MongoDB
Redis
Kubernetes
Backend
Remote

About Featherless AI Inc.

Featherless AI Inc. provides serverless LLM hosting, offering developers and AI teams worldwide instant access to a continually expanding library of over 17,300 open-source models. Their platform facilitates seamless deployment for fine-tuning, testing, and production, empowering diverse applications from AI software development to creative writing platforms. Featherless distinguishes itself by eliminating the burden of server management and significantly reducing inference costs, providing transparent, flat-rate pricing with unlimited tokens. As an AI research lab, they pioneer open-source, post-transformer model research and aim to make advanced AI more accessible and affordable for a global customer base, supporting innovation across various industries.