Kraken logo

Senior AI Compute Infrastructure Engineer

Job Overview

Location

United Kingdom

Job Type

Full-time

Category

Engineering

Date Posted

May 8, 2026

Full Job Description

đź“‹ Description

  • • Senior AI Compute Infrastructure Engineer role focused on building and operating GPU and accelerator infrastructure to power AI workloads for Kraken’s exchange, enabling secure, efficient, and scalable model training, inference, and evaluation in-house.
  • • Day-to-day responsibilities include owning and operating GPU clusters, designing infrastructure for on-prem model execution, optimizing inference pipelines using frameworks like vLLM and TensorRT, building observability and alerting systems, partnering with ML engineers to remove bottlenecks, and driving reliability through incident response and runbooks.
  • • The role sits within a dedicated AI Compute and Infrastructure team under engineering leadership, collaborating closely with AI/ML researchers, platform engineers, security, and product teams to make Kraken’s AI ambitions real through production-grade, cost-efficient compute.
  • • You will gain deep expertise in large-scale GPU infrastructure, ML serving systems, performance optimization, and cost-efficient compute at scale, while contributing to long-term architecture decisions that balance performance, scalability, and operational safety in a high-stakes, always-on environment.

🎯 Requirements

  • • 5+ years of infrastructure engineering experience with significant focus on GPU compute, ML infrastructure, distributed systems, HPC, or large-scale production platforms
  • • Hands-on experience operating GPU clusters or accelerator-backed infrastructure in production, including scheduling, orchestration, utilization monitoring, and cost optimization
  • • Strong systems engineering fundamentals across Linux, networking, storage, containers, Kubernetes, distributed runtimes, and production debugging
  • • Experience with ML serving frameworks such as vLLM, Triton Inference Server, TensorRT, TorchServe, KServe, Ray Serve, or equivalent systems
  • • Proficiency in Python for infrastructure automation, tooling, debugging, integration, and operational workflows
  • • Track record of optimizing compute costs while maintaining performance, reliability, and availability expectations

🏖️ Benefits

  • • Fully remote work with global team across 70+ countries and 50+ languages
  • • Opportunity to work on mission-driven crypto infrastructure that enables financial freedom and inclusion
  • • Access to cutting-edge AI hardware and software stack including custom accelerators and advanced serving frameworks
  • • Collaboration with top-tier crypto experts, AI researchers, and platform engineers in a high-impact, innovative environment
  • • Commitment to diversity, equity, and inclusion with merit-based hiring and fair chance considerations
  • • Ongoing learning and professional development through internal knowledge sharing and exposure to frontier AI infrastructure challenges

Skills & Technologies

Python
Rust
Node.js
AWS
Kubernetes
DevOps
Senior
Remote

Ready to Apply?

You will be redirected to an external site to apply.

About Kraken

Kraken is a global cryptocurrency exchange established in 2011, offering spot and futures trading for Bitcoin, Ethereum and 200+ digital assets. Headquartered in San Francisco with entities worldwide, it serves retail and institutional clients, providing custody, staking, an NFT marketplace and OTC desk. The platform emphasizes security, regulatory compliance and educational resources.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

ARGENTINA
Full-time
Expires Jun 20, 2026
AWS
Remote

25 days ago

Apply
ARGENTINA
Full-time
Expires Jun 20, 2026
AWS
Apache Spark
Remote
+1 more

25 days ago

Apply
ARGENTINA
Full-time
Expires Jun 20, 2026
Python
Scala
AWS
+5 more

25 days ago

Apply
Caylent, Inc. logo

Caylent, Inc.

ARGENTINA
Full-time
Expires Jun 20, 2026
Python
TypeScript
React
+4 more

25 days ago

Apply