
Job Overview
Location
United Kingdom
Job Type
Full-time
Category
Engineering
Date Posted
May 8, 2026
Full Job Description
đź“‹ Description
- • Senior AI Compute Infrastructure Engineer role focused on building and operating GPU and accelerator infrastructure to power AI workloads for Kraken’s exchange, enabling secure, efficient, and scalable model training, inference, and evaluation in-house.
- • Day-to-day responsibilities include owning and operating GPU clusters, designing infrastructure for on-prem model execution, optimizing inference pipelines using frameworks like vLLM and TensorRT, building observability and alerting systems, partnering with ML engineers to remove bottlenecks, and driving reliability through incident response and runbooks.
- • The role sits within a dedicated AI Compute and Infrastructure team under engineering leadership, collaborating closely with AI/ML researchers, platform engineers, security, and product teams to make Kraken’s AI ambitions real through production-grade, cost-efficient compute.
- • You will gain deep expertise in large-scale GPU infrastructure, ML serving systems, performance optimization, and cost-efficient compute at scale, while contributing to long-term architecture decisions that balance performance, scalability, and operational safety in a high-stakes, always-on environment.
🎯 Requirements
- • 5+ years of infrastructure engineering experience with significant focus on GPU compute, ML infrastructure, distributed systems, HPC, or large-scale production platforms
- • Hands-on experience operating GPU clusters or accelerator-backed infrastructure in production, including scheduling, orchestration, utilization monitoring, and cost optimization
- • Strong systems engineering fundamentals across Linux, networking, storage, containers, Kubernetes, distributed runtimes, and production debugging
- • Experience with ML serving frameworks such as vLLM, Triton Inference Server, TensorRT, TorchServe, KServe, Ray Serve, or equivalent systems
- • Proficiency in Python for infrastructure automation, tooling, debugging, integration, and operational workflows
- • Track record of optimizing compute costs while maintaining performance, reliability, and availability expectations
🏖️ Benefits
- • Fully remote work with global team across 70+ countries and 50+ languages
- • Opportunity to work on mission-driven crypto infrastructure that enables financial freedom and inclusion
- • Access to cutting-edge AI hardware and software stack including custom accelerators and advanced serving frameworks
- • Collaboration with top-tier crypto experts, AI researchers, and platform engineers in a high-impact, innovative environment
- • Commitment to diversity, equity, and inclusion with merit-based hiring and fair chance considerations
- • Ongoing learning and professional development through internal knowledge sharing and exposure to frontier AI infrastructure challenges
Skills & Technologies
Python
Rust
Node.js
AWS
Kubernetes
DevOps
Senior
Remote
About Kraken
Kraken is a global cryptocurrency exchange established in 2011, offering spot and futures trading for Bitcoin, Ethereum and 200+ digital assets. Headquartered in San Francisco with entities worldwide, it serves retail and institutional clients, providing custody, staking, an NFT marketplace and OTC desk. The platform emphasizes security, regulatory compliance and educational resources.
Get more remote jobs like this
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities
ARGENTINA
Full-time
Expires Jun 20, 2026
Python
Scala
AWS
+5 more
25 days ago
