This job has expired
This position was posted on October 27, 2025 and is likely no longer accepting applications. We've kept it here for historical reference.

Job Overview
Location: Remote, UK
Job Type: Full-time
Category: Software Engineering
Date Posted: October 27, 2025
Full Job Description
📋 Description
- Own the reliability, performance and scalability of Kraken’s energy platform products, which serve millions of customers worldwide and directly influence the stability of the smart, sustainable energy system we are building.
- Act as the primary SRE partner for multiple product squads, embedding yourself in their day-to-day work to teach, coach and enforce reliability best practices, implementation patterns and effective use of our AWS-centric platform.
- Dive deep into code and infrastructure alongside developers (refactoring Django services, tuning PostgreSQL queries on Amazon RDS, right-sizing Kubernetes deployments on EKS, and instrumenting everything with Datadog) to deliver measurable latency, throughput and availability gains.
- Build proof-of-concept architectures that anticipate 10× traffic growth, validate new deployment topologies, and feed repeatable patterns back into the core platform so every team benefits from your discoveries.
- Lead production-readiness reviews for every new feature and service, defining SLOs, runbooks, canary strategies and rollback plans that protect the customer experience while enabling rapid, confident releases.
- Respond to incidents as part of a follow-the-sun on-call rotation, coordinate multi-team war rooms, drive root-cause analysis, and turn post-mortems into concrete backlog items that prevent recurrence.
- Analyse metrics, traces and logs to surface systemic risks such as slow queries, memory leaks, noisy neighbours and thundering-herd events, and present data-driven recommendations to product and platform leadership.
- Champion infrastructure-as-code discipline using Terraform, ensuring every change is peer-reviewed, version-controlled, tested and reproducible across environments from dev to production.
- Mentor junior engineers and share knowledge through internal tech talks, lightning demos and guilds, raising the reliability culture of the entire organisation.
- Collaborate with security, data and compliance teams to harden services, rotate secrets, patch CVEs and maintain SOC 2 / ISO 27001 controls without slowing delivery.
- Contribute to open-source tooling we maintain (Python libraries, Terraform modules, Helm charts) and give back to the community that powers our stack.
- Influence the roadmap of the Global Platform Engineering Reliability group by synthesising field feedback into epics that improve CI/CD, observability, cost optimisation and developer experience for everyone at Kraken.
🎯 Requirements
- 5+ years of experience as a Site Reliability Engineer, DevOps Engineer or Platform Engineer supporting high-traffic, customer-facing SaaS products.
- Expert-level Python skills, including Django, the Django ORM and Celery in production; the ability to read, debug and optimise application code is essential.
- Deep hands-on experience with AWS (EC2, RDS, EKS, S3, Lambda, IAM, etc.) and infrastructure-as-code using Terraform or CloudFormation.
- Proven track record running Kubernetes at scale on Amazon EKS, including cluster upgrades, multi-tenancy, resource quotas, network policies and cost optimisation.
- Strong proficiency with PostgreSQL on Amazon RDS: query tuning, index design, partitioning, replication, backup/restore and performance troubleshooting.
- Demonstrated success defining and driving SLOs/SLIs, error budgets and blameless post-mortems to improve system reliability and team culture.
🏖️ Benefits
- Fully remote-first culture with flexible hours and the option to work from anywhere in the UK, Spain, France, Italy or Germany.
- 28 days annual leave plus local public holidays, volunteer days and a paid sabbatical after four years.
- £1,500 annual learning & development budget, access to conferences, certifications and internal “Kraken University” courses.
- Private medical insurance, life assurance, income protection and a generous pension contribution (UK) or equivalent benefits in other countries.
Skills & Technologies
Python
Django
PostgreSQL
AWS
Docker
Senior
Remote
About Kraken
Kraken is an energy technology company whose platform products serve millions of customers worldwide, supporting a smart, sustainable energy system. It is not the cryptocurrency exchange of the same name.


