
Job Overview
Location
Barcelona, Spain
Job Type
Full-time
Category
Software Engineering
Date Posted
March 5, 2026
Full Job Description
đź“‹ Description
- • As a mid-level Site Reliability Engineer (SRE) at Auth0, a part of Okta, Inc., you will play a pivotal role in ensuring the unwavering reliability, resilience, and scalability of our world-class identity platform. This is a unique opportunity to join a dynamic European-based SRE team and contribute directly to the core robustness of a service trusted by hundreds of millions of users globally. Your work will go beyond routine maintenance; you will be a hands-on builder, architecting and implementing solutions that embed reliability into the very fabric of our system, preparing it for exponential growth and maintaining the highest standards of availability and performance.
- • Your primary responsibility will be to design and develop custom software solutions, with a strong emphasis on the Go programming language, to proactively enhance the platform's reliability, resilience, and redundancy. This involves creating tools and systems that automate critical processes, detect potential issues before they impact users, and ensure seamless failover and recovery mechanisms.
- • You will collaborate closely with various engineering teams across the organization. Your role will involve partnering with these teams to instill and reinforce reliability principles, acting as a subject matter expert to improve the availability, performance, and observability of their services. This includes conducting thorough reviews of new features and infrastructure changes to ensure they meet our stringent reliability standards.
- • Leveraging your deep understanding of infrastructure and observability principles, you will continuously identify opportunities for improvement within the product and its underlying infrastructure. This proactive approach will involve implementing innovative solutions to address performance bottlenecks, reduce latency, and enhance the overall user experience.
- • A critical aspect of this role is participating in our on-call rotation. In this capacity, you will be responsible for providing rapid, effective, and decisive responses to critical incidents. Your expertise will be crucial in troubleshooting complex production issues, mitigating immediate impacts, and accurately escalating problems when necessary, ensuring minimal disruption to our customers.
- • You will be instrumental in developing and refining our SRE tooling and processes. This focus on automation and operational efficiency aims to streamline workflows, reduce manual toil, and empower the entire engineering organization to operate more effectively and reliably.
- • Furthermore, you will take ownership of defining, documenting, and championing reliability best practices across the organization. This includes creating clear guidelines, conducting training sessions, and fostering a culture where reliability is a shared responsibility, ensuring consistent application of SRE principles throughout the development lifecycle.
- • This position demands a proactive and systematic approach to problem-solving, coupled with a high degree of ownership and accountability for the systems you manage. You will thrive in an environment that encourages autonomy and initiative, where you are empowered to make significant contributions.
- • You will be expected to contribute to our on-call rotation, ensuring swift and effective resolution of critical incidents in a 24/7 cloud-based environment. This requires a calm demeanor under pressure and a methodical approach to troubleshooting and incident management.
- • The role offers a career-defining opportunity to tackle complex challenges at a massive scale, working with cutting-edge technologies and contributing to a product that is fundamental to modern digital security and growth. If you are a curious, motivated, and passionate engineer eager to build reliability directly into a leading platform, we encourage you to apply.
Skills & Technologies
Go
AWS
Azure
GCP
Docker
Remote
About Okta, Inc.
Okta provides cloud-based identity and access management software that enables organizations to securely connect employees, partners, and customers to the right technologies. Its platform offers single sign-on, multi-factor authentication, lifecycle management, API access control, and analytics to manage user identities across applications, devices, and networks. The company serves enterprises, government agencies, and small to medium-sized businesses, helping them improve security, compliance, and user experience while reducing IT complexity and support costs.
Similar Opportunities

Coinbase Global, Inc.
Remote - Canada
Full-time
Expires May 2, 2026
Go
MongoDB
Redis
+3 more
4 days ago

