
Job Overview
Location
Los Angeles, Poland
Job Type
Full-time
Category
Software Engineering
Date Posted
March 4, 2026
Full Job Description
📋 Description
- Join Oscilar Inc. as a Sr./Staff Infrastructure/Site Reliability Engineer (SRE) and play a pivotal role in shaping the future of trust in the age of AI. Oscilar is at the forefront of building the most advanced AI Risk Decisioning™ Platform, empowering banks, fintechs, and digitally native organizations to effectively manage fraud, credit, and compliance risk through sophisticated AI. If you are driven by the challenge of solving complex problems and are passionate about making the internet a safer place, Oscilar offers a unique and impactful career opportunity.
- As a rapidly growing company, Oscilar's systems are continuously evolving in complexity. This role is designed for an experienced SRE who will assume comprehensive ownership of the reliability of our multi-region, cloud-native platform. You will be granted the authority and autonomy to architect, implement, and continuously enhance systems that maintain peak performance and resilience, even under demanding conditions such as significant traffic surges, dependency failures, and global deployment scenarios. Your contributions will be instrumental in defining how Oscilar scales its operations, builds a robust observability framework, and manages the underlying infrastructure that supports billions of events and extensive large-scale data pipelines.
- In this capacity, you will be responsible for architecting and operating resilient cloud infrastructure, with a primary focus on AWS, leveraging Infrastructure as Code tools like Pulumi. You will lead critical initiatives aimed at improving availability, reducing latency, and optimizing the performance of our systems at scale. A key aspect of your role will involve designing and evolving our Continuous Integration and Continuous Deployment (CI/CD) pipelines, ensuring they are optimized for speed, safety, and repeatability, thereby accelerating our development cycles while maintaining system integrity.
- You will define the essential metrics, alerts, and runbooks that will form the backbone of our observability strategy, providing deep insights into system health and performance. Proactive risk mitigation will be a core function, involving the execution of chaos experiments and failure simulations to systematically identify and address potential vulnerabilities, thereby hardening the platform against unforeseen issues. Furthermore, you will serve as a technical mentor to other engineers, establishing and promoting best practices for SRE principles and operations across the entire company, fostering a culture of reliability and operational excellence.
- This role demands a proactive approach to system design and maintenance, ensuring that our platform not only meets current demands but is also prepared for future growth and challenges. You will collaborate closely with engineering teams to embed reliability considerations into the development lifecycle, from initial design to production deployment. Your expertise will be crucial in troubleshooting complex production issues, performing root cause analysis, and implementing preventative measures to avoid recurrence. The opportunity exists to significantly influence the technical direction and operational maturity of Oscilar's infrastructure, contributing directly to the company's success and its mission to create a safer digital financial ecosystem.
- The ideal candidate will possess a proven track record in senior SRE or Infrastructure Engineering roles within high-scale, complex environments. Expert-level proficiency in AWS and Infrastructure as Code (IaC) tools such as Pulumi or Terraform is essential. Strong programming skills in Go or Python are required, with a preference for Go, as it is the primary language used by the team. A deep understanding of distributed systems, including technologies like Kafka and ClickHouse, and experience with microservices architecture are critical. Mastery of container orchestration platforms, particularly Kubernetes, and extensive experience in production debugging are also necessary. The role requires a strong sense of ownership and the ability to exercise sound judgment in balancing development velocity with the imperative of maintaining system reliability. You will be a key player in ensuring the stability and scalability of a platform that is critical to the operations of leading financial institutions.
Skills & Technologies
Python
Go
AWS
Kubernetes
Terraform
DevOps
Senior
Remote
Degree Required
About Oscilar Inc.
Oscilar provides a no-code risk decisioning platform that enables fintechs and banks to build, test, and deploy real-time fraud prevention and credit risk models. The system centralizes identity, transaction, and alternative data, applies machine-learning rules, and offers continuous monitoring and explainable decisions. It is designed for product managers and analysts to reduce charge-offs, false positives, and manual reviews without engineering support.


