Socure Inc. logo

Senior Software Engineer - SRE

Job Overview

Location

Hybrid - US

Job Type

Full-time

Category

Software Engineering

Date Posted

June 4, 2026

Full Job Description

đź“‹ Description

  • • Own end-to-end design, operation, and continuous improvement of highly available, scalable AWS infrastructure for mission-critical production systems.
  • • Design, deploy, and maintain Kubernetes platforms using Amazon EKS in production environments with a focus on reliability and scalability.
  • • Implement and enforce infrastructure-as-code using Terraform and GitOps principles to ensure consistent, auditable, and repeatable deployments.
  • • Build and operate CI/CD pipelines using GitHub Actions and ArgoCD to enable safe, fast, and automated software releases.
  • • Establish and maintain robust observability systems using Datadog or similar tools to monitor infrastructure, Kubernetes clusters, and application performance through metrics, logs, traces, and alerting.
  • • Define and operationalize SLIs and SLOs to drive reliability decisions, reduce toil, and proactively prevent incidents rather than react to them.
  • • Lead incident response efforts including root cause analysis, post-mortem documentation, and implementation of long-term remediations to prevent recurrence.
  • • Automate operational tasks through production-quality code written in Go or Python to eliminate manual toil and improve system resilience.
  • • Troubleshoot complex, multi-layer issues across Kubernetes networking, storage, scheduling, and underlying AWS services.
  • • Raise operational standards by documenting best practices, improving runbooks, and promoting engineering discipline across teams.
  • • Collaborate with software engineering teams to embed reliability practices into the development lifecycle and ensure systems are designed for observability and resilience from inception.
  • • Maintain strong security posture across cloud infrastructure by applying IAM best practices, network segmentation, and compliance controls.
  • • Continuously evaluate and improve system performance, cost efficiency, and fault tolerance under high-traffic and high-stakes conditions.
  • • Contribute to on-call rotations and be accountable for system uptime, latency, and error rates across Socure’s identity verification platform.
  • • Translate observability data into actionable operational improvements that directly impact system reliability and customer trust.
  • • Work in a hybrid environment with expectations for in-office collaboration in the U.S. while maintaining remote flexibility as needed.

🎯 Requirements

  • • Deep AWS expertise including networking, compute, IAM, scaling, and security
  • • Strong hands-on experience managing infrastructure at scale using Terraform
  • • Very strong Kubernetes fundamentals and production experience operating Amazon EKS
  • • Ability to write clean, production-quality code in Go or Python
  • • Proven experience building and operating CI/CD pipelines with GitHub Actions and ArgoCD
  • • Hands-on experience with Datadog or similar observability tools for infrastructure, Kubernetes, and APM monitoring

🏖️ Benefits

  • • Equal opportunity employer with no discrimination based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status
  • • Accommodations available during application, interview, or onboarding process upon request
  • • Opportunity to work on mission-critical identity trust infrastructure impacting businesses, governments, and millions of users daily
  • • Collaborative environment with high bar for engineering excellence and ownership

Skills & Technologies

Python
AWS
Kubernetes
Terraform
GitHub
DevOps
Senior
Hybrid

Ready to Apply?

You will be redirected to an external site to apply.

Socure Inc. logo
Socure Inc.
Visit Website

About Socure Inc.

Socure Inc. provides digital identity verification and fraud prevention software for financial services, fintech, e-commerce and government clients. The platform applies machine learning and graph-based analytics to link and validate identity elements in real time, detecting synthetic identities, account takeover and document fraud. It integrates via APIs and SDKs for onboarding, KYC/AML compliance and transaction monitoring, aiming to reduce false positives and manual reviews while improving approval rates for legitimate users worldwide.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

San Francisco, California
Full-time
Expires Aug 2, 2026
Python
JavaScript
Ruby
+3 more

4 days ago

Apply
Expired
London, United Kingdom; Remote - United States
Full-time
Expired Apr 25, 2026
Remote

3 months ago

Apply
Athens, Greece
Full-time
Expires Aug 2, 2026
Rust
AWS
Azure
+4 more

4 days ago

Apply
USA | Remote
Full-time
Expires Jun 21, 2026
Python
JavaScript
TypeScript
+3 more

2 months ago

Apply