
Job Overview
Location
Hybrid - US
Job Type
Full-time
Category
Software Engineering
Date Posted
June 4, 2026
Full Job Description
📋 Description
- Own end-to-end design, operation, and continuous improvement of highly available, scalable AWS infrastructure for mission-critical production systems.
- Design, deploy, and maintain Kubernetes platforms using Amazon EKS in production environments with a focus on reliability and scalability.
- Implement and enforce infrastructure-as-code using Terraform and GitOps principles to ensure consistent, auditable, and repeatable deployments.
- Build and operate CI/CD pipelines using GitHub Actions and ArgoCD to enable safe, fast, and automated software releases.
- Establish and maintain robust observability systems using Datadog or similar tools to monitor infrastructure, Kubernetes clusters, and application performance through metrics, logs, traces, and alerting.
- Define and operationalize SLIs and SLOs to drive reliability decisions, reduce toil, and proactively prevent incidents rather than react to them.
- Lead incident response efforts including root cause analysis, post-mortem documentation, and implementation of long-term remediations to prevent recurrence.
- Automate operational tasks through production-quality code written in Go or Python to eliminate manual toil and improve system resilience.
- Troubleshoot complex, multi-layer issues across Kubernetes networking, storage, scheduling, and underlying AWS services.
- Raise operational standards by documenting best practices, improving runbooks, and promoting engineering discipline across teams.
- Collaborate with software engineering teams to embed reliability practices into the development lifecycle and ensure systems are designed for observability and resilience from inception.
- Maintain strong security posture across cloud infrastructure by applying IAM best practices, network segmentation, and compliance controls.
- Continuously evaluate and improve system performance, cost efficiency, and fault tolerance under high-traffic and high-stakes conditions.
- Contribute to on-call rotations and be accountable for system uptime, latency, and error rates across Socure’s identity verification platform.
- Translate observability data into actionable operational improvements that directly impact system reliability and customer trust.
- Work in a hybrid environment with expectations for in-office collaboration in the U.S. while maintaining remote flexibility as needed.
🎯 Requirements
- Deep AWS expertise including networking, compute, IAM, scaling, and security
- Strong hands-on experience managing infrastructure at scale using Terraform
- Very strong Kubernetes fundamentals and production experience operating Amazon EKS
- Ability to write clean, production-quality code in Go or Python
- Proven experience building and operating CI/CD pipelines with GitHub Actions and ArgoCD
- Hands-on experience with Datadog or similar observability tools for infrastructure, Kubernetes, and APM monitoring
🏖️ Benefits
- Equal opportunity employer with no discrimination based on race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status
- Accommodations available during application, interview, or onboarding process upon request
- Opportunity to work on mission-critical identity trust infrastructure impacting businesses, governments, and millions of users daily
- Collaborative environment with high bar for engineering excellence and ownership
About Socure Inc.
Socure Inc. provides digital identity verification and fraud prevention software for financial services, fintech, e-commerce and government clients. The platform applies machine learning and graph-based analytics to link and validate identity elements in real time, detecting synthetic identities, account takeover and document fraud. It integrates via APIs and SDKs for onboarding, KYC/AML compliance and transaction monitoring, aiming to reduce false positives and manual reviews while improving approval rates for legitimate users worldwide.