
Job Overview
Location
San Francisco, CA - US
Job Type
Full-time
Category
Software Engineering
Date Posted
June 4, 2026
Full Job Description
đź“‹ Description
- • Own the end-to-end design, roadmap, and evolution of Crusoe’s network automation platform, enabling global fleet-scale operations with minimal human intervention.
- • Design and maintain the authoritative source of truth for network configuration and state using NetBox, Nautobot, or equivalent systems, ensuring data integrity across all teams and workflows.
- • Architect declarative, model-driven configuration pipelines using Python, Nornir, Ansible, Jinja2, and CI/CD to eliminate configuration drift and treat network infrastructure as code.
- • Lead the implementation of gNMI, OpenConfig, and NETCONF/YANG-based telemetry and configuration management across multi-vendor network fabrics including Arista, Juniper, and NVIDIA/Mellanox.
- • Build and deploy self-healing, event-driven workflows that automatically detect, correlate, and remediate network faults without human escalation across 10K+ devices.
- • Define and implement the observability platform using Prometheus, Grafana, and streaming gNMI collectors to provide real-time metrics, alerting, and dashboards for global network health.
- • Partner with Network Architecture to ensure all network designs are automation-first, programmatically deployable, validatable, and operable at hyperscale.
- • Mentor Staff and Senior Network Engineers through code reviews, design reviews, and technical guidance to elevate the engineering standards across the organization.
- • Drive the adoption of infrastructure-as-code practices across network operations, ensuring automation systems are testable, production-owned, and integrated into CI/CD pipelines.
- • Operate at scale across multi-region, leaf-spine data center fabrics with deep expertise in BGP, EVPN-VXLAN, and LLDP protocols.
- • Translate intent-based network designs into automated, production-grade systems that reduce operational toil and enable sublinear scaling of network operations with headcount.
- • Ensure all automation systems are resilient, auditable, and capable of handling blast radii measured in racks rather than individual ports.
- • Represent the Network Automation organization in cross-functional planning with Deployment, Operations, and Site Reliability teams to align automation strategy with broader infrastructure goals.
🎯 Requirements
- • 12+ years of network engineering experience with a demonstrated focus on production network automation, platform engineering, and infrastructure as code in hyperscale or internet-scale environments.
- • Mastery of Python or Go at a platform level, with production-quality, CI/CD-integrated, and testable codebases.
- • Deep hands-on experience with gNMI, OpenConfig, NETCONF, and YANG-modeled configuration and telemetry.
- • Proven ownership of a network source of truth platform (NetBox, Nautobot, or equivalent) including data model, integrations, and CI/CD pipelines.
- • Strong hands-on expertise with Arista (EOS), Juniper (Junos), and NVIDIA/Mellanox platforms in leaf-spine architectures.
- • Track record of building auto-remediation and closed-loop automation systems that resolve faults without human intervention.
🏖️ Benefits
- • Competitive compensation and equity packages including Restricted Stock Units
- • Paid time off, paid holidays, and leave of absence programs
- • Comprehensive health, dental, and vision insurance
- • Employer contributions to HSA account
- • Paid parental leave
- • 401(k) Retirement plan with company match up to 4% of salary
- • Professional development & tuition reimbursement
- • Mental health & wellness support
- • Commuter benefits (parking & transit)
- • Cell phone stipend
- • Daily meals allowance
- • Volunteer time off
- • Global travel insurance & emergency assistance
Skills & Technologies
About Crusoe Energy Systems LLC
Crusoe Energy Systems is building Crusoe Cloud, an AI cloud platform that provides managed AI services and AI data center infrastructure. They cater to businesses seeking to accelerate AI solution development with optimized models and high-performance computing. The company utilizes environmentally aligned power sources, including wind, solar, and natural gas, to power its data centers. With features like managed Kubernetes and Slurm, Crusoe simplifies operations and ensures reliability with 24/7 support. Crusoe is expanding its reach, including a strategic European expansion with its first data center in Norway. Crusoe recently raised $1.375 billion at a valuation above $10 billion.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities
1 month ago

Smart Working Solutions Limited
9 months ago


