Lambda Inc. logo

Sr. Network Engineer

Job Overview

Location

San Francisco Office (Fremont St)

Job Type

Full-time

Category

Software Engineering

Date Posted

June 13, 2026

Full Job Description

đź“‹ Description

  • • Design, deploy, and scale high-performance cloud networking infrastructure to support AI workloads across tens of thousands of customers.
  • • Configure and manage production-grade networking hardware for new and existing GPU clusters in data centers.
  • • Ensure 99.9%+ network availability through implementation of redundancy, failover mechanisms, and proactive monitoring.
  • • Automate network configuration management using Python, Ansible, and version control systems like Git to reduce manual intervention.
  • • Collaborate with internal teams and external customers to diagnose, troubleshoot, and resolve complex network-related issues in real time.
  • • Deploy and maintain network monitoring tools including Datadog, Prometheus, Grafana, Clickhouse, gNMI, and OpenTelemetry for real-time visibility.
  • • Participate in day-two operations and on-call rotation for the Network Engineering team to ensure rapid incident response.
  • • Implement and optimize CLOS/Spine-and-Leaf topologies with EVPN/VXLAN, ECMP, BGP, and fast-convergence protocols across multi-vendor environments.
  • • Manage Next-Generation Firewalls (e.g., Fortinet) to enforce security policies across cloud and on-premises network segments.
  • • Integrate cloud networking services from AWS, GCP, and OCI into Lambda’s hybrid infrastructure with consistent operational practices.
  • • Work with network hardware vendors including Arista, Juniper, Cisco, Cumulus/SONiC, and Opengear to validate, deploy, and support equipment.
  • • Apply deep Linux networking stack knowledge to troubleshoot kernel-level connectivity, routing, and performance issues.
  • • Contribute to the evolution of network architecture to support emerging AI workloads, including high-bandwidth, low-latency requirements.
  • • Document network designs, configurations, and operational procedures to ensure knowledge sharing and compliance.
  • • Partner with infrastructure and operations teams to align network topology with data center power, cooling, and space constraints.

🎯 Requirements

  • • 10+ years of experience in IT and networking
  • • 6+ years of experience designing and operating production data center networks
  • • Expertise in CLOS/Spine-and-Leaf fabrics, EVPN/VXLAN, ECMP, BGP, and fast-convergence techniques
  • • Production experience managing Next-Generation Firewalls (e.g., Fortigate)
  • • Hands-on experience with cloud networking (AWS, GCP, OCI)
  • • Proficiency in Linux command line and Linux networking stack internals
  • • Strong automation skills using Python and Ansible, with experience using Git or similar source control
  • • Production experience with multiple network hardware vendors (Arista, Juniper, Cisco, Cumulus/SONiC, Opengear)
  • • Experience with network monitoring stacks (Datadog, Prometheus, Grafana, Clickhouse, gNMI, OpenTelemetry)

🏖️ Benefits

  • • Generous cash & equity compensation
  • • Health, dental, and vision coverage for employee and dependents
  • • Wellness and commuter stipends for select roles
  • • 401k Plan with 2% company match (for USA employees)
  • • Flexible paid time off plan that is actively used by employees

Skills & Technologies

Python
AWS
GCP
Terraform
Git
Senior
Onsite

Ready to Apply?

You will be redirected to an external site to apply.

AI Job Fit Analysis
Pro

See exactly how your profile matches this role — strengths, skill gaps, and what to do about them.

Lambda Inc. logo
Lambda Inc.
Visit Website

About Lambda Inc.

Lambda Inc. provides cloud-based GPU clusters and workstations for artificial-intelligence research and development. The company designs and operates high-performance hardware infrastructure optimized for machine-learning workloads, offering on-demand access to NVIDIA GPUs, pre-configured deep-learning software stacks, and scalable storage. Customers include AI labs, universities, and enterprises training large language and computer-vision models. Founded in 2012, Lambda is headquartered in San Francisco and maintains data centers across North America and Europe.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Expired
San Francisco, CA or Remote (USA)
Full-time
Expired Apr 27, 2026
Senior
Remote

4 months ago

Expired
Xebia Poland Sp. z o.o. logo

Xebia Poland Sp. z o.o.

Budapest, Budapest, Hungary
Full-time
Expired Apr 27, 2026
AWS
Azure
GCP
+2 more

4 months ago

Expired
UK (Remote)
Contract
Expired Apr 27, 2026
Design
Remote

4 months ago

Expired
Australia (Remote)
Contract
Expired Apr 27, 2026
Design
Remote

4 months ago