Lambda Inc. logo

Data Center Operations System Engineer (Kansas City, MO)

Job Overview

Location

Kansas City, MO - Data Center

Job Type

Full-time

Category

DevOps & SysAdmin

Date Posted

March 23, 2026

Full Job Description

đź“‹ Description

  • • As a Data Center Operations System Engineer at Lambda Inc., you will play a critical role in building and maintaining the physical infrastructure that powers one of the world’s leading AI cloud platforms, ensuring reliable, high-performance compute resources are available for AI researchers, enterprises, and hyperscalers advancing the frontiers of artificial intelligence.
  • • Your work directly supports Lambda’s mission to make compute as ubiquitous as electricity by enabling the seamless deployment, operation, and scaling of GPU-accelerated infrastructure across its data center footprint, starting with the Kansas City, MO facility requiring on-site presence 5 days per week for 7/24 coverage.
  • • You will ensure new server, storage, and network infrastructure is properly racked, labeled, cabled, and configured to exacting standards, forming the physical foundation for Lambda’s AI cloud services.
  • • You will troubleshoot complex hardware and software issues in some of the world’s most advanced AI systems, diagnosing and resolving failures to minimize downtime and maintain service reliability for thousands of global customers.
  • • You will document data center layout and network topology in DCIM (Data Center Infrastructure Management) software, creating accurate, up-to-date records that support capacity planning, change management, and operational efficiency across all Lambda facilities.
  • • You will collaborate with supply chain and manufacturing teams to coordinate the timely receipt, staging, and deployment of systems, aligning hardware delivery with project timelines for large-scale rollouts.
  • • You will manage a parts depot inventory, tracking equipment through the full lifecycle—from delivery and storage to staging, deployment, and handoff—to ensure accountability, reduce loss, and streamline operations.
  • • You will work closely with the HW Support team to resolve data center infrastructure-related support tickets, acting as a bridge between physical operations and technical support to ensure rapid issue resolution.
  • • You will partner with the RMA team to identify faulty components, initiate return processes, and coordinate replacement orders, maintaining optimal spare parts availability and minimizing service disruption.
  • • You will adhere to and reinforce installation standards for equipment placement, labeling, and cabling, promoting consistency, safety, and ease of maintenance across all Lambda data centers to improve discoverability and reduce human error.
  • • You are expected to have familiarity with critical data center infrastructure systems, including power distribution units (PDUs), uninterruptible power supplies (UPS), cooling and airflow management, environmental monitoring sensors, capacity planning methodologies, DCIM platforms, and structured cabling best practices.
  • • You must demonstrate exceptional attention to detail and the ability to precisely follow technical instructions, diagrams, and procedures to ensure safety, compliance, and operational excellence in a high-density computing environment.
  • • You should be action-oriented, proactive, and eager to learn, thriving in a fast-paced, mission-driven culture where initiative and continuous improvement are valued.
  • • You must be willing to travel to support the bring-up of new data center locations as Lambda expands its global footprint to meet growing demand for AI compute.

🎯 Requirements

  • • Familiarity with critical data center infrastructure systems including power distribution, airflow management, environmental monitoring, capacity planning, DCIM software, and structured cabling
  • • Strong attention to detail and ability to follow instructions precisely in technical environments
  • • Action-oriented mindset with a willingness to learn and take initiative in fast-paced operations
  • • Willingness to travel for new data center site build-outs and equipment deployment
  • • Ability to work on-site 5 days per week in Kansas City, MO for 7/24 data center coverage

🏖️ Benefits

  • • Generous cash and equity compensation package
  • • Comprehensive health, dental, and vision coverage for employees and dependents
  • • Wellness and commuter stipends for eligible roles
  • • 401(k) plan with 2% company match for U.S. employees
  • • Flexible paid time off plan designed for actual use and work-life balance
  • • Opportunity to work with cutting-edge AI hardware from partners like NVIDIA, Supermicro, and Wistron
  • • Exposure to large-scale, GPU-optimized data center operations supporting breakthrough AI research and enterprise innovation

Skills & Technologies

Linux
Onsite

Ready to Apply?

You will be redirected to an external site to apply.

Lambda Inc. logo
Lambda Inc.
Visit Website

About Lambda Inc.

Lambda Inc. provides cloud-based GPU clusters and workstations for artificial-intelligence research and development. The company designs and operates high-performance hardware infrastructure optimized for machine-learning workloads, offering on-demand access to NVIDIA GPUs, pre-configured deep-learning software stacks, and scalable storage. Customers include AI labs, universities, and enterprises training large language and computer-vision models. Founded in 2012, Lambda is headquartered in San Francisco and maintains data centers across North America and Europe.

Get more remote jobs like this

Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.

Newsletter

Weekly remote jobs and featured talent.

No spam. Only curated remote roles and product updates. You can unsubscribe anytime.

Similar Opportunities

Yerevan, Armenia
Full-time
Expires Jun 4, 2026
Python
Java
Go
+6 more

20 days ago

Apply
Pragmatike Soluciones TecnolĂłgicas S.L. logo

Pragmatike Soluciones TecnolĂłgicas S.L.

Armenia
Full-time
Expires Jun 6, 2026
JavaScript
TypeScript
Rust
+4 more

19 days ago

Apply
Yerevan, Armenia
Full-time
Expires Jun 4, 2026
Python
Java
Go
+5 more

20 days ago

Apply
Argentina
Full-time
Expires May 31, 2026
Azure
Remote
$40k-45k

24 days ago

Apply