This job has expired
This position was posted on January 3, 2026 and is likely no longer accepting applications. We've kept it here for historical reference. Check out the similar jobs below!

Job Overview
Location
Miami
Job Type
Full-time
Category
Software Engineering
Date Posted
January 3, 2026
Full Job Description
đź“‹ Description
- • Be the boots-on-the-ground hero who turns empty floor space into a humming, AI-ready data center. You will uncrate, rack, cable, and configure cutting-edge GPU and CPU servers, storage arrays, and network gear—then validate every component so that Tensorwave’s customers can train the next generation of large-language models without a hiccup.
- • Own the complete hardware lifecycle, from initial power-on through decommission. This includes BIOS/firmware updates, RAID configuration, burn-in testing, labeling every cable with laser precision, and documenting the final build in our CMDB so that remote engineers can locate any device in seconds.
- • Monitor environmental and performance telemetry 24/7. When a temperature sensor spikes, a DIMM throws ECC errors, or a switch port flaps, you are the first responder—diagnosing root cause, swapping FRUs, and writing concise post-mortems that prevent recurrence.
- • Coordinate daily with our Network, Systems, and Software teams in Miami, Austin, and remote locations. You will translate high-level deployment plans into rack elevations, power budgets, and cable-run schematics, then execute them on schedule—even if that means re-engineering on the fly when a shipment arrives with the wrong bezel.
- • Champion safety and compliance. You will enforce Tensorwave’s zero-tolerance policy for shortcuts: every lift is a two-person job, every energized circuit is LOTO-verified, and every asbestos tile is handled per EPA guidelines. Your checklists will become the gold standard for new sites.
- • Maintain meticulous digital and physical records. Spreadsheets, photos, and QR-coded labels must tell the story of every asset so that auditors, capacity planners, and midnight NOC engineers can trust the data implicitly.
- • Travel up to 25 % to stand up green-field facilities across the southeastern U.S. One week you might be installing rails in Miami, the next validating fiber runs in Atlanta. You will return home with lessons learned that improve our global runbook.
- • Participate in an on-call rotation that treats every alert as a potential customer-facing incident. When a PDU fails at 3 a.m., you will roll out of bed, drive to the site, and restore power before the first training job drops a checkpoint.
- • Contribute to continuous improvement. If you see a way to cut two minutes off a server swap or reduce cable clutter by 30 %, we will give you the tools and authority to pilot, measure, and scale your idea across all sites.
- • Mentor junior technicians and local contractors. Your calm demeanor and encyclopedic knowledge of hardware quirks will turn novices into confident operators who uphold Tensorwave’s reputation for reliability.
- • Embrace the mission: eliminate barriers to AI innovation. Every rack you commission and every alarm you silence directly enables researchers to cure diseases, optimize energy grids, and build safer autonomous vehicles.
Skills & Technologies
About TensorWave, Inc.
TensorWave develops and operates an AI accelerator cloud built on AMD Instinct GPUs, targeting large-scale model training and inference. The platform offers on-demand and reserved compute with high-bandwidth memory, InfiniBand networking, and container orchestration, delivered through a web console and API. Designed for generative AI, LLM fine-tuning, and HPC workloads, the service emphasizes AMD performance at competitive pricing, supported by 24/7 operations teams. Based in Las Vegas, Nevada, the company serves hyperscalers, research labs, and enterprises needing GPU capacity beyond NVIDIA ecosystems.


