d-Matrix Corporation

Software Infrastructure Engineer, Senior Staff

Job Overview

Location: Santa Clara
Job Type: Full-time
Category: Software Engineer
Date Posted: May 16, 2026

Full Job Description

📋 Description

  • Design, develop, and maintain scalable software infrastructure systems to support the entire software engineering organization at d-Matrix, with a focus on enabling ML accelerator development on both hardware and software platforms.
  • Lead the implementation and optimization of GitLab CI/CD pipelines, including merge trains, automated testing, and code review workflows, to ensure reliable and efficient software delivery.
  • Architect and manage containerized environments using Docker and Podman, ensuring consistency across development, testing, and production stages for ML workloads.
  • Deploy and oversee orchestration platforms such as Kubernetes (K8s) for managing distributed ML workloads in data center environments.
  • Integrate and maintain build systems using Bazel, including automation of code coverage analysis, dependency management, and build validation across multi-language codebases.
  • Develop and enhance DevOps tooling for root-causing CI/CD failures, including bisect tools, remote code coverage analyzers, linting systems, and security/vulnerability scanning integrations.
  • Collaborate with software and hardware teams to align infrastructure with the demands of proprietary ML accelerator systems, ensuring seamless integration between software pipelines and hardware validation workflows.
  • Establish and monitor DevOps metrics to quantify system reliability, build throughput, test coverage, and incident resolution times, driving data-informed improvements.
  • Maintain and improve Linux-based development and deployment stacks, ensuring robustness, performance, and security across the software development lifecycle.
  • Enable secure and efficient data source integration workflows to support training and validation datasets used in ML model development.
  • Provide technical leadership and mentorship to junior engineers on infrastructure best practices, CI/CD standards, and containerization strategies.
  • Proactively identify bottlenecks in the software development lifecycle and implement scalable solutions to reduce developer friction and accelerate time-to-deploy.
  • Stay current with emerging trends in MLOps, AI infrastructure, and DevOps tooling, and evaluate their applicability to d-Matrix’s proprietary ML accelerator stack.
  • Participate in the on-call rotation to respond to critical infrastructure incidents and ensure high availability of development tools and pipelines.
  • Contribute to documentation, knowledge sharing, and cross-functional training to ensure organizational resilience and institutional memory for core infrastructure systems.

🎯 Requirements

  • Degree in Computer Science, Engineering, Math, Physics, or a related field
  • 7+ years of industry experience in software infrastructure and DevOps
  • Proficient in C/C++ and Python
  • Proficient with the GitLab merge process, merge trains, and CI/CD
  • Proficient with Docker and Podman containers
  • Experience with orchestration tools for data center deployment of ML workloads, such as K8s

🏖️ Benefits

  • Hybrid work model with 3-5 days per week onsite at Santa Clara, CA headquarters
  • Equal opportunity workplace with inclusive, collaborative culture
  • Opportunity to work at the forefront of generative AI and ML accelerator innovation
  • Exposure to cutting-edge AI compute and subsystem technologies in a startup environment

Skills & Technologies

Python
Docker
Kubernetes
GitLab
Linux
DevOps

About d-Matrix Corporation

d-Matrix designs silicon for high-efficiency AI inference at scale. Its Corsair compute platform combines in-memory computing with a digital approach to slash latency and energy use in transformer and generative workloads. Targeting hyperscale data centers and edge deployments, the company offers hardware and software stacks that integrate into existing AI pipelines. Founded in 2019 and headquartered in Santa Clara, California, d-Matrix serves cloud and enterprise customers seeking cost-effective alternatives to GPUs for large language model serving.
