Systems Architect AI/ML Infrastructure

Job Overview

Location

USA | Remote

Job Type

Full-time

Category

DevOps & SysAdmin

Date Posted

April 7, 2026

Full Job Description

Description

  • As a Systems Architect for AI/ML Infrastructure at Deepgram, you will own the end-to-end infrastructure architecture that powers real-time voice AI at massive scale, serving both production inference and research training workloads across bare metal GPU clusters, multi-cloud deployments, and global edge presence.
  • You will define and drive infrastructure strategy, design compute and storage orchestration systems, lead capacity planning, drive cost optimization through FinOps practices, and architect burstable, elastic training infrastructure that scales with Deepgram's rapidly growing demands. You will also collaborate with engineering leadership to align infrastructure with the product roadmap and business objectives.
  • Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text, text-to-speech, and voice agents. Over 200,000 developers and 1,300+ organizations build voice offerings powered by its technology. The company is backed by a recent Series C and over $215M in total funding from investors including Y Combinator, Tiger Global, and NVIDIA.
  • You will operate in an AI-first environment where experimentation, adaptation, and continuous learning are core to success, working at the pace of AI with rapidly evolving day-to-day responsibilities. You will have the opportunity to shape the foundational infrastructure that enables Deepgram's industry-leading voice AI capabilities while establishing architectural standards and technical documentation practices.

Skills & Technologies

AWS
Kubernetes
DevOps
Senior
Remote

About Deepgram Inc.

Deepgram builds end-to-end speech AI infrastructure that converts live or recorded audio into text and insights. The company trains large-scale neural networks on GPU clusters to deliver low-latency transcription, keyword detection, and speaker diarization through a single API. Developers use the platform for call centers, meetings, podcasts, and voice bots, paying per minute or hosting the engine on-premise. Founded in 2015 and headquartered in San Francisco, Deepgram serves enterprises seeking accurate, private, and customizable speech recognition without vendor lock-in.
