
Job Overview
Location: USA | Remote
Job Type: Full-time
Category: DevOps & SysAdmin
Date Posted: April 7, 2026
Full Job Description
📋 Description
- As a Systems Architect for AI/ML Infrastructure at Deepgram, you will own the end-to-end infrastructure architecture that powers real-time voice AI at massive scale, serving both production inference and research training workloads across bare metal GPU clusters, multi-cloud deployments, and global edge presence.
- You will define and drive infrastructure strategy, design compute and storage orchestration systems, lead capacity planning, drive cost optimization through FinOps practices, and architect burstable, elastic training infrastructure that scales with Deepgram's rapidly growing demands while collaborating with engineering leadership to align infrastructure with product roadmap and business objectives.
- Deepgram is the leading platform underpinning the emerging trillion-dollar Voice AI economy, providing real-time APIs for speech-to-text, text-to-speech, and voice agents, with over 200,000 developers and 1,300+ organizations building voice offerings powered by its technology, backed by a recent Series C and over $215M in total funding from investors including Y Combinator, Tiger Global, and NVIDIA.
- You will operate in an AI-first environment where experimentation, adaptation, and continuous learning are core to success, working at the pace of AI with rapidly evolving day-to-day responsibilities, and have the opportunity to shape the foundational infrastructure that enables Deepgram's industry-leading voice AI capabilities while establishing architectural standards and technical documentation practices.
About Deepgram Inc.
Deepgram builds end-to-end speech AI infrastructure that converts live or recorded audio into text and insights. The company trains large-scale neural networks on GPU clusters to deliver low-latency transcription, keyword detection, and speaker diarization through a single API. Developers use the platform for call centers, meetings, podcasts, and voice bots, paying per minute or hosting the engine on-premises. Founded in 2015 and headquartered in San Francisco, Deepgram serves enterprises seeking accurate, private, and customizable speech recognition without vendor lock-in.
