
Job Overview
Location
USA | Remote
Job Type
Full-time
Category
Data Scientist
Date Posted
March 12, 2026
Full Job Description
đź“‹ Description
- • Deepgram is at the forefront of the burgeoning trillion-dollar Voice AI economy, offering a leading platform with real-time APIs for speech-to-text (STT), text-to-speech (TTS), and the creation of production-grade voice agents at scale.
- • We empower over 200,000 developers and more than 1,300 organizations to build innovative voice solutions, with prominent clients including Twilio, Cloudflare, Sierra, Decagon, Vapi, Daily, Cresta, Granola, and Jack in the Box.
- • Our voice-native foundation models are accessible via cloud APIs or as self-hosted and on-premises software, delivering unparalleled accuracy, minimal latency, and exceptional cost efficiency.
- • With a recent Series C funding round led by top global investors and strategic partners, Deepgram has processed an astounding 50,000+ years of audio and transcribed over 1 trillion words, solidifying our position as the world's foremost authority on voice technology.
- • At Deepgram, we embrace an AI-first operating rhythm where the use and comfort with AI are not optional but fundamental to our innovation, performance, and daily operations.
- • Every team member is expected to actively utilize and experiment with advanced AI tools, integrating them into their workflows and even building custom AI solutions to enhance productivity and outcomes.
- • Performance is measured by the effective application of AI to achieve results, making consistent and creative use of the latest AI capabilities a key determinant of success.
- • Candidates must be adept at rapidly adopting new models and methodologies, seamlessly integrating AI into their work, and continuously pushing the boundaries of what AI technologies can accomplish.
- • Our operational pace mirrors the rapid evolution of AI, meaning day-to-day responsibilities can change quickly. This role is ideal for individuals excited by experimentation, adaptation, agile thinking, and continuous learning, rather than those seeking a highly prescriptive, traditional 9-to-5 environment.
- • The opportunity lies in addressing the fundamental data challenges of conversational audio, which are significantly more complex than text-based data. Real-world audio data is scarce, highly diverse, and spans a vast spectrum of voices, speaking styles, and acoustic conditions.
- • The inherent high dimensionality of audio data presents substantial computational and storage challenges, making large-scale training and deployment prohibitively expensive.
- • We believe that novel paradigms for audio AI are essential to overcome these hurdles and democratize voice interaction for everyone.
- • Deepgram is seeking experienced Data Scientists with a proven track record of solving complex data problems and exploring research frontiers to join our Research Staff.
- • This role involves building an industrial-scale “data factory” to power the next generation of Voice AI systems.
- • The goal is to unlock the creation of models that transcend basic transcription and comprehension, enabling the capture of nuanced meanings in complex conversations, robust adaptation to diverse speech patterns, and the generation of empathic, human-like, contextualized speech.
- • You will collaborate closely with our product, engineering, and data teams to develop and deploy models within the most scalable voice API available.
- • We encourage you to bring your expertise, share insights from your latest experiments, and contribute to pushing the boundaries of AI and voice technology.
- • We are looking for individuals who view "unsolved" problems as opportunities to pioneer entirely new approaches and can identify the single critical experiment to validate or invalidate an idea swiftly.
- • The role requires the vision to scale successful proofs-of-concept by orders of magnitude and an obsession with leveraging AI to automate and amplify personal impact.
- • If you are energized by these challenges and already conceptualizing innovative solutions, you may be the ideal researcher for this position.
- • This role demands an obsession with the problems, creativity in approach, and a relentless drive towards elegant, scalable solutions, tackling immense technical challenges with the potential for transformative impact.
- • Key responsibilities include driving high-performance data acquisition, preparation, and synthesis pipelines for next-generation speech and language AI foundation models.
- • You will develop advanced characterizations of complex conversational audio using a diverse toolkit of signal processing techniques and deep learning models.
- • Collaboration with DataOps and Engineering teams is crucial for creating automated systems that enhance the capacity of human annotators to label high-value data and provide feedback on model outputs.
- • Building advanced benchmarking methodologies and curated datasets for evaluating conversational voice systems is a core function.
- • Documenting and presenting the results of data experiments and analyses to both internal and external audiences will be a regular activity.
- • This role is ideal for those obsessed with deciphering complex or messy data, who enjoy building systems from the ground up, and are passionate about leveraging AI and data to solve challenging problems.
- • You will be motivated by the prospect of scaling your own capabilities through automation and AI models.
- • Essential experience includes building data processing pipelines from inception, owning the entire data stack from acquisition to transformation, and applying statistical methods and deep learning models to complex data.
- • Strong communication skills are vital for translating complex concepts into easily understandable terms for various audiences.
- • Robust software engineering skills, particularly in developing clean, modular Python code and working with PyTorch, are required.
Skills & Technologies
About Deepgram Inc.
Deepgram builds end-to-end speech AI infrastructure that converts live or recorded audio into text and insights. The company trains large-scale neural networks on GPU clusters to deliver low-latency transcription, keyword detection, and speaker diarization through a single API. Developers use the platform for call centers, meetings, podcasts, and voice bots, paying per minute or hosting the engine on-premise. Founded in 2015 and headquartered in San Francisco, Deepgram serves enterprises seeking accurate, private, and customizable speech recognition without vendor lock-in.
Subscribe to the weekly newsletter for similar remote roles and curated hiring updates.
Newsletter
Weekly remote jobs and featured talent.
No spam. Only curated remote roles and product updates. You can unsubscribe anytime.
Similar Opportunities

Wayflyer Limited
1 month ago

Shift Technology SAS
2 months ago

Feedzai, Inc.
2 months ago
1 month ago

