📣 Deepgram Accelerates Into 2025, Empowering 200,000+ Developers From Startups to Global Enterprises to Build Voice AI 📣

Article·Announcements·Jan 29, 2025
4 min read

AI Company Ends 2024 Cash-flow Positive with 400+ Enterprise Customers, 3.3x Annual Usage Growth Across the Past Four Years, Over 50,000 Years of Audio Processed, and Over One Trillion Words Transcribed
By Praveen Rangnath
Published Jan 29, 2025
Updated Jan 29, 2025

SAN FRANCISCO, January 29, 2025 -- Deepgram, the leading voice AI platform for developers building speech-to-text (STT), text-to-speech (TTS), and full speech-to-speech (STS) offerings, today announced record business growth and technical milestones achieved in the past year. Today, over 200,000 developers build with Deepgram’s voice-native foundational models, choosing the platform for its unmatched accuracy, low latency, and pricing, as well as the flexibility to access every voice-native AI model through cloud APIs or self-hosted / on-premises APIs. Organizations that build on Deepgram’s infrastructure for STT, TTS, and AI Voice Agents include technology ISVs building voice products or platforms, co-sell partners working with large enterprises, and enterprises solving internal use cases.

“2024 was a stellar year for Deepgram, as our traction is accelerating and our long-term vision of empowering developers to build voice AI with human-like accuracy, human-like expressivity, and human-like latency is materializing,” said Scott Stephenson, CEO of Deepgram. “Our product strategy from founding has been to focus on deep-tech first, and the work we have done in building 3-factor automated model adaptation, extreme compression on latent space models (LSMs), hosting models with efficient and unrestricted hot-swapping, and symmetrical delivery across public cloud, private cloud, or on-premises, uniquely positions us to succeed in the $50B market for voice AI agents in demanding environments requiring exceptional accuracy, lowest COGS, highest model adaptability, and lowest latency.”   

“Integrating AI voice agents will be one of the most impactful initiatives for our business operations over the next five years, driving unparalleled efficiency and elevating the quality of our service. Our partnership with Deepgram has allowed us to explore their Voice AI solutions and its benefits in our restaurants,” said Doug Cook, CTO of Jack in the Box.  

“As we build out our voice AI platform, it is crucial that we pick the best underlying voice AI technologies,” said Jordan Dearsley, co-founder and CEO of Vapi. “Building on Deepgram’s leading AI models and APIs accelerates our path to build an AI platform that is highly accurate, real-time, and easy to use. Deepgram's solution has proven to be a game-changer, offering up to 30% lower word error rates, 40x faster processing times, and 3-5x cost efficiency compared to competitors.”  

Looking ahead to 2025, Deepgram will continue to innovate to extend its unique value proposition: the highest accuracy, lowest COGS at scale, highest model adaptability, and lowest latency. Deepgram expects to end 2025 as the industry’s only end-to-end speech-to-speech solution built to solve the four critical challenges of enterprise-ready voice AI:

  1. Accuracy / audio perception: Enterprise use cases require accurate recognition, understanding, and generation of specialized vocabulary, often in challenging audio conditions. Deepgram solves this through novel, non-lossy compression of these spaces for rapid processing, paired with generation, training, and evaluation on synthetic data that precisely matches Deepgram customers’ real-world conditions.

  2. COGS at scale: Deepgram customers need to profitably build and scale voice AI solutions. Deepgram delivers this through its unique latent audio model with extreme compression combined with deep expertise in high-performance computing.

  3. Latency: Real-time conversation requires near-instantaneous responses. Deepgram achieves this using streaming state space model architectures, optimized specifically for the underlying hardware to deliver minimal processing delays.

  4. Context: Effective conversations are deeply contextualized. Deepgram will pass the speech Turing test thanks to its ability to train on vast bodies of data that thoroughly represent its customers’ use cases and pass that context through the entire system and interaction.

About Deepgram

Deepgram is the leading voice AI platform for developers building speech-to-text (STT), text-to-speech (TTS), and full speech-to-speech (STS) offerings. More than 200,000 developers build with Deepgram’s voice-native foundational models, accessed through cloud APIs or as self-hosted / on-premises APIs, because of their unmatched accuracy, low latency, and pricing. Customers include technology ISVs building voice products or platforms, co-sell partners working with large enterprises, and enterprises solving internal use cases. Having processed over 50,000 years of audio and transcribed over 1 trillion words, Deepgram understands voice better than any other organization in the world. To learn more, visit www.deepgram.com, read our developer docs, or follow @DeepgramAI on X and LinkedIn.