Technology Partner
Twilio Media Streams pipes real-time call audio to Deepgram for transcription. The reference architecture forks call audio to a proxy server over WebSockets; the server forwards it to Deepgram, and transcripts flow back to subscribed clients. Setup needs a Twilio number, a Deepgram API key, and ngrok for local development.
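As a rough sketch of the proxy step, the snippet below decodes one Twilio Media Streams WebSocket message into raw audio bytes and builds a Deepgram streaming URL. It assumes Twilio's default stream format (base64-encoded 8 kHz mu-law audio in `media.payload`) and Deepgram's `wss://api.deepgram.com/v1/listen` endpoint. The helper names `deepgram_listen_url` and `audio_from_twilio_message` are hypothetical, and the WebSocket plumbing itself is left out.

```python
import base64
import json
from typing import Optional
from urllib.parse import urlencode

DEEPGRAM_LISTEN_WS = "wss://api.deepgram.com/v1/listen"


def deepgram_listen_url(encoding: str = "mulaw", sample_rate: int = 8000) -> str:
    """Build the streaming URL; the defaults match Twilio's assumed audio format."""
    query = urlencode({"encoding": encoding, "sample_rate": sample_rate})
    return f"{DEEPGRAM_LISTEN_WS}?{query}"


def audio_from_twilio_message(raw: str) -> Optional[bytes]:
    """Return raw audio bytes for a 'media' event, or None for any other event."""
    msg = json.loads(raw)
    if msg.get("event") != "media":
        # 'connected', 'start', and 'stop' events carry no audio to forward.
        return None
    return base64.b64decode(msg["media"]["payload"])
```

In a proxy, each non-`None` result would be sent as a binary frame on the Deepgram socket, which is authenticated with an `Authorization: Token <API key>` header.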
For batch transcription of completed calls, Programmable Voice recordings flow into Deepgram via API.
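A minimal sketch of the batch path, assuming the recording is reachable at a URL Deepgram can fetch and using Deepgram's pre-recorded endpoint `POST https://api.deepgram.com/v1/listen` with a JSON `url` body. The function name `build_batch_request` is hypothetical, and the request is only constructed here, not sent.

```python
import json
import urllib.request

DEEPGRAM_LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_batch_request(recording_url: str, api_key: str) -> urllib.request.Request:
    """Prepare a POST asking Deepgram to fetch and transcribe a hosted recording."""
    body = json.dumps({"url": recording_url}).encode("utf-8")
    return urllib.request.Request(
        DEEPGRAM_LISTEN_URL,
        data=body,
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )
```

Passing the result to `urllib.request.urlopen` would submit the job. If the recording URL requires authentication, the audio would need to be downloaded first and posted as the request body instead.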
Aura-2 handles speech synthesis for AI-generated responses on the outbound side of a Twilio call.
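To illustrate the outbound side, the sketch below builds a Deepgram text-to-speech URL and wraps synthesized audio in a Twilio Media Streams `media` message. It assumes a bidirectional stream, 8 kHz mu-law output without a container header, and the voice `aura-2-thalia-en`. The helper names and the chosen query parameters are assumptions for illustration, not a documented recipe from this page.

```python
import base64
import json
from urllib.parse import urlencode

DEEPGRAM_SPEAK_URL = "https://api.deepgram.com/v1/speak"


def speak_url(model: str = "aura-2-thalia-en") -> str:
    """Build a TTS URL requesting headerless 8 kHz mu-law audio (assumed settings)."""
    params = {
        "model": model,
        "encoding": "mulaw",
        "sample_rate": 8000,
        "container": "none",
    }
    return f"{DEEPGRAM_SPEAK_URL}?{urlencode(params)}"


def twilio_media_message(stream_sid: str, mulaw_audio: bytes) -> str:
    """Wrap raw mu-law bytes in the JSON shape Twilio expects on the stream socket."""
    return json.dumps({
        "event": "media",
        "streamSid": stream_sid,
        "media": {"payload": base64.b64encode(mulaw_audio).decode("ascii")},
    })
```

The text to synthesize would be posted to `speak_url()` as JSON of the form `{"text": "..."}`, and the returned bytes passed through `twilio_media_message` before being written to the call's WebSocket.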
Full voice agent flows orchestrate STT, an LLM, and TTS through Twilio. Documented patterns include OpenAI as the LLM and the Deepgram Voice Agent API as the speech layer.
For non-developer ops teams, Twilio Studio plus Pipedream plus Deepgram delivers automated transcription pipelines with email or webhook delivery.
If you are building voice features on Twilio, the fastest start is the Marketplace Add-On. For the integration patterns above, the developer docs walk through each one end to end. Enterprise terms are available via deepgram.com/contact-us.

Media Transcription
Contact Centers
Conversational AI