Technology Partner

LiveKit

Deepgram and LiveKit enable developers to build production-ready voice AI agents with real-time speech recognition and natural text-to-speech, all within LiveKit's open-source framework for voice, video, and physical AI agents.

LiveKit's Agents framework ships native Deepgram plugins for both STT and TTS. Developers run Nova-3 for high-accuracy streaming transcription across 30+ languages, or Flux for conversational speech recognition with built-in end-of-turn detection and interruption handling. Flux is purpose-built for voice agent workflows where handling barge-in cleanly is the difference between a demo and a production deployment. Model selection is a single configuration line in Python or TypeScript.

The integration covers the core voice agent use cases: customer support automation, multilingual voice assistants, meeting bots, and inbound/outbound telephony via LiveKit's SIP integration. Deepgram models are also available through LiveKit Inference, with billing and model access managed directly through LiveKit's platform.

LiveKit runs on a global cloud network built for production workloads, with over 300,000 developers and billions of calls processed annually. With voice-native infrastructure and Deepgram's sub-300ms latency speech models, development teams get the fastest path from prototype to production for agents that hold real conversations at scale.

Build a Voice Agent with LiveKit and Deepgram: https://developers.deepgram.com/docs/build-voice-agent-with-livekit-and-deepgram

Deepgram and LiveKit
Technology

Communications / CPaaS

Speech to Text

Text to Speech

STT Nova

TTS Flux


Looking to use Deepgram + LiveKit?

Talk to an Expert