Deepgram vs ElevenLabs
ElevenLabs is built for batch use cases; Deepgram is built for real-time use cases.
Deepgram streaming STT is #1. ElevenLabs does not offer streaming STT.
Deepgram Aura-2 is #1 for enterprise use cases, and 40% less expensive than ElevenLabs.
Deepgram offers an enterprise-grade Voice Agent API, powered by Deepgram Enterprise Runtime.
Deepgram is available via on-premises or cloud APIs.

User Preference for Enterprise Use Cases (Blinded Human Evals)
Speech to Text
Power your products with world-class speech recognition. Everything developers need to build with confidence and ship faster. Unmatched performance guaranteed:
• The most accurate speech to text model in the world
• Batch and streaming STT
• Available via on-premises and cloud APIs

Text To Speech
In blinded evals, users prefer Deepgram to ElevenLabs for enterprise use cases, 62% to 38%.
• Deepgram Aura-2 is built for real-time use cases
• Deepgram Aura-2 is priced 40% lower than ElevenLabs Flash
• Deepgram Aura-2 is the leading choice for enterprise use cases

Conversational Voice AI Agents
Deepgram Voice Agent API is the industry's only offering that delivers the single API experience developers love combined with the full controllability enterprises need. No need to stitch together STT, TTS and LLM orchestration. No black box limitations. Performs at the latency of human speech. Priced at $4.50 per hour.
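As a rough illustration of the $4.50 per hour price quoted above, the following sketch estimates spend from total agent minutes. It assumes cost scales linearly with connected time, with no minimums, rounding rules, or volume discounts; the constant and function names are hypothetical and are not part of any Deepgram SDK.

```python
from decimal import Decimal

# Price quoted in the text above. Linear, per-minute proration is an assumption.
VOICE_AGENT_USD_PER_HOUR = Decimal("4.50")


def estimate_voice_agent_cost(total_minutes: float) -> Decimal:
    """Estimate spend in USD for a number of agent minutes (hypothetical helper)."""
    if total_minutes < 0:
        raise ValueError("total_minutes must be non-negative")
    # Convert via str() so float inputs such as 2.5 do not carry binary noise.
    hours = Decimal(str(total_minutes)) / Decimal(60)
    return (VOICE_AGENT_USD_PER_HOUR * hours).quantize(Decimal("0.01"))


if __name__ == "__main__":
    # Example workload: 1,000 calls averaging 3 minutes each (3,000 min = 50 h).
    print(estimate_voice_agent_cost(1000 * 3))  # prints 225.00
```

Actual invoices may differ depending on billing granularity and plan terms, so treat the result as a planning estimate only.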

Trusted by enterprises and AI leaders for STT, TTS, and Voice Agent API
Deepgram showed me less than 200ms latency today. That's the fastest text-to-speech I’ve ever seen. And, our customers would be more than satisfied with the conversation quality.

Jordan Dearsley
Co-founder, Vapi

Aura-2’s remarkable clarity and naturalness significantly enhance our conversational AI solutions, making customer interactions smoother and more engaging.
Thys Waanders
SVP of AI Transformation, Cognigy

Aura-2 sets a new bar for enterprise-grade TTS. The clarity, consistency, and low latency it delivers have been game-changers for our AI agent experiences.
Bernardo Aceituno
Co-Founder, Stack AI
We’ve been really impressed with the quality and responsiveness of Aura-2. Deepgram’s approach to text-to-speech shows real promise for creating natural, scalable voice experiences.
David Lawson
Co-Founder & CEO, Call Simulator, Inc.
In our initial tests with Deepgram's text-to-speech, we've consistently seen sub-200ms latency, which is 2-4x faster than what we've been getting with play.ht and virtually all other text-to-speech providers.

Will Bodewes
CEO, Phonely

We’ve decided to use Aura for our text-to-speech because latency is crucial to a natural conversational flow. We’ve tried other vendors and Aura is the fastest of the high voice quality options.

Bruce Sharpe
Chief Product Officer, Humach

Deepgram and Groq share the belief that speed and efficiency are the missing ingredients in unlocking natural AI for daily use by everyone.
Aura's very fast time-to-first-byte and natural voice quality make it a perfect fit for conversational AI agents.

Kwindla Hultman Kramer
CEO, Daily

Aura’s voices are very realistic and extremely fast, striking the right balance between quality and latency that’s needed to put in production.

Leandro Torres
Co-founder, Voxity

Aura is becoming the preferred choice for our voice AI customers at Retell AI. With its impressive 200ms latency, affordability, and human-like voices, it has set a new standard within our offerings.