Text-to-Speech API for Voice Agents

Aura-2 delivers sub-200ms streaming text-to-speech built for voice agents, with domain-specific accuracy and secure, scalable deployment across cloud and on-prem environments.

Aura-2 Text-to-Speech features

Aura-2 is engineered for the demands of real-time voice agents: low latency, cost-effective scale across thousands of concurrent sessions, and the reliability production workloads require.

Domain-tuned pronunciation

Ensures accurate pronunciation for industry-specific terminology in healthcare, finance, legal, and beyond.

Authentic, natural voices

Features 40+ English voices with localized accents, delivering natural, business-appropriate speech for professional settings.

Context-aware delivery

Adjusts pacing, tone, and expression to ensure smooth, coherent communication in any context.

Real-time performance

Delivers sub-200ms latency for ultra-responsive interactions, while efficiently handling thousands of concurrent requests.
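A latency figure like this is easiest to check by timing how long the first audio chunk takes to arrive. The sketch below shows one way to do that for any byte stream. `time_to_first_chunk` and `fake_stream` are hypothetical names introduced for illustration, and the fake stream stands in for a real streaming response, so the numbers it prints say nothing about Aura-2 itself.

```python
import time
from typing import Callable, Iterable, Iterator, Optional, Tuple


def time_to_first_chunk(
    open_stream: Callable[[], Iterable[bytes]],
) -> Tuple[Optional[float], int]:
    """Open a stream and consume it fully.

    Returns (seconds until the first non-empty chunk, total bytes received).
    The timer starts before the stream is opened so request setup is included.
    """
    start = time.perf_counter()
    first: Optional[float] = None
    total = 0
    for chunk in open_stream():
        if chunk and first is None:
            first = time.perf_counter() - start
        total += len(chunk)
    return first, total


def fake_stream(delay_s: float = 0.05, n: int = 3) -> Iterator[bytes]:
    """Hypothetical stand-in for a streaming TTS response (assumption)."""
    time.sleep(delay_s)  # simulated network and synthesis delay
    for _ in range(n):
        yield b"\x00" * 320  # placeholder audio bytes


ttfb, size = time_to_first_chunk(fake_stream)
print(f"first audio after {ttfb * 1000:.0f} ms, {size} bytes total")
```

Passing a callable rather than an already-open stream keeps connection and request time inside the measurement, which is closer to what a caller on the phone actually experiences.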

Cost-effectiveness at scale

Offers enterprise-grade speech at $0.030 per 1,000 characters with no hidden fees. Volume discounts are available for large deployments.
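For budgeting, the quoted rate converts to a cost estimate with simple arithmetic. The sketch below uses only the $0.030 per 1,000 characters figure from this page. It ignores volume discounts and assumes billing is strictly proportional to character count, which this page does not confirm. The workload numbers are made up for illustration.

```python
from decimal import Decimal

# Rate quoted on this page: $0.030 per 1,000 characters.
RATE_PER_1K_CHARS = Decimal("0.030")


def estimate_cost(num_chars: int, rate_per_1k: Decimal = RATE_PER_1K_CHARS) -> Decimal:
    """Return the estimated list-price cost in dollars for num_chars characters.

    Assumes cost scales linearly with characters; volume discounts are ignored.
    """
    if num_chars < 0:
        raise ValueError("num_chars must be non-negative")
    return (Decimal(num_chars) * rate_per_1k) / Decimal(1000)


# Hypothetical workload: 10,000 calls averaging 1,500 synthesized characters each.
monthly_chars = 10_000 * 1_500
print(f"${estimate_cost(monthly_chars):.2f}")  # prints $450.00
```

`Decimal` is used instead of floats so that small per-character amounts do not pick up rounding noise when summed across large volumes.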

Flexible deployment options

Supports public cloud, private cloud, and on-premises deployments to help meet compliance and security requirements.

Enterprise-ready AI voices

Voice agents don't need cinematic range. They need clarity, consistency, and low listener fatigue across thousands of turns. Aura-2's 40+ voices are tuned for professional conversations in support, sales, healthcare, and finance, with consistent pacing and enunciation that builds trust on every call.

Scalable infrastructure for Text-to-Speech

Aura-2 runs on the Deepgram Enterprise Runtime, the same infrastructure that powers our trusted speech-to-text and speech-to-speech capabilities. It gives builders the control, adaptability, and performance needed to deploy and scale production-grade voice AI.

Speech-to-Text leadership enhances Text-to-Speech

When speech-to-text (STT) and text-to-speech (TTS) run on the same streaming infrastructure, the entire speech loop gets faster: fewer handoffs, lower latency, and consistent pronunciation across what the agent hears and what it says. Deepgram's unified architecture means improvements in speech recognition directly sharpen text-to-speech accuracy.

Deepgram Text-to-Speech resources

Explore real-world applications, insights, and industry trends to see how Aura-2 is powering voice agents across industries.

News

Silicon Angle

Deepgram’s Aura-2 is a high-performance text-to-speech engine built for business interactions

News

AIM: Deepgram's New Text-to-Speech AI Model Outperforms ElevenLabs and Open AI

Blog

Announcements

Introducing Aura-2: The World’s Most Professional, Cost-Effective, and Enterprise-Grade Text-to-Speech Model

Jose Nicholas Francisco

Trusted by voice agent builders for Text-to-Speech

Start building with Aura-2 today

Real-time, streaming-first text-to-speech that's ready for production voice agents, from first prototype to thousands of concurrent calls.
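As a starting point for a first prototype, the sketch below builds (but does not send) an HTTP request for speech synthesis using only the Python standard library. This page does not document the API surface, so the endpoint URL, the `model` query parameter, the voice name `aura-2-thalia-en`, the `Token` authorization scheme, and the JSON body shape are all assumptions to verify against the current API reference. `build_speak_request` is a hypothetical helper name.

```python
import json
import urllib.parse
import urllib.request

# Assumed values, not taken from this page; confirm against the API reference.
SPEAK_URL = "https://api.deepgram.com/v1/speak"
DEFAULT_MODEL = "aura-2-thalia-en"


def build_speak_request(
    text: str, api_key: str, model: str = DEFAULT_MODEL
) -> urllib.request.Request:
    """Build a POST request asking the service to synthesize `text`.

    The request is only constructed here; nothing is sent over the network.
    """
    if not text:
        raise ValueError("text must be non-empty")
    query = urllib.parse.urlencode({"model": model})
    return urllib.request.Request(
        url=f"{SPEAK_URL}?{query}",
        data=json.dumps({"text": text}).encode("utf-8"),
        headers={
            "Authorization": f"Token {api_key}",  # assumed auth scheme
            "Content-Type": "application/json",
        },
        method="POST",
    )


req = build_speak_request("Your appointment is confirmed.", api_key="YOUR_API_KEY")
print(req.get_method(), req.full_url)
```

To actually send it, the request can be passed to `urllib.request.urlopen` and the response read in chunks, so audio playback can begin before synthesis finishes.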

AI Minds #056 | Jordan Dearsley, Founder & CEO at Vapi

Deepgram Accelerates Into 2025, Empowering 200,000+ Developers From Startups to Global Enterprises to Build Voice AI

AI Minds #054 | Brent Pretty, CEO and Founder at Retellio

Now Available: Deepgram Aura’s Websocket Interface for Faster Text to Speech Input Streaming
