Deepgram Pricing

Name: Deepgram Voice AI Platform Pricing
Brand: Deepgram
Price: 0.0065 USD
Availability: InStock

Straightforward costs for projects at any scale.

Find the right plan with our clear, transparent, and flexible pricing. Whether you're just exploring or deploying at enterprise scale, our plans are built for innovation, not contract negotiation.

Pay As You Go

No minimums. No expiration.
No credit card required.

GrowthSave up to 20%

With pre-paid credits for the year.
Credits are redeemed against actual usage.

Overview

Price

Free $200 Credit

then pay-as-you-go

$4K+

/ year

Best for

Developers & Startups

Growing Applications

Access

All endpoint in public models

All endpoints in public models

Usage & Scale (Concurrency)

Speech-to-Text

Up to 50 for the REST API

Up to 150 for the WSS API

Up to 225 for the WSS API

Up to 5 for Deepgram Whisper Cloud

Text-to-Speech

Up to 45 for the REST API + WSS API

Up to 60 for the REST API + WSS API

Voice Agent API

Up to 45 for the WSS API

Up to 60 for the WSS API

Audio Intelligence

Up to 10 for the REST API

Support

Support Channels

Community & Discord

SLAs

Standard Uptime

Get Started

Enterprise

For businesses with large volumes, data or deployment requirements, or support needs.

Contact Sales

Speech to Text

Power your applications with world-class speech recognition APIs. Nova models support 45+ languages with advanced capabilities including Speaker Diarization (multi-speaker detection), Smart Formatting for readability, Keyterm Prompting, and Automatic Language Detection.

For detailed model, language, and feature availability, please refer to our Developer Documentation.

Limited-time promotional rates on streaming.

Model	Pay As You Go	Growth
Flux English i Conversational speech recognition for real-time voice agents with built-in turn detection, natural interruption handling, and ultra-low latency.	$0.0065/min $0.0077/min	$0.0057/min $0.0065/min
Flux Multilingual i Conversational speech recognition for real-time voice agents that handle multiple languages within a single conversation, with built-in turn detection, natural interruption handling, and ultra-low latency.	$0.0078/min	$0.0068/min
Nova-3 Monolingual i Our highest performing model. Recommended for most use cases, especially audio with multiple languages, background noise, crosstalk and far field audio.	$0.0048/min $0.0077/min	$0.0042/min $0.0065/min
Nova-3 Multilingual i Our highest-accuracy multilingual model with automatic language detection. Recommended for audio with multiple languages, background noise, crosstalk, and far-field input.	$0.0058/min $0.0092/min	$0.0050/min $0.0078/min
Custom i Custom speech-to-text models trained on proprietary or novel datasets for maximum accuracy in edge-case scenarios.	Contact Sales	Contact Sales

Model	Pay As You Go
Flux English i Conversational speech recognition for real-time voice agents with built-in turn detection, natural interruption handling, and ultra-low latency.	$0.0065/min $0.0077/min
Flux Multilingual i Conversational speech recognition for real-time voice agents that handle multiple languages within a single conversation, with built-in turn detection, natural interruption handling, and ultra-low latency.	$0.0078/min
Nova-3 Monolingual i Our highest performing model. Recommended for most use cases, especially audio with multiple languages, background noise, crosstalk and far field audio.	$0.0048/min $0.0077/min
Nova-3 Multilingual i Our highest-accuracy multilingual model with automatic language detection. Recommended for audio with multiple languages, background noise, crosstalk, and far-field input.	$0.0058/min $0.0092/min
Custom i Custom speech-to-text models trained on proprietary or novel datasets for maximum accuracy in edge-case scenarios.	Contact Sales

Speech-to-Text Add-ons

Enhance your transcripts with powerful AI understanding features.

Feature	Description	Pay As You Go	Growth
Redaction	Automatically identify and remove sensitive PII such as social security numbers, credit cards, and phone numbers.	$0.0020/min	$0.0017/min
Keyterm Prompting	Boost accuracy for specific domain-specific jargon, product names, or acronyms important to your use case.	$0.0013/min	$0.0012/min
Smart Formatting	Automatically format punctuation, casing, dates, and currency for readability.	Included	Included
Speaker Diarization	Detect multiple speakers and label who spoke when in the transcript.	$0.0020/min	$0.0017/min

Feature	Description
Redaction	Automatically identify and remove sensitive PII such as social security numbers, credit cards, and phone numbers.
Keyterm Prompting	Boost accuracy for specific domain-specific jargon, product names, or acronyms important to your use case.
Smart Formatting	Automatically format punctuation, casing, dates, and currency for readability.
Speaker Diarization	Detect multiple speakers and label who spoke when in the transcript.

Text to Speech

Generate natural, low-latency speech for your voice assistants and conversational AI applications.

Model	Pay As You Go	Growth
Aura-2	$0.030/1k characters	$0.027/1k characters
Aura-1	$0.0150/1k characters	$0.0135/1k characters

Model	Pay As You Go
Aura-2	$0.030/1k characters
Aura-1	$0.0150/1k characters

Voice Agent API

Our Voice Agent API enables real-time conversational AI agents that seamlessly handle interruptions, take complex actions, and deliver natural, responsive customer interactions without delays or rigid turn-taking.

Tier	Pay As You Go	Growth
Standard	$0.075/min i calculated based on websocket connection time.	$0.068/min i calculated based on websocket connection time.
Standard - BYO TTS	$0.065/min i calculated based on websocket connection time.	$0.051/min i calculated based on websocket connection time.
Custom - BYO LLM	$0.065/min i calculated based on websocket connection time.	$0.059/min i calculated based on websocket connection time.
Custom - BYO LLM + TTS	$0.050/min i calculated based on websocket connection time.	$0.041/min i calculated based on websocket connection time.
Advanced	$0.163/min i calculated based on websocket connection time.	$0.146/min i calculated based on websocket connection time.
Advanced - BYO TTS	$0.122/min i calculated based on websocket connection time.	$0.110/min i calculated based on websocket connection time.

Custom - BYO LLM	$0.065/min i calculated based on websocket connection time.
Tier	Pay As You Go
Standard	$0.075/min i calculated based on websocket connection time.
Standard - BYO TTS	$0.065/min i calculated based on websocket connection time.
Custom - BYO LLM + TTS	$0.050/min i calculated based on websocket connection time.
Advanced	$0.163/min i calculated based on websocket connection time.
Advanced - BYO TTS	$0.122/min i calculated based on websocket connection time.

For detailed LLM tier information, please refer to our Developer Documentation.

Audio Intelligence

Analyze audio for insights. Extract actionable insights from conversational audio and text at scale.

Topic Detection
Model	Pay As You Go	Growth
Summarization	$0.0003/1k input tokens - $0.0006/1k output tokens	$0.00024/1k input tokens - $0.00048/1k output tokens
Sentiment Analysis
Intent Recognition

Topic Detection
Model	Pay As You Go
Summarization	$0.0003/1k input tokens - $0.0006/1k output tokens
Sentiment Analysis
Intent Recognition

Rates listed above opt in to the Model Improvement Program.

Security & Compliance

Deepgram is built for enterprise-grade trust, meeting the standards for data protection and privacy.

SOC 2 Type 1 & Type 2 Certified

Independently audited security controls and procedures to ensure your data is protected in the cloud.

HIPAA Compliant

We sign Business Associate Agreements (BAA) for Enterprise customers handling electronic Protected Health Information (ePHI).

GDPR Ready & EU Data Residency

Fully compliant with GDPR. We offer a dedicated EU endpoint to ensure data processing stays within the European Union (api.eu.deepgram.com).

CCPA Compliant

Adheres to the California Consumer Privacy Act (CCPA) to secure privacy rights and control over personal information.

PCI Compliant

Maintains PCI compliance with yearly reviews for secure payment processing.

Frequently Asked Questions

How much does Deepgram Speech-to-Text cost per hour?

Are older models (Nova-2, Enhanced, Base) still available?

Does Deepgram charge for silence or round up audio time?

What is included in the $200 free credit?

How do you calculate costs for multichannel audio?

What is the difference between Pay-As-You-Go and Growth plans?

Are there extra fees for real-time streaming vs. pre-recorded audio?

How does pricing work for the Voice Agent API?

Do Audio Intelligence features cost extra?

Is Text-to-Speech (Aura) billed by the minute or character?

Can I deploy Deepgram on-premise or in a private cloud?

What happens if I exceed my concurrency limit?

Is Deepgram HIPAA and SOC 2 compliant?

Do you offer volume discounts for high-usage applications?

Managing your Deepgram Pay-As-You-Go credits: auto-load settings, refunds, and transfers

At Deepgram, we offer a flexible pay-as-you-go service where customers can purchase credits for usage on our platform. Our goal is to provide transparency and flexibility, while ensuring that you have full control over your account and payments.

Auto-Load Functionality

Managing Your Subscription

Refunds for Unused Credits

Refund Request Process

Disclaimer for Incorrect Audio Files

Credit Transfer Policy