Deepgram

Meet the most powerful Speech to Text API

Supercharge your voice-enabled products with Deepgram's leading speech-to-text API. Achieve pinpoint precision in real-time, blazingly fast. But that's just the beginning. Scale even further with custom model training for even higher transcription accuracy on business-critical terminology.

Explore how Deepgram can take your voice products to new heights.

Trusted by the world’s top Enterprises, Startups, and Researchers

Superior accuracy. Lightning-fast. Most affordable.

Fast, efficient, low-cost transcription. Pick three. No tradeoffs required.

36languages (and counting)
Support the needs of your global customers with best-in-class transcription in over 30 languages and dialects. Deepgram provides various use-case-specific speech-to-text models to choose from.

Transcribe in real-time

Deepgram provides the fastest real-time transcription speeds (under 300 millisecond latency) to drive human-like conversational AI experiences, real-time analytics, and enablement.

Over 90% accuracy

Deepgram's Nova-2 model leads the industry with the highest transcription accuracy in the market across use case categories.

Up to 40x faster

Quickly transcribe audio in real-time or an hour of pre-recorded audio in about 12 seconds.

Speech models for every use case
Our AI-powered models ensure top accuracy for industries like contact centers, healthcare, and conversational AI. Boost performance with custom models tailored to your business needs.

Flexible deployment
Choose cloud, on-premises, or private cloud to securely manage voice and transcription data with Kubernetes, Docker, and pre-built VM support for easy setup in any environment.

Save thousands on transcription
By using GPUs, not CPUs, Deepgram provides faster, scalable transcription at lower costs. Get a more affordable and efficient solution than big tech or open-source alternatives.

High-performance Voice AI
Power high-demand, real-time voice apps with our fast, accurate speech-to-text, text-to-speech, and conversational AI models—all accessible via simple API integration.

Enterprise-grade security
Protect sensitive data with enterprise-grade security protocols. Deepgram complies with standards like PCI, SOC 2, and HIPAA, safeguarding your customers' privacy.

Efficient, scalable AI solutions
Deepgram’s AI models are optimized for high efficiency, enabling cost savings and support for high concurrent usage. Easily scale your voice AI applications with our robust infrastructure.

Deepgram raises the bar for Speech to Text performance

Elevate your transcription with better features and custom speech-to-text models. All the features you need, with better performance, at the lowest price.
See how Deepgram stacks up.

38%more accurate than Assembly AI
5xfaster than Assembly AI
5xcheaper than Assembly AI
logo
Assembly AI
13.6%
143.2s
$0.0108
VS
Word Error Rate
Speed
Cost
logo
Deepgram
8.4%
29.8s
$0.0043
Word Error Rate (WER) [%] Speed (Median Inference Time [Sec] Per Audio Hour). Lower is better.
38%more accurate than Assembly AI
5xfaster than Assembly AI
3xcheaper than Assembly AI

Try Deepgram Speech to Text

Play around with transcribing sample audio files.

Select an audio file
Call Center Nike Support:Customer Service
00:00
00:00
Medical Pediatric:Healthcare Call
00:00
00:00
TranscriptionModel: Nova-2

[Speaker 0]: Hi. Thank you so much for calling Nike. This is Allison. How can I help you today? timestamp: 1.83-6.33 

[Speaker 1]: Hey. I was supposed to receive a shoe order last Tuesday, and it's now Wednesday, a week later. So, like, what's going on? timestamp: 7.25-22.16 

[Speaker 0]: Oh, okay. I see. Could I have your order number, please? timestamp: 22.16-26.93 

[Speaker 1]: Yeah. It's 905933 679. timestamp: 27.55-28.77
 

[Speaker 0]: Okay. Thank you so much for that. Okay. I apologize. timestamp: 32.46-41.89 

[Speaker 0]: It looks like your order there's been some inclement weather in, like, the Midwest especially in the north. And your package got stuck at a facility in Wyoming because of that. But it looks like it's cleared since, and the package should be moved to New Jersey. It should be at your home address in 4 days. So it it should be arriving on Saturday. timestamp: 42.35-71.70
 

[Speaker 1]: So it'll be arriving almost 2 weeks late instead of a week and a half late. timestamp: 72.8-77.85
 

[Speaker 0]: Yeah. I I'm I'm really sorry about that. I can see here that you know, you you're a long time customer with us, and, you know, you've placed a lot of orders. So if it's okay with you, on your next order that you placed through our website because we messed up this time, I'd like to offer you a 40% discount on your next purchase with us. timestamp: 77.99-101.67
 

[Speaker 1]: Yeah. I mean, that sounds good to me. timestamp: 103.65-106.07
 

[Speaker 0]: Okay. That's good. Okay. Great. Okay. timestamp: 106.21-110.84
 

[Speaker 0]: Your yeah. Your shoe should be there on Saturday, this coming Saturday. Is there anything else I can help you with at this time? timestamp: 111.38-120.76
 

[Speaker 1]: No. Thank you. timestamp: 121.22-122.26
 

[Speaker 0]: Okay. Great. And I added the discount to your account, so it should populate shortly. timestamp: 122.26-126.64
 

[Speaker 1]: Thanks. timestamp: 127.20-127.70
 

[Speaker 0]: Thank you. Bye. timestamp: 127.84-129.06
 

Pricing

Power your apps with world-class speech recognition in 30+ languages.

Includes: Speaker Diarization, Smart formatting, Automatic Language Detection, Deep Search, Keyword Boosting, Multichannel Support, and Callbacks.

For detailed model, language, and feature availability, please refer to our Developer Documentation.

Pre-Recorded
Streaming
Model
Pay As You Go
Growth
Enterprise
Nova-2
$0.0043/min
$0.0036/min
Contact Sales
Nova-1
$0.0043/min
$0.0036/min
Enhanced
$0.0145/min
$0.0115/min
Base
$0.0125/min
$0.0095/min
$0.0048/min
$0.0048/min
$0.0042/min
$0.0035/min
$0.0038/min
$0.0032/min
$0.0033/min
$0.0027/min
$0.0035/min
$0.0028/min
Custom
Redaction
$0.0020/min
$0.0017/min
Entity Detection
$0.0013/min
$0.0011/min

Rates listed above opt in to the Model Improvement Program.

Model
Pay As You Go
Growth
Enterprise
Nova-2
$0.0059/min
$0.0049/min
Contact Sales
Nova-1
$0.0059/min
$0.0049/min
Enhanced
$0.0165/min
$0.0136/min
Base
$0.0145/min
$0.0105/min
Custom
Redaction
$0.0020/min
$0.0017/min
Entity Detection
$0.0013/min
$0.0011/min

Rates listed above opt in to the Model Improvement Program.

Trusted by startups and enterprises

Discover the power of our product through real stories.

Easy to try, easy to use

Use Deepgram with your existing tech or on its own—whatever fits your needs. Fill out the form to discover how our leading voice AI can help scale your products.