

ASR Reinvented
The fastest, most accurate, most reliable, most scalable speech transcription, with understanding — rebuilt just for enterprise.






Say goodbye to flatlining accuracy.
We’ve reinvented ASR with 100% deep learning that allows companies to continuously improve accuracy. Stop waiting for the big tech players to improve their software and forcing your developers to manually boost accuracy with keywords in every API call. Start training your speech model and reaping the benefits in weeks, not months or years.




Designed for Developers
The Easiest and Most Reliable Speech-to-Text API Ever
Use one API to label, train and deploy AI speech models and transcriptions at scale. No need to splice your audio file into shorter durations, decide between synchronous or async, real-time or batch. Deepgram is one API that easily integrates accurate transcripts into your application or existing data processing pipelines.
curl \
-X POST \
-u USERNAME:PASSWORD \
-H "Content-Type: application/json" \
-d '{"url":
"https://www.deepgram.com/examples/interview_speech-analytics.wav"}' \
https://brain.deepgram.com/v2/listen?punctuate=true
import base64
import json
import urllib.request
url = 'https://brain.deepgram.com/v2/listen'
username = 'USERNAME'
password = 'PASSWORD'
headers = {}
headers['Authorization'] = 'Basic {}'.format(
base64.b64encode('{}:{}'.format(username,
password).encode('utf-8')).decode('utf-8')
)
headers['Content-Type'] = 'application/json'
data = { 'url': 'https://www.deepgram.com/examples/interview_speech-analytics.wav' }
req = urllib.request.Request(
url,
method='POST',
headers=headers,
data=json.dumps(data).encode('utf-8')
)
resp = urllib.request.urlopen(req)
const axios = require('axios'); let url = 'https://brain.deepgram.com/v2/listen'; let username = 'USERNAME'; let password = 'PASSWORD'; let audio = 'https://www.deepgram.com/examples/interview_speech-analytics.wav'; axios({ method: 'post', url: url, auth:{ username username, password: password }, headers:{ 'Content-Type': 'application/json' }, data:{ url: audio } }) .then(function(response) { console.log(response.data); }) .catch(function(error) { console.log('Error happened!: ' + error); });

Why deepgram?
100% Deep Learning
Like you, we weren’t satisfied with the way automatic transcription worked — or more accurately, didn’t work — so we rebuilt it. We scrapped the traditional methods and opted for complete deep learning models that deliver speech-to-text on a level that the big tech players can’t touch.
Unlike other ASRs, Deepgram isn’t stagnant. It auto-learns from your data. Take it for a spin. Train a model to recognize the data you care about. It’ll continuously improve, delivering even greater accuracy and understanding over time.
-
Maximum Accuracy
Over 90% Accuracy. The only AI speech platform that trains on your unique audio.
-
Accelerated Time to Value
120x Faster. Transcribe hour long recordings in seconds. Get superior speech models in weeks, not months or years.
-
Continuous Learning
11 Patents for deep neural networking, which allow models to increase accuracy at unprecedented speed and cost.
-
Resilient Operations
State-of-the-art infrastructure supports 100s of models across our customer base. Process billions of words per day with 99.9% availability.
-
Future-Proof Foundation
Flexible and built for change. Train models and deploy anywhere – on premises or in the cloud.

Create a future with voice
The Foundation for a AI-Enabled Enterprise
Across industries, Deepgram makes converting speech to text simpler than ever before. Our GPU enabled, patented deep learning approach to speech recognition allows us to process billions of hours every day. Unlock your audio and start transforming your business.
