Use Case Types Available:
General, Conversational AI, Earnings, Meeting, Phone Call, Videos
FREE* = Free to transcribe for a limited time on our hosted environment.
LANGUAGES
Every customer. Heard and understood.
When it comes to your customer base, a generic one-size-fits-all speech recognition model isn’t going to cut it. Deepgram provides the most accurate speech models in the speech-to-text industry, regardless of the language your customers speak.
Dozens of Languages. Multiple Form Factors.
With our team of linguists, our AI researchers have an unfair advantage. We understand the nuances of every language we build, which in turn gives you higher accuracy. Since our language models are created exclusively with End-to-End Deep Learning, we can perform transfer learning from one language to another, and quickly support new languages and dialects to better meet your use case. Don’t see your language listed below? Contact us, as new languages and dialects are released frequently.
More Voices. Transcribed More Accurately.
Dutch FREE*
Did you know that Dutch and Flemish, despite the different names, are varieties of the same language, just spoken in different countries? Our model works for both, regardless of where the speaker is from. Deepgram’s Dutch speech recognition model is suitable for general use cases.
- Dutch—nl
English
Over 1.348 billion people speak English as a first or second language across the globe in hundreds of different dialects. That’s why we have different speech models for the most widely spoken varieties of English. Deepgram’s English speech recognition models are suitable for the following dialects and use cases:
- English—en
- English—en-AU
- English—en-US
- English—en-GB
- English—en-NZ
- English—en-IN
French
French has a large number of homophones—words with the same pronunciation but different spellings and meanings—which is a challenge that our models can handle. Deepgram’s French speech recognition models are suitable for general audio use cases and the following dialects:
- French—fr FREE*
- French Canadian—fr-CA
German FREE*
German and English are close cousins, but even with 60% of their vocabulary sharing a common origin, one model doesn’t work for both. Deepgram’s German speech recognition model is suitable for general use cases.
- German—de
Hindi
Hindi has some 37 consonants, compared to only 24 in English. But that’s not a problem for Deepgram’s Hindi speech recognition models, which are suitable for general use cases. We also have a speech model that transliterates Hindi to Latin characters.
- Hindi—hi
- Hindi—hi-Latn
Indonesian FREE*
Indonesian is the 9th most spoken language in the world and has approximately 160 million speakers. Deepgram’s Indonesian speech recognition model is suitable for general use cases.
- Indonesian—id
Italian FREE*
Italian is spoken in a number of dialects up and down the peninsula. Our model was designed with this in mind, and works well with a range of standard Italian varieties. Deepgram’s Italian speech recognition model is suitable for general use cases.
- Italian—it
Japanese FREE*
Japanese uses a combination of three different writing systems (four if you count romaji, or words written in Latin characters). We match pronunciation to the correct transcription so that it can be easily read by Japanese speakers. Deepgram’s Japanese speech recognition model is suitable for general use cases.
- Japanese—ja
Korean FREE*
Korean’s own script, Hangul, is made of characters said to represent the shape of your mouth when you pronounce them. And rather than being written sound by sound, the sounds are stacked into blocks that represent syllables. Deepgram’s Korean speech recognition model is suitable for general use cases.
- Korean—ko
Mandarin FREE*
Mandarin started developing as a language over 1,000 years ago and is spoken by 14% of the world’s population. Deepgram’s Mandarin speech recognition model is suitable for general use cases.
- Mandarin, Simplified script—zh (cmn-Hans-CN)
- Mandarin, Traditional script—zh-TW (cmn-Hant-TW)
Portuguese
European and Brazilian Portuguese differ from each other considerably more than, say, US and British English, which is why we have three distinct speech models (Mixed, Portugal and Brazil). Deepgram’s Portuguese speech recognition models are suitable for general use cases and the following dialects:
- Portuguese—pt
- Portuguese—pt-PT
- Portuguese—pt-BR
Russian FREE*
Russian is one of the languages of space, so astronauts need to learn Russian as part of their training for the International Space Station. Luckily for Deepgram’s partnership with NASA, we can transcribe both English and Russian. Deepgram’s Russian speech recognition model is suitable for general use cases.
- Russian—ru
Spanish
Spanish is an official language in 21 countries and has half a billion speakers worldwide, which means dialects matter. We created our Spanish language model to recognize a variety of regional accents and dialects, making it a great fit for the diverse variety of Spanish spoken throughout the Americas. Deepgram’s Spanish speech recognition models are suitable for general use cases.
- Spanish—es
- Spanish Latin America—es-419
Swedish FREE*
Written Swedish is very close to Danish and Norwegian. The spoken languages are more distinct from each other, though, which is why we have a model specifically for Swedish. Deepgram’s Swedish speech recognition model is suitable for general use cases.
- Swedish—sv
Turkish
In Turkish, one word can be equivalent to a whole sentence in another language. An example is gitmeyecekmişçesine ‘it’s as if (s)he won’t leave’. Luckily, our ASR model handles Turkish morphology, so it can correctly transcribe roots and suffixes. Deepgram’s Turkish speech recognition model is suitable for general use cases.
- Turkish—tr
Ukrainian FREE*
Our newest Ukrainian speech model is free to use, with no fee and no limits. Deepgram’s Ukrainian speech recognition model is suitable for general use cases.
- Ukrainian—uk
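If you are wiring these codes into an application, it can help to keep them in one lookup table. The sketch below collects the language codes listed on this page and builds a request URL for a chosen language. It is an illustration only: the endpoint URL, the `language` query parameter, and the helper names `resolve_language` and `build_request_url` are assumptions made for this example, and the fallback from an unlisted dialect to its base language is a convenience of the sketch, not documented Deepgram behavior. Check the API reference before relying on any of it.

```python
from urllib.parse import urlencode

# Language codes copied from the lists on this page.
SUPPORTED_LANGUAGES = {
    "nl": "Dutch",
    "en": "English",
    "en-AU": "English (Australia)",
    "en-US": "English (United States)",
    "en-GB": "English (United Kingdom)",
    "en-NZ": "English (New Zealand)",
    "en-IN": "English (India)",
    "fr": "French",
    "fr-CA": "French Canadian",
    "de": "German",
    "hi": "Hindi",
    "hi-Latn": "Hindi (Latin script)",
    "id": "Indonesian",
    "it": "Italian",
    "ja": "Japanese",
    "ko": "Korean",
    "zh": "Mandarin, Simplified script",
    "zh-TW": "Mandarin, Traditional script",
    "pt": "Portuguese",
    "pt-PT": "Portuguese (Portugal)",
    "pt-BR": "Portuguese (Brazil)",
    "ru": "Russian",
    "es": "Spanish",
    "es-419": "Spanish Latin America",
    "sv": "Swedish",
    "tr": "Turkish",
    "uk": "Ukrainian",
}

# Assumption: endpoint and parameter name are placeholders for illustration.
BASE_URL = "https://api.deepgram.com/v1/listen"


def resolve_language(requested: str) -> str:
    """Return a listed code for `requested`.

    Matching is case-insensitive. If the exact dialect is not listed,
    fall back to the base language (for example "en-ZA" -> "en").
    This fallback is a choice made in this sketch only.
    """
    by_lower = {code.lower(): code for code in SUPPORTED_LANGUAGES}
    key = requested.strip().lower()
    if key in by_lower:
        return by_lower[key]
    base = key.split("-", 1)[0]
    if base in by_lower:
        return by_lower[base]
    raise ValueError(f"No listed model for language code: {requested!r}")


def build_request_url(requested: str) -> str:
    """Build a request URL carrying the resolved language code."""
    query = urlencode({"language": resolve_language(requested)})
    return f"{BASE_URL}?{query}"


if __name__ == "__main__":
    for tag in ("pt-br", "en-ZA", "hi-Latn"):
        print(tag, "->", build_request_url(tag))
```

Keeping the table in one place means a newly released language or dialect only needs a single new entry.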