Cantonese Speech to Text

Convert Cantonese speech to text with high accuracy, low latency, and enterprise-grade scalability. Deepgram delivers real-time and batch transcription through a developer-first speech-to-text API.

Trusted by the world's top enterprises and startups

Daily

Vapi

LiveKit

Cloudflare

Fast and accurate Cantonese speech recognition for real-world audio

Get real-time Cantonese speech-to-text in under 300 ms while maintaining high accuracy in noisy, accented, or overlapping conversations.

Cantonese Language Overview

Speakers: 86 million in total

Regions: China (Guangdong and Guangxi), Hong Kong, Macau, Malaysia, United States, Vietnam, Singapore, Australia, United Kingdom, Canada

Dialects: Guangzhou Cantonese (standard), Hong Kong Cantonese, Macau Cantonese, Taishanese (Toisanese)

Writing system: Traditional Chinese characters with Cantonese-specific colloquial characters

Language family: Sino-Tibetan, Yue branch of Chinese languages

Cantonese is widely used across Hong Kong, southern China, and global diaspora communities, making it a key language for call center analytics, customer support AI, media captioning, healthcare transcription, multilingual voice agents, financial services, and legal documentation.

Cantonese Speech-to-Text Capabilities

Deepgram includes everything required to produce accurate, readable, and secure Cantonese transcripts out of the box.

Diarization

Automatically detect and label who is speaking in multi-speaker Cantonese conversations.
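As a rough illustration of working with speaker labels, the sketch below merges consecutive words from the same speaker into turns. It assumes each word in the transcript response is a dictionary with a `word` string and an integer `speaker` label; that shape, the helper name `speaker_turns`, and the sample data are assumptions made for this example, not a confirmed description of the API response.

```python
def speaker_turns(words):
    """Merge consecutive words from the same speaker into (speaker, text) turns."""
    turns = []
    for w in words:
        if turns and turns[-1][0] == w["speaker"]:
            # Same speaker as the previous word: extend the current turn.
            turns[-1][1].append(w["word"])
        else:
            # Speaker changed (or first word): start a new turn.
            turns.append((w["speaker"], [w["word"]]))
    # Written Cantonese does not separate words with spaces, so join directly.
    return [(speaker, "".join(tokens)) for speaker, tokens in turns]


# Hypothetical sample data, not real API output.
sample_words = [
    {"word": "你好", "speaker": 0},
    {"word": "請問", "speaker": 0},
    {"word": "早晨", "speaker": 1},
]

for speaker, text in speaker_turns(sample_words):
    print(f"Speaker {speaker}: {text}")
```

Joining without spaces suits text written in Chinese characters; audio that mixes in English words would need a smarter joiner.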

Smart formatting

Apply automatic capitalization, paragraphing, and clean transcript structure for Cantonese text.

Search

Instantly find words or phrases inside long Cantonese recordings without reprocessing audio.

Utterances

Segment streaming Cantonese audio into real-time sentence-level units for voice agents.

Punctuation

Add accurate punctuation and capitalization to Cantonese transcripts for easy reading.

Redaction

Automatically remove sensitive data like credit cards, phone numbers, and PII from Cantonese transcripts.

Keyword boosting for Cantonese

Improve recognition of uncommon words, product names, and industry terms in Cantonese audio by supplying them as boosted keywords.
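The capabilities above are typically switched on per request. The following is a minimal sketch of how such a request might be assembled using only the Python standard library. Several details are assumptions rather than facts stated on this page: the endpoint `https://api.deepgram.com/v1/listen`, the `zh-HK` language code, the `nova-2` model name, the query parameter names, the `term:boost` keyword format, and the `pci` redaction value. Check them against the current API reference, including whether each feature is available for Cantonese. The helper names are hypothetical, and the code only builds the request without sending it.

```python
import json
import urllib.request
from urllib.parse import urlencode

# Assumed endpoint for pre-recorded audio; confirm against the API reference.
LISTEN_URL = "https://api.deepgram.com/v1/listen"


def build_listen_url(language="zh-HK", model="nova-2",
                     keywords=(), search=(), redact=()):
    """Build a request URL with the transcript features enabled (assumed names)."""
    params = [
        ("model", model),          # assumed model name
        ("language", language),    # assumed Cantonese language code
        ("diarize", "true"),       # speaker labels
        ("smart_format", "true"),  # formatting
        ("punctuate", "true"),     # punctuation
        ("utterances", "true"),    # utterance segmentation
    ]
    # Repeatable parameters: one query entry per value.
    params += [("keywords", k) for k in keywords]
    params += [("search", s) for s in search]
    params += [("redact", r) for r in redact]
    return f"{LISTEN_URL}?{urlencode(params)}"


def build_request(api_key, audio_url, **options):
    """Return an unsent POST request that points the API at a hosted audio file."""
    body = json.dumps({"url": audio_url}).encode("utf-8")
    return urllib.request.Request(
        build_listen_url(**options),
        data=body,
        method="POST",
        headers={
            "Authorization": f"Token {api_key}",
            "Content-Type": "application/json",
        },
    )


# Placeholder key and audio URL, for illustration only.
request = build_request(
    "YOUR_API_KEY",
    "https://example.com/cantonese-call.wav",
    keywords=["Deepgram:2"],
    redact=["pci"],
)
print(request.full_url)
```

Passing `request` to `urllib.request.urlopen` would send it. That step is left out here so the sketch runs without credentials or network access.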

Scale beyond Cantonese with one API

Start with Cantonese speech-to-text, then expand to 45+ languages using the same API, models, and tooling.

Frequently Asked Questions

What is Cantonese speech-to-text and how does it work?

Does Deepgram support Cantonese speech-to-text?

Which Deepgram models support Cantonese speech-to-text?

Can Deepgram transcribe Cantonese in real time?

Does Deepgram support automatic language detection for Cantonese?

Can Deepgram handle audio with multiple languages or code-switching?

What features are supported for Cantonese transcripts?

How accurate is Deepgram for Cantonese speech-to-text?

How do I get started with Deepgram's Cantonese speech-to-text API?

Ready to build with Cantonese speech-to-text?

Start transcribing Cantonese audio with Deepgram's speech-to-text API. It is fast, accurate, and built for real-time applications.
