Convert Catalan speech to text with high accuracy, low latency, and enterprise-grade scalability. Deepgram delivers real-time and batch transcription through a developer-first speech-to-text API.
Trusted by the world's top Enterprises and Startups
Get real-time Catalan speech-to-text in under 300 ms while maintaining high accuracy in noisy, accented, or overlapping conversations.

Speakers: 9 million total speakers (4 million native speakers, 5 million second-language speakers)
Regions: Spain (Catalonia, Valencian Community, Balearic Islands, Aragon), Andorra, France (Roussillon), Italy (Alghero, Sardinia)
Dialects: Central Catalan, Northwestern Catalan, Valencian, Balearic, Northern Catalan (Roussillonese), Algherese
Writing system: Latin alphabet with diacritical marks (accents and diaeresis)
Language family: Romance (Indo-European)
Catalan is widely used across government administration, education, media, and business in Catalonia, the Valencian Community, the Balearic Islands, and Andorra, making it a key language for public sector transcription, educational platforms, contact center analytics, media captioning, and multilingual voice agents in Spanish and European markets.

Deepgram includes everything required to produce accurate, readable, and secure Catalan transcripts out of the box.
Automatically detect and label who is speaking in multi-speaker Catalan conversations.
Apply automatic capitalization, paragraphing, and clean transcript structure for Catalan text.
Instantly find words or phrases inside long Catalan recordings without reprocessing audio.
Segment streaming Catalan audio into real-time sentence-level units for voice agents.
Add accurate punctuation and capitalization to Catalan transcripts for easy reading.
Automatically remove sensitive data like credit cards, phone numbers, and PII from Catalan transcripts.

Keyterm prompting for Catalan
Boost recognition of brand names, product terms, and domain-specific vocabulary in Catalan audio to improve keyword recall and transcript accuracy.

Automatic language detection
Identify when audio is spoken in Catalan and transcribe it without pre-selecting a language. For mixed-language datasets, sources, and batch transcription pipelines.
Start with Catalan speech-to-text, then expand to 45+ languages using the same API, models, and tooling.
Start transcribing Catalan audio with Deepgram's speech to text API. It is fast, accurate, and built for real-time applications.