Convert Persian speech-to-text with high accuracy, low latency, and enterprise-grade scalability. Deepgram delivers real-time and batch transcription through a developer-first speech-to-text API.
Trusted by the world's top Enterprises and Startups
Get real-time Persian speech-to-text in under 300 ms while maintaining high accuracy in noisy, accented, or overlapping conversations.

Speakers: 70-80 million total speakers
Regions: Iran (official language), Afghanistan (as Dari), Tajikistan (as Tajik), with significant diaspora communities in the United States, Canada, Uzbekistan, Pakistan, and Iraq
Dialects: Iranian Persian (Western Persian), Dari Persian (Eastern Persian), Tajik
Writing system: Modified Arabic script (Perso-Arabic) written right-to-left; Cyrillic script in Tajikistan
Language family: Indo-European family, Iranian branch, Western Iranian languages
Persian is widely used across Iran, Afghanistan, Tajikistan, and global diaspora communities, making it a key language for call center analytics, customer support AI serving Middle Eastern markets, multilingual healthcare services, media transcription, legal and immigration services, and voice-enabled education platforms.

Automatically detect and label who is speaking in multi-speaker Persian conversations.
Apply automatic capitalization, paragraphing, and clean transcript structure for Persian text.
Instantly find words or phrases inside long Persian recordings without reprocessing audio.
Segment streaming Persian audio into real-time sentence-level units for voice agents.
Add accurate punctuation and capitalization to Persian transcripts for easy reading.
Automatically remove sensitive data like credit cards, phone numbers, and PII from Persian transcripts.
Boost recognition of brand names, product terms, and domain-specific vocabulary in Persian audio to improve keyword recall and transcript accuracy.

Start with Persian speech-to-text, then expand to 45+ languages using the same API, models, and tooling.
Start transcribing Persian audio with Deepgram's speech to text API. It is fast, accurate, and built for real-time applications.