Article·AI Engineering & Research·Sep 22, 2025

Working with Timestamps, Utterances, and Speaker Diarization in Deepgram

Speech-to-text (STT) is the baseline; the value is in the metadata. Deepgram returns timestamped transcripts with utterances and speaker diarization so you can answer who said what, when—reliably. Follow this in-depth tutorial to make the most of your STT models!

15 min read
Featured Image for Working with Timestamps, Utterances, and Speaker Diarization in Deepgram

By Stephen Oladele

Updated