Real-time voice agents only work if they can listen and respond as quickly as a human.
That comes down to three things working together: fast infrastructure, reliable models, and a simple way to run them in one place. Today, we’re making that last part easier. Deepgram speech-to-text is now natively available on Together AI, so you can run your STT, LLM, and TTS on a single platform while keeping the speech layer powered by Deepgram.
"Speed and accuracy are non-negotiable for production voice agents. Voice capabilities powered by Deepgram give Together AI developers a reliable speech layer that keeps up with real-time conversation, all within our co-located infrastructure."
- Arielle Fidel, VP Strategic Partnerships, Together AI
What’s New
If you’re already building on Together AI, you don’t need to rethink your stack to get Deepgram. You can now pick Deepgram as the STT engine inside your existing Together AI voice pipelines and keep everything—audio, tokens, and logs—on one platform.
With this integration, you can:
- Choose Deepgram as your STT engine inside Together AI’s voice pipelines.
- Keep STT, LLM, and TTS co-located instead of hopping across multiple vendors.
- Use one API and one bill, while still getting Deepgram quality on every transcript.
- Maintain access to the full transcript and response text for logging, QA, and routing.
No extra glue code. No new vendor to wire in. Just Deepgram inside your existing Together AI setup.
Why This Matters for Voice Agents
Teams building voice agents tend to run into the same problems as they move from demo to production: latency creeps up, accuracy drops in real-world environments, and operations get complicated fast.
This integration is designed to reduce that friction.
Faster turn-taking. With Deepgram hosted directly on Together AI, audio doesn’t have to leave the environment just to be transcribed. That keeps end-to-end latency low enough that users can interrupt, clarify, and keep talking without awkward gaps.
Better understanding of real calls. Deepgram is tuned for real customer audio—contact centers, financial calls, healthcare workflows, sales conversations—not just clean lab recordings. That means fewer misheard entities, fewer “sorry, can you repeat that?”, and smoother handoffs to humans when needed.
A simpler stack to run. You get Together AI’s unified control plane plus Deepgram at the speech layer. Instead of juggling multiple dashboards and support paths, you can focus on how your agent behaves, not how the plumbing is wired together.
If you already rely on Together AI, this is the most direct way to upgrade your STT without rebuilding your architecture.
How Deepgram Fits into Your Voice Stack
Deepgram is the voice layer that sits underneath your agent experience. Our platform covers the full surface area of voice:
- Speech-to-Text (Nova & Flux) for real-time and batch transcription, tuned for accuracy and low latency.
- Text-to-Speech (Aura) for natural voices that are built for production, not just demos.
- Voice Agent API that combines STT, orchestration, and TTS into a single real-time API for teams who want Deepgram to run the full conversational pipeline.
With Deepgram now hosted on Together AI, you have options for how deep you go:
You can stay on Together AI for model hosting and orchestration, and simply choose Deepgram as your embedded STT engine there. This keeps your LLMs, TTS, and infrastructure where they are today, while upgrading the speech recognition layer.
Or you can pair this with Deepgram’s own Voice Agent API, dedicated environments, or self-hosted deployments when you need more control over compliance, routing, or multi-region architecture. In both cases, Deepgram becomes the speech backbone for voice agents that have to work in production, not just in slideware.
See It in Action and Start Building
You don’t have to imagine how this feels, you can call it.
Call the live demo. Dial (847) 851-4323 to talk to a real-time voice agent running on Together AI’s co-located pipeline. Interrupt it mid-sentence, change topics, and notice how quickly it recovers.
Use the Together AI docs. Follow their voice quickstart and explore the Together AI voice platform to configure a pipeline, then plug in Deepgram STT as your transcription layer.
Explore Deepgram. Visit deepgram.com to learn more about our Speech-to-Text, Text-to-Speech, and Voice Agent APIs, and to grab an API key for your own apps.
We’re excited to see what you build when Together AI’s infrastructure and Deepgram’s voice AI are working side by side: one platform, real-time performance, and speech that just works.

