Deepgram Changelog

Stay up to date with enhancements to our AI Speech Platform and ecosystem

Api
SpeechModel

Updated Nova-2 Model (English, Pre-recorded) - Improved Word Recognition

Natalie Rutgers
Jan 17, 2024
model = nova-2 Deepgram has done additional training to our Nova-2 General Model for English to improve accuracy for pre-recorded transcription, specifically addressing an issue with missing words / phrases. We intend to roll out similar updates for streaming transcription and across other Nova-2 tier models over the next two weeks. There should be no code changes required on your end - all requests with model=nova-2 will automatically get the latest version with the fix applied. For On-Premises customers, please reach out to your Deepgram contact if you'd like to update your models.
OnPrem

Deepgram On-premises January Release (240104)

Jason Maldonis
Jan 4, 2024
Container Images (release 240104) deepgram/onprem-api:release-240104 Equivalent image tag to deepgram/onprem-api:1.110.0 deepgram/onprem-engine:release-240104 Equivalent image tag to deepgram/onprem-engine:3.59.6 deepgram/onprem-license-proxy:release-240104 Equivalent image tag to deepgram/onprem-license-proxy:1.5.1 deepgram/onprem-billing:release-240104 Equivalent image tag to deepgram/onprem-billing:1.8.0 deepgram/onprem-dgtools:release-240104 Equivalent image tag to deepgram/onprem-dgtools:2.1.5 This Release Contains The Following Changes Improved accuracy for timestamps and punctuation for recognized keywords . Stability improvements, security patches, and bug fixes.šŸ›
Api

Deepgram Releases Improved SDKs

John Vajda
Dec 21, 2023
The Developer Experience Team at Deepgram has been working the last several months on greatly improving our SDKs (Software Development Kits) so developers have an easier time integrating with the Deepgram API. We are happy to announce you can now upgrade to the next major version of our improved SDKs for JavaScript, Go and Python to benefit from many "quality of life" improvements and added functionality. Please refer to the SDK repository and blog post for more details on each major release. Go SDK on GitHub (Now officially supported by Deepgram!) Go SDK blog announcement JavaScript SDK on GitHub JavaScript SDK blog announcement Python SDK on GitHub Python SDK blog announcement .NET SDK on Github This SDK major release is still in development but targeted for Q1 2024. If you upgrade your SDK version let us know how it goes in our Developer Communities on Github Discussions or on Discord ! Happy Coding šŸš€
OnPrem

Deepgram On-premises December Release (231207)

Brent George
Dec 7, 2023
Container Images (release 231207) deepgram/onprem-api:release-231207 Equivalent image tag to deepgram/onprem-api:1.109.0 deepgram/onprem-engine:release-231207 Equivalent image tag to deepgram/onprem-engine:3.59.6 deepgram/onprem-license-proxy:release-231207 Equivalent image tag to deepgram/onprem-license-proxy:1.5.1 deepgram/onprem-billing:release-231207 Equivalent image tag to deepgram/onprem-billing:1.8.0 deepgram/onprem-dgtools:release-231207 Equivalent image tag to deepgram/onprem-dgtools:2.1.5 This Release Contains The Following Changes Improved support for container health checks/probes. There are now native endpoints available for all Deepgram containers to help you more easily manage container lifecycles and system maintenance. These probes can be configured with Kubernetes, or any other container orchestration technology that supports health probes. For help with using these endpoints, please contact your Deepgram Account Representative for updated configuration files. Further documentation will also be published describing available health probe endpoints, and will be linked to this changelog. Stability improvements, security patches, and bug fixes.šŸ›
OnPrem

Deepgram On-premises November Release (231114)

Brent George
Nov 14, 2023
Container Images (release 231114) deepgram/onprem-api:release-231114 Equivalent image tag to deepgram/onprem-api:1.107.2 deepgram/onprem-engine:release-231114 Equivalent image tag to deepgram/onprem-engine:3.59.6 deepgram/onprem-license-proxy:release-231114 Equivalent image tag to deepgram/onprem-license-proxy:1.5.0 deepgram/onprem-billing:release-231114 Equivalent image tag to deepgram/onprem-billing:1.8.0 deepgram/onprem-dgtools:release-231114 Equivalent image tag to deepgram/onprem-dgtools:2.1.5 This Release Contains The Following Changes Support for Nova 2 models in general availability . License Proxy and Billing containers now accept configuration files license-proxy.toml and billing.toml , similar to APIā€™s api.toml and Engineā€™s engine.toml . Compared to CLI flags, configuration files are easier to place into source control for comparing config changes to your onprem deployment over time. Reach out to your Deepgram Account Representative if you have an existing deployment and would like to migrate to using configuration files for License Proxy or Billing containers. Stability improvements, security patches, and bug fixes.šŸ›
Api
Languages

Improvements to Language Detection

Jason Maldonis
Nov 14, 2023
Deepgram now offers the ability to restrict which languages will be detected by Deepgram's language detection algorithm. If you currently use the detect_language=true feature of Deepgram but your audio files are, for example, only in English and Spanish, you can improve the accuracy of your results by restricting which languages will be detected. Here's an example of how to restrict the detectable languages to only English ( en ) and Spanish ( es ): curl \ --request POST \ --header 'Authorization: Token YOUR_DEEPGRAM_API_KEY' \ --header 'Content-Type: audio/wav' \ --data-binary @youraudio.wav \ --url 'https://api.deepgram.com/v1/listen?detect_language=en&detect_language=es' This functionality is available for pre-recorded audio and Deepgram's models (not Whisper). For more information, head over to our developer documentation .
SpeechModel

Announcing Nova-2 GA & Additional Language Support

Natalie Rutgers
Nov 14, 2023
We are pleased to announce additional language support and the General Availability (GA) of Nova-2, our newest speech-to-text model. Deepgram Nova-2 Nova-2 presents the new state-of-the-art in speech recognition. Read about Nova-2's performance and benchmarks in our announcement . The complete list of available use cases and languages can be found in our documentation . Nova-2 is available to all Pay as you Go and Growth customers. Premium customers will have access to Nova-2 at contracted Nova rates. If you are a Premium customer and do not have access, please reach out to your account team. Pricing for Nova-2 is available at deepgram.com/pricing .
OnPrem

Deepgram On-premises October Release (231026)

Brent George
Oct 26, 2023
Container Images (release 231026) deepgram/onprem-api:release-231026 Equivalent image tag to deepgram/onprem-api:1.106.0 deepgram/onprem-engine:release-231026 Equivalent image tag to deepgram/onprem-engine:3.59.2 deepgram/onprem-license-proxy:release-231026 Equivalent image tag to deepgram/onprem-license-proxy:1.4.3 deepgram/onprem-billing:release-231026 Equivalent image tag to deepgram/onprem-billing:1.7.3 deepgram/onprem-dgtools:release-231026 Equivalent image tag to deepgram/onprem-dgtools:2.1.5 This Release Contains The Following Changes More control with language detection features! You can now restrict which languages can be detected , and view the language detection confidence score in returned ASR results. Adjusted order of operations with async requests to more consistently return a request ID immediately. Improved model loading logic. If you have explicit model loading directives under the features section of your engine.toml configuration file, you can remove it when upgrading to more recent releases. The minimum supported CUDA runtime version for onprem-engine has changed from 11.3.1 to 12.1.1. Systems using NVIDIA drivers before version 525.60.13 might encounter errors when attempting to start this release of onprem-engine. Deepgram recommends installing the latest NVIDIA drivers for maximum compatibility, stability, and performance. Stability improvements, security patches, and bug fixes.šŸ›
Api

Deepgram Log Usage Data limited to 90 days

John Vajda
Oct 6, 2023
On Oct 18th 2023 Deepgram will be introducing a 90 day limit on the storage of log usage data for all of our customers. What this means: In the Deepgram Console > Usage > Logs users will now only be able to query for log data for up to 90 days of time. When using the Deepgram API Usage endpoint users will now only be able to query for usage data up to 90 days of time. You can still retrieve Summarized Usage for greater than 90 days. Questions? Please refer to our Documentation for more details on Usage & Logs.
Feature

Improved Redaction Functionality

Shir Goldberg
Sep 28, 2023
Deepgramā€™s latest update offers enhanced redaction options for customers submitting pre-recorded audio to our hosted endpoint. Customers can now select specific types of entities, like locations, URLs, or names, to be redacted from their transcriptions. Hereā€™s an example of how to accomplish redacting different types of entities (in this case, city names, ages, and emails): curl \ --request POST \ --header 'Authorization: Token YOUR_DEEPGRAM_API_KEY' \ --header 'Content-Type: audio/wav' \ --data-binary @youraudio.wav \ --url 'https://api.deepgram.com/v1/listen?redact=location_city&redact=age&redact=email_address' For more information, head over to our developer documentation .
OnPrem

Deepgram On-premises September Release (230920)

Brent George
Sep 20, 2023
Container Images (release 230920) deepgram/onprem-api:release-230920 Equivalent image tag to deepgram/onprem-api:1.102.1 deepgram/onprem-engine:release-230920 Equivalent image tag to deepgram/onprem-engine:3.58.1 deepgram/onprem-license-proxy:release-230920 Equivalent image tag to deepgram/onprem-license-proxy:1.4.2 deepgram/onprem-billing:release-230920 Equivalent image tag to deepgram/onprem-billing:1.7.2 deepgram/onprem-metrics-server:release-230920 Equivalent image tag to deepgram/onprem-metrics-server:2.0.6 deepgram/onprem-dgtools:release-230920 Equivalent image tag to deepgram/onprem-dgtools:2.1.5 This Release Contains The Following Changes Support for Deepgram Nova-2 . Please contact Deepgram Customer Success to request access to this new model architecture. Significant improvements in diarization quality for batch requests. Addresses a memory leak in onprem-engine that originated in a upstream dependency. This memory leak was only present in the August (230804) release. onprem-dgtools now accepts licensing information passed via the DEEPGRAM_API_KEY environment variable, similar to onprem-api and onprem-engine . Other stability improvements & bug fixes.šŸ›
SpeechModel

Introducing Nova-2 Early Access

Natalie Rutgers
Sep 19, 2023
Deepgram is excited to announce early access to our next-gen speech-to-text model, Nova-2. As shared in our Marketing Announcement , Nova-2: Outperforms all alternatives in terms of accuracy, speed, and cost ( starting at $0.0043/min ). Is 18% more accurate than our previous Nova model and offers a 36% relative WER improvement over OpenAI Whisper (large). Pay as You Go and Growth users may access this model immediately in the API Playground or by requesting model=nova-2-ea in their API requests. Enterprise customers can reach out to their account representative or Contact Us for access. Nova-2 Early Access supports hosted and on-prem transcription of pre-recorded and streaming English audio. Read more about Nova-2 in the Deepgram Documentation .
OnPrem

Deepgram On-premises August Release (230804)

Pankaj Trivedi
Aug 4, 2023
Container Images (release 230804) deepgram/onprem-api:1.97.1 deepgram/onprem-engine:3.53.6 deepgram/onprem-license-proxy:1.4.2 deepgram/onprem-billing:1.7.2 deepgram/onprem-metrics-server:2.0.6 deepgram/onprem-dgtools:2.1.4 Deepgram On-premises Release Tags deepgram/onprem-api:release-230804 deepgram/onprem-engine:release-230804 deepgram/onprem-license-proxy:release-230804 deepgram/onprem-billing:release-230804 deepgram/onprem-metrics-server:release-230804 deepgram/onprem-dgtools:release-230804 This Release Contains The Following Changes Summarization efficiency improvements for broader GPU compatibility. Summarization-related errors and warnings produced by API calls have been expanded and made more detailed; please see our docs on this topic . Opus compatibility improvements with multichannel audio. Added a configuration parameter for batch sizes specifically for Whisper models. Please contact your account manager for more details. Added additional error reporting for streaming-related failures when the initial request includes the debug=true query parameter. Stability improvements and bug fixes.šŸ›
Feature

Introducing New Summarization

Pankaj Trivedi
Jul 19, 2023
We're excited to announce the release of our first domain-specific language model (DSLM) for speech summarization of call center interactions. You can request our new Summarization API endpoint by adding a summarize parameter set to v2 in the API call. It will then return a summary object in the response body of the output. The summary object includes status and a concise summary of the entire conversation. The URL query to call the DSLM-powered Summarization API might look like this: https://api.deepgram.com/v1/listen?summarize=v2 Example curl request: curl --location --request POST 'https://api.deepgram.com/v1/listen?summarize=v2' \ --header 'Authorization: Token <Your API KEY>' \ --header 'Content-Type: audio/wave' \ --data-binary '@/Path to file' You can send requests to the API with an Authorization header that references your project's API key Authorization: Token YOUR_DEEPGRAM_API_KEY The output response will contain the generated summary based on the provided audio. Summarization V2 supports English and Pre-Recorded audio. Primary difference between V1 (summarize=true) and V2 (summarize=v2) V1 provides summaries per channel. V2 provides a single summary across all the channels. V1 response contains summary objects (with summary, start, and end word). V2 response contains a single object with result and short key. V2 of our Summarization offers improved performance in terms of quality, content, and readability of generated summaries. For the best results moving forward, we recommend leveraging V2 of our summarization. Learn more about using our new Summarization V2 feature. Test Summarization V2 using our API Playground. We are thrilled to get this feature into your hands and await your feedback. Please share it with us at Product Feedback or your dedicated support channel.
OnPrem

Deepgram On-premises July Release (230705)

Evan Henry
Jul 7, 2023
Container Images (release 230705) deepgram/onprem-api:1.95.0 deepgram/onprem-engine:3.53.0 deepgram/onprem-license-proxy:1.4.1 deepgram/onprem-billing:1.7.1 deepgram/onprem-metrics-server:2.0.6 deepgram/onprem-dgtools:2.1.4 Deepgram On-premises Release Tags deepgram/onprem-api:release-230705 deepgram/onprem-engine:release-230705 deepgram/onprem-license-proxy:release-230705 deepgram/onprem-billing:release-230705 deepgram/onprem-metrics-server:release-230705 deepgram/onprem-dgtools:release-230705 This Release Contains The Following Changes Support for license keys created and managed from Deepgram Console . Support for new Domain-Specific Language Model powered summarization. Learn more . The minimum supported CUDA runtime version for onprem-engine has changed from 11.0.3 to 11.3.1. Systems using NVIDIA drivers before version 450.80.02 might encounter errors when attempting to start this release of onprem-engine. Deepgram recommends installing the latest NVIDIA drivers for maximum compatibility, stability, and performance. The onprem-engine container size has been significantly reduced. Reduction in frequency of hallucinations when using Deepgram enhanced models. Improvements to accuracy of reported word times when using existing Whisper models. Duration values specified in the onprem-api configuration file can now include unit suffixes. For example, instead of writing 480 it is now possible to write 4m. Values with no suffix are assumed to be seconds. Stability improvements and bug fixes.šŸ›
Page 1 of 6

Stop building work-arounds for STT systems that don't work.

Start FreeTalk to an expert