Microsoft’s offer to purchase Nuance Communications for $19.7B validates Automatic Speech Recognition (ASR) has become an essential technology for business. It’s exciting to see an AI powerhouse like Microsoft, which has amazing talent and tools in house, finding extreme value in acquiring speech solutions at this scale. Overall, this acquisition is a great sign for AI companies like Deepgram and the Speech Recognition industry.
Why Buy Nuance? Hello Healthcare & Upsell potential.
Microsoft has provided their own Speech To Text (STT) solution for years, however due to difficulty of using the service, lack of scale, and preference for working well with its own hardware and software solutions (Cortana, Bing and the Teams communication app), it has not gained substantial traction beyond their ecosystem. Nuance on the contrary is the leader in the medical transcription market which was estimated to be 1.32B in 2019 and expected to grow to 4.89B by 2027. With the acquisition of Nuance, Microsoft immediately expands their STT reach into the healthcare community, and also opens up the possibility of expanding their Azure cloud storage revenues, as stored audio files are not small.
What product leaders of voice enabled applications need to know about the Microsoft acquisition of Nuance
While this acquisition will benefit Microsoft, the core architecture of STT does not have a high likelihood of improving. Microsoft and Nuance STT are both based on the same legacy tri-gram model so neither architecture will dramatically change, but perhaps have incremental improvements. There will also be challenges for these two companies to integrate the artifacts for two speech processing pipelines as they have different libraries, acoustic models, language lexicons, style guides, etc.
As a comparison, Deepgram built our speech recognition solution from scratch using a completely different architecture. Deepgram uses an end to end Deep Learning Neural Network, which in simple terms means we perform audio to text transcription in one AI-enabled step and we can continually improve our accuracy. Due to our architectural differences, Deepgram customers do not have to compromise accuracy vs. speed, speed vs. costs or cost vs. scalability.
Impact to the Broader ASR Market
So what does this do to the broader speech market? It elevates the conversation around Speech Recognition to higher levels within the organization. Businesses will be looking to see why ASR is a growth strategy for Microsoft and consider it as an important technology strategy to gather customer insights, improve employee engagement and accelerate their growth.
This acquisition also shows that ASR and voice technology growth is not only for consumer needs (Siri, Alexa, Cortana) but an important aspect of business needs. The recent report from Opus Research 2021 State of Speech Report validates the strategic importance of ASR and voice technology for businesses.
Microsoft’s acquisition of Nuance is just the tip of the iceberg, as speech recognition is more pervasive and extends beyond the patient experience to the customer and employee experience. We always believed that ASR is going to change the world and that every company will need speech recognition to get closer to their customers, find new insights for products and services, and better personalize their customer experiences. The best is yet to come.