Home / Companies / Deepgram / Blog / Post Details
Content Deep Dive

Introducing Batch Diarization V2

Blog post from Deepgram

Post Details
Company
Date Published
Author
Nick Kaimakis
Word Count
924
Language
English
Hacker News Points
-
Summary

Batch Diarization V2 is a significant upgrade to speaker attribution technology, enhancing the accuracy of speaker labeling in pre-recorded audio across various domains such as contact centers, healthcare, and voice AI applications. The new model introduces expanded training data, an improved speaker embedding model, and better segmentation and clustering, leading to more precise speaker attribution and reduced labeling errors. Evaluations showed that Diarization V2 consistently outperformed its predecessor, Diarization V1, with human evaluators preferring the new version 3.3 times more often due to improved performance, particularly in challenging audio scenarios. The update is available via the new diarize_model parameter, allowing users to select between model versions without any changes in pricing or existing integrations, and is compatible with Deepgram's batch Speech-to-Text offerings.