Trained on 100,000+ Voices: Deepgram Unveils Next-Gen Speaker Diarization and Language Detection Models

Blog post from Deepgram

Post Details
Company: Deepgram
Date Published:
Author: Josh Fox
Word Count: 2,158
Company Posts That Month: 20
Language: English
Hacker News Points: -
Summary

Deepgram has released a new speaker diarization model that offers best-in-class accuracy and processes audio 10 times faster than its nearest competitor. The model is language-agnostic and comes free with all of the automatic speech recognition (ASR) models the company offers, including Nova and Whisper. Deepgram has also revamped its automatic language detection feature, achieving a relative error-rate improvement of up to 54.7% on high-demand languages such as English, Spanish, Hindi, and German. The company's large-scale multilingual training approach lets it use fast, lean networks while still reaching world-class accuracy. On domain-specific, real-world data, Deepgram's diarization feature outperforms many commercial diarization models and common open-source alternatives such as Pyannote.

Trends Found in this Post
Trend          Post Mentions   Total Month Mentions   Posts   Companies   MoM
Vector Search  3               1,125                  124     52          +87%
Voice AI       1               113                    26      11          +190%