Trained on 100,000+ Voices: Deepgram Unveils Next-Gen Speaker Diarization and Language Detection Models
Blog post from Deepgram
Deepgram has released a new speaker diarization model that offers best-in-class accuracy and processes audio 10 times faster than its nearest competitor. The language-agnostic diarization model is free with all of the company's automatic speech recognition (ASR) models, including Nova and Whisper. Deepgram has also revamped its automatic language detection feature, resulting in a relative error rate improvement of up to 54.7% on high-demand languages such as English, Spanish, Hindi, and German. The company's large-scale multilingual training approach enables it to employ fast and lean networks while still obtaining world-class accuracy. Deepgram's diarization feature outperforms many commercial diarization models and common open-source alternatives like Pyannote when dealing with domain-specific, real-world data.
| Trend | Post Mentions | Total Month Mentions | Posts | Companies | MoM |
|---|---|---|---|---|---|
| Vector Search | 3 | 1,125 | 124 | 52 | +87% |
| Voice AI | 1 | 113 | 26 | 11 | +190% |