Home / Companies / Deepgram / Blog / Post Details
Content Deep Dive

Benchmarking Top Open Source Speech Recognition Models: Whisper, Facebook wav2vec2, and Kaldi

Blog post from Deepgram

Post Details
Company
Date Published
Author
Andrew Seagraves
Word Count
5,472
Language
English
Hacker News Points
2
Summary

In this comparison of open-source ASR models, Kaldi performs poorly across all metrics and domains. Whisper outperforms wav2vec 2.0 in terms of accuracy but is significantly slower. The choice between these two options would depend on the specific needs of the user.