/plushcap/analysis/assemblyai/kaldi-speech-recognition-for-beginners-a-simple-tutorial

Kaldi Speech Recognition for Beginners - A Simple Tutorial

What's this blog post about?

In this tutorial, we learn how to use the open-source speech recognition toolkit Kaldi in conjunction with Python to automatically transcribe audio files. The process involves several steps including installing Kaldi and its dependencies, creating necessary input files for Kaldi, modifying MFCC configuration file, feature extraction, pre-trained model download and extraction, decoding graph construction, transcription retrieval, and rescoring with LSTM-based model. The tutorial also provides information on how to use the AssemblyAI Speech-to-Text API for easy transcription if Kaldi seems too complex or time-consuming.

Company
AssemblyAI

Date published
Jan. 20, 2022

Author(s)
Ryan O'Connor

Word count
4046

Hacker News points
6

Language
English


By Matt Makai. 2021-2024.