/plushcap/analysis/deepgram/python-speech-recognition-locally-torchaudio

Python Speech Recognition Locally with TorchAudio

What's this blog post about?

This article discusses how to perform speech recognition locally using Python and the TorchAudio library. It covers building a Python audio data streaming function, creating a context cache for storing audio data, setting up PyTorch TorchAudio, and using an Emformer model for local speech recognition in Python. The author provides detailed explanations of each step and includes full code examples.

Company
Deepgram

Date published
July 14, 2022

Author(s)
Yujian Tang

Word count
2308

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.