Content Deep Dive
Transcribe phone calls in real time with Twilio, Vosk, and ASP.NET Core
Blog post from Twilio
Post Details
Company
Date Published
Author
Mark Heath
Word Count
3,511
Language
English
Hacker News Points
-
Source URL
Summary
This tutorial demonstrates how to implement live transcription of phone calls using Twilio's Media Streams API, ASP.NET Core, and the Vosk speech recognition engine. The application receives audio from a Twilio phone number through a WebSocket connection, converts the audio format to 16 kHz PCM, and then passes it to the Vosk engine for transcription. The recognized text is printed to the console. The tutorial also covers how to add functionality such as speaker identification using Vosk's model-spk, and how to use ngrok to expose the application to the internet.