Home / Companies / Twilio / Blog / Post Details
Content Deep Dive

Transcribe phone calls in real time with Twilio, Vosk, and ASP.NET Core

Blog post from Twilio

Post Details
Company
Date Published
Author
Mark Heath
Word Count
3,511
Language
English
Hacker News Points
-
Summary

This tutorial demonstrates how to implement live transcription of phone calls using Twilio's Media Streams API, ASP.NET Core, and the Vosk speech recognition engine. The application receives audio from a Twilio phone number through a WebSocket connection, converts the audio format to 16 kHz PCM, and then passes it to the Vosk engine for transcription. The recognized text is printed to the console. The tutorial also covers how to add functionality such as speaker identification using Vosk's model-spk, and how to use ngrok to expose the application to the internet.