This project demonstrates the use of WebSockets to establish a real-time 2-way audio connection between a Twilio phone call and a user's device. It uses ASP.NET Core, Google Cloud Speech-to-Text API, and Twilio Media Streams. The application plays animal sounds in response to the user's voice commands, using a combination of natural language processing (NLP) and machine learning (ML) techniques.