The text discusses LiveCap, software developed by Hakase Shojo to address the challenges streamers face in providing real-time subtitles, particularly for global audiences. Initially, LiveCap relied on silence detection to segment speech for subtitles: a segment was only finalized after a pause, so captions trailed the speaker, and background noise produced spurious breaks and inaccuracies. To improve this, the software switched to Voice Activity Detection (VAD), specifically the TEN VAD model, which detects speech activity frame by frame and significantly improved synchronization between speech and subtitles by reducing lag and false detections. TEN VAD offered reliable speech detection, ultra-low latency, and resource efficiency, making it suitable for real-time applications beyond streaming, such as AI customer service and tutoring. Integrating TEN VAD into LiveCap produced more natural transcripts and reduced user frustration, underscoring how much the choice of underlying technical components matters for product performance.
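To make the difference concrete, below is a minimal sketch of how a frame-level VAD can gate a real-time captioning pipeline instead of waiting for long silences. The `vad_is_speech` and `on_segment` callbacks are hypothetical stand-ins for a frame-level VAD model (such as TEN VAD) and a downstream transcription step; only the segmentation logic is illustrated, not LiveCap's actual implementation.

```python
# Sketch: turn frame-level VAD decisions into utterance segments for ASR.
# Assumes 16 kHz mono PCM frames; callback names are illustrative only.
from dataclasses import dataclass, field
from typing import Callable, List

Frame = bytes  # one hop of audio, e.g. 16 ms


@dataclass
class VadSegmenter:
    vad_is_speech: Callable[[Frame], bool]   # frame-level speech/non-speech decision
    on_segment: Callable[[bytes], None]      # called with each finished utterance
    hangover_frames: int = 12                # ~200 ms of trailing silence before closing
    _buffer: List[Frame] = field(default_factory=list)
    _silence_run: int = 0
    _in_speech: bool = False

    def push(self, frame: Frame) -> None:
        """Feed one audio frame; emit a segment shortly after speech ends."""
        if self.vad_is_speech(frame):
            self._in_speech = True
            self._silence_run = 0
            self._buffer.append(frame)
        elif self._in_speech:
            # Keep a short tail of non-speech so word endings are not clipped.
            self._silence_run += 1
            self._buffer.append(frame)
            if self._silence_run >= self.hangover_frames:
                self.on_segment(b"".join(self._buffer))
                self._buffer.clear()
                self._silence_run = 0
                self._in_speech = False
        # Frames outside speech are dropped, so background noise never reaches the ASR.
```

Because a segment closes after only a short run of detected non-speech rather than a long fixed silence window, captions appear close behind the speaker; the hangover length is the knob that trades latency against the risk of splitting words mid-utterance.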