
Introducing Multilingual Universal-Streaming: Go global with ultra-fast, ultra-accurate real-time speech-to-text

Blog post from AssemblyAI

Post Details
Company: AssemblyAI
Date Published:
Author: Madison Bernstein
Word Count: 1,266
Language: English
Hacker News Points: -
Summary

AssemblyAI's Universal-Streaming now provides multilingual real-time speech-to-text: a single unified model supports six languages (English, Spanish, French, German, Italian, and Portuguese) and is aimed at voice agents. The release targets the challenges and added costs of expanding beyond English, where inaccurate multilingual transcription drives up quality assurance expenses. Because one architecture serves every language, Universal-Streaming offers instant processing, natural code-switching (speakers changing languages mid-conversation), and consistent quality across all six languages, with the same transparent price of $0.15/hr for each. The model is built for real-world use, with low Word Error Rates (WER) and minimal latency to keep the user experience responsive. It integrates with existing systems and returns production-ready output, including proper punctuation, capitalization, and intelligent endpointing, without complex custom processing. Customers can test and adopt it through API integration, interactive testing environments, and comprehensive documentation.
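
The summary leans on two numbers, Word Error Rate and the $0.15/hr price, so a minimal sketch of how each is usually computed may help. This is not AssemblyAI's evaluation or billing code. `word_error_rate` uses the standard definition (word-level edit distance divided by the number of reference words) with only lowercasing and whitespace splitting as normalization, which is an assumption. `streaming_cost_usd` is a hypothetical helper that assumes cost scales linearly with audio duration, with no rounding or minimums.

```python
PRICE_PER_HOUR_USD = 0.15  # figure quoted in the summary; same for every language


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Standard WER: (substitutions + deletions + insertions) / reference words."""
    # Assumed normalization: lowercase and split on whitespace only.
    ref = reference.lower().split()
    hyp = hypothesis.lower().split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference words seen so far
    # and the first j hypothesis words (rolling-row Levenshtein).
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr.append(min(substitution, deletion, insertion))
        prev = curr
    return prev[-1] / len(ref)


def streaming_cost_usd(audio_seconds: float,
                       price_per_hour: float = PRICE_PER_HOUR_USD) -> float:
    """Hypothetical linear cost estimate; real billing rules may differ."""
    return audio_seconds / 3600 * price_per_hour
```

For example, `word_error_rate("the cat sat on the mat", "the cat sat on mat")` counts one deletion against six reference words, and `streaming_cost_usd(100 * 3600)` estimates the cost of 100 hours of audio under the linear assumption.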