What languages does the Voice Agent API support?
Blog post from AssemblyAI
AssemblyAI's Voice Agent API supports six languages—English, Spanish, French, German, Italian, and Portuguese—and facilitates native code-switching, allowing seamless transitions between languages within a single conversation using the Universal-3 Pro Streaming model. Priced at a flat $4.50 per hour, this API integrates speech-to-text, LLM, and text-to-speech processes through a single WebSocket with approximately one second of latency. A critical feature beyond language support is the inclusion of keyterms prompting, enabling the agent to accurately recognize and transcribe specific product names, customer names, and industry-specific jargon, enhancing the overall user experience. While the Voice Agent API covers these six languages, AssemblyAI's broader platform offers support for over 99 languages through Whisper-Streaming for real-time speech-to-text and Universal-2 for post-call transcription, with plans for future language expansion.
No tracked trend matches for this post yet.