
What languages does the Voice Agent API support?

Blog post from AssemblyAI

Post Details
Company: AssemblyAI
Date Published:
Author: Kelsey Foster
Word Count: 1,252
Company Posts That Month: 28
Language: English
Hacker News Points: -
Summary

AssemblyAI's Voice Agent API supports six languages: English, Spanish, French, German, Italian, and Portuguese. It also supports native code-switching, so a speaker can move between languages within a single conversation; this is handled by the Universal-3 Pro Streaming model. The API costs a flat $4.50 per hour and combines speech-to-text, the LLM, and text-to-speech behind a single WebSocket, with approximately one second of latency. Beyond language support, keyterms prompting lets the agent recognize and transcribe specific product names, customer names, and industry-specific jargon accurately. The Voice Agent API itself is limited to these six languages, but AssemblyAI's broader platform supports over 99 languages through Whisper-Streaming for real-time speech-to-text and Universal-2 for post-call transcription, and further language expansion is planned.
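
To make the flat rate concrete, here is a minimal sketch of a cost estimate. Only the $4.50 per hour figure comes from the summary. The function name `estimate_cost_usd`, the assumption that billing scales linearly with session duration, and the rounding to whole cents are illustrative guesses, not documented billing behavior.

```python
from decimal import Decimal

# Flat rate quoted in the summary above.
HOURLY_RATE_USD = Decimal("4.50")


def estimate_cost_usd(session_seconds: int) -> Decimal:
    """Estimate the cost of one voice-agent session.

    Assumption (not from the source): cost is strictly proportional to
    session duration, with no minimum charge and no per-minute rounding.
    """
    if session_seconds < 0:
        raise ValueError("session_seconds must be non-negative")
    # Multiply before dividing so common durations stay exact.
    cost = (Decimal(session_seconds) * HOURLY_RATE_USD) / Decimal(3600)
    # Round to whole cents for display.
    return cost.quantize(Decimal("0.01"))


if __name__ == "__main__":
    print(estimate_cost_usd(600))   # a 10-minute call
    print(estimate_cost_usd(3600))  # a 1-hour call
```

Under these assumptions a 10-minute call comes to $0.75 and a full hour to $4.50. Actual invoices may differ if the provider applies minimums or rounds usage differently.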
