Company
Date Published
Author
-
Word count
855
Language
English
Hacker News points
None

Summary

Gladia has added partial transcripts to its real-time API for Voice AI agents. Instead of waiting for a complete transcript, the API streams word-by-word results as the user speaks, so an agent can infer user intent earlier and respond with less delay, which makes conversations feel more fluid and natural. Partials are emitted very quickly, especially for the first words of an utterance, which keeps latency in the Speech-to-Text (STT) phase very low and lets developers pass early text to a Large Language Model (LLM) to start formulating a response. Partials may be less accurate than final transcripts, but LLMs can generally interpret them well enough to maintain interaction quality. For best performance, Gladia recommends specifying the target language rather than relying on language detection, which can cause issues. The feature is available to all Gladia users but must be enabled in the API's configuration settings.
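
As a rough illustration of how an agent might consume the partial and final transcripts described above, the sketch below keeps a running view of what the user has said so far. The message shape (a dict with `text` and `is_final` keys) and the names `TranscriptState` and `replay` are hypothetical, chosen only for this example; they are not taken from Gladia's API, whose actual message schema and configuration fields should be checked in its documentation.

```python
from dataclasses import dataclass, field
from typing import Dict, Iterable, List


@dataclass
class TranscriptState:
    """Running transcript built from partial and final messages.

    Assumption: each message is a dict with a "text" string and an
    "is_final" boolean. This shape is hypothetical, not Gladia's schema.
    """

    finals: List[str] = field(default_factory=list)
    partial: str = ""

    def apply(self, message: Dict) -> str:
        text = message["text"]
        if message["is_final"]:
            # A final transcript supersedes the partial for this utterance.
            self.finals.append(text)
            self.partial = ""
        else:
            # Each partial replaces the previous one for the same utterance.
            self.partial = text
        return self.current_text()

    def current_text(self) -> str:
        # Confirmed utterances first, then the in-progress partial, if any.
        parts = self.finals + ([self.partial] if self.partial else [])
        return " ".join(parts)


def replay(messages: Iterable[Dict]) -> List[str]:
    """Return the transcript view after each message in the stream."""
    state = TranscriptState()
    return [state.apply(m) for m in messages]


if __name__ == "__main__":
    stream = [
        {"text": "I want", "is_final": False},
        {"text": "I want to book", "is_final": False},
        {"text": "I want to book a table.", "is_final": True},
        {"text": "For two", "is_final": False},
    ]
    for view in replay(stream):
        print(view)
```

In a design like this, the agent could hand `current_text()` to an LLM as soon as a partial arrives and then reconcile once the final transcript replaces it, which is one way to tolerate the lower accuracy of partials.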