Amazon Connect transcription: real-time speech-to-text for AWS contact centers
Blog post from Gladia
Amazon Connect transcription services face challenges with high costs and accuracy issues, particularly for multilingual scenarios and long call durations. To address these, routing audio through Kinesis Video Streams or S3 to Solaria-1 is recommended, as it eliminates the risk of AWS Lambda's 15-minute timeout and reduces Word Error Rate (WER) by 29% compared to other solutions. The native transcription issues can lead to corrupted CRM records and compliance risks. Solaria-1 offers a more accurate and cost-effective solution, especially in environments with accented or multilingual speech, by integrating with existing Amazon Connect infrastructure without altering the telephony layer. This setup ensures enhanced real-time transcription and post-call analysis capabilities, providing structured outputs for CRM and QA platforms. The comprehensive solution includes features like sentiment analysis, PII redaction, and code-switching detection, which are critical for high-volume contact centers operating in diverse linguistic environments.
| Trend | Post Mentions | Total Month Mentions | Posts | Companies | MoM |
|---|---|---|---|---|---|
| Real-time | 26 | 5,457 | 1,338 | 238 | -5% |
| Serverless | 25 | 1,011 | 235 | 82 | -44% |
| LLM | 3 | 5,172 | 1,006 | 220 | -43% |
| RAG | 1 | 885 | 228 | 95 | -58% |