Together AI Launches Speech-to-Text: High-Performance Whisper APIs
Blog post from Together AI
Together AI has launched its new speech-to-text APIs, addressing the speed and quality challenges faced by voice application developers. Their Whisper V3 Large deployment offers transcription 15 times faster than OpenAI while maintaining accuracy, thanks to optimizations like smart voice activity detection, intelligent chunking, and improved GPU utilization. These advancements enable real-time applications in sectors like customer support, meetings, and healthcare, by eliminating the traditional bottlenecks associated with audio processing. The APIs support files over 1GB, provide superior word-level alignment, and handle over 50 languages, offering substantial cost savings for high-volume applications. The service is designed for easy integration, with compatibility for existing Whisper users and an interactive playground for real-time testing. This release marks a significant step towards building a comprehensive voice infrastructure, making voice-enabled applications faster and more accessible.