What's new in Universal-3 Pro: smarter code-switching, faster turnaround, and better timestamps
Blog post from AssemblyAI
Universal-3 Pro has undergone significant enhancements, making it the most accurate model in the market for speech-to-text tasks, particularly in code-switching, disfluencies, turnaround time, diarization, and timestamps. These updates include a ~19% relative improvement in code-switching benchmarks and ~5.9% improvement in capturing disfluencies, crucial for verbatim transcription workloads. The model now offers the fastest turnaround time in AssemblyAI’s lineup, with up to 34% improvement in latency and more accurate speaker diarization, handling up to 30 speakers for longer audio files. Timestamp precision has also seen substantial gains, especially for non-English content, enhancing its utility for tasks requiring word-level timing accuracy. These improvements are automatically available for current Universal-3 Pro users, making it a superior choice over its predecessor, Universal-2, across various performance metrics.