/plushcap/analysis/assemblyai/boosting-transcript-readability-with-automatic-punctuation-and-casing-and-itn

Boosting Transcript Readability with Automatic Punctuation and Casing and ITN

What's this blog post about?

This text discusses the process of automatic punctuation and casing in transcription texts using the AssemblyAI Speech-to-Text API. It explains how the model is trained on a large dataset to predict proper punctuation and casing, making it easier for users to read transcriptions. The text also introduces Inverse Text Normalization (ITN), which translates spoken form of text into its written form. The accuracy of the Punctuation and Casing Model is mentioned as being 93.5%, well above industry standards. The text concludes by discussing future model updates and improvements.

Company
AssemblyAI

Date published
Feb. 7, 2022

Author(s)
Kelsey Foster

Word count
1471

Hacker News points
None found.

Language
English


By Matt Makai. 2021-2024.