How we measure medical transcription: MER, and why WER lies to you
Blog post from AssemblyAI
The text discusses the inadequacy of Word Error Rate (WER) as a metric for evaluating medical transcription accuracy, highlighting its failure to differentiate between inconsequential errors, like missing filler words, and critical mistakes, such as incorrect drug names. It introduces the Missed Entity Rate (MER) as a superior alternative, focusing on the accurate transcription of clinically meaningful words—such as drug names, dosages, and diagnoses—essential for patient safety. The text provides a comparison of various transcription models, showcasing that a low WER does not necessarily indicate accurate medical transcription, while emphasizing the importance of measuring MER on one's own audio to ensure accurate results. The narrative argues for prioritizing MER over WER in medical transcription to reduce clinical risk, as well as the need for human review despite lower MER scores, to ensure comprehensive quality assurance.