On the Benefits of Confidence Visualization in Speech Recognition

Home > Publications > On the Benefits of Confidence Visualization in Speech Recognition

On the Benefits of Confidence Visualization in Speech Recognition

Keith Vertanen, Per Ola Kristensson

CHI '08: Proceedings of the ACM Conference on Human Factors in Computing Systems, 2008.

In a typical speech dictation interface, the recognizer's best-guess is displayed as normal, unannotated text. This ignores potentially useful information about the recognizer's confidence in its recognition hypothesis. Using a confidence measure (which itself may sometimes be inaccurate), we investigated providing visual feedback about low-confidence portions of the recognition using shaded, red underlining. An evaluation showed, compared to a baseline without underlining, underlining low-confidence areas did not increase user's speed or accuracy in detecting errors. However, we found that when recognition errors were correctly underlined, they were discovered significantly more often than baseline. Conversely, when errors failed to be underlined, they were discovered less often. Our results indicate confidence visualization can be effective - but only if the confidence measure has high accuracy. Further, since our results show that users tend to trust confidence visualization, designers should be careful in its application if a high accuracy confidence measure is not available.

Paper:
Reference:
Video:

There are more details about this research in chapter 2 of my thesis.