Acoustic Model Adaptation for Speech Recognition

Koichi SHINODA

doi:10.1587/transinf.E93.D.2348

Abstract

Statistical speech recognition using continuous-density hidden Markov models (CDHMMs) has yielded many practical applications. However, in general, mismatches between the training data and input data significantly degrade recognition accuracy. Various acoustic model adaptation techniques using a few input utterances have been employed to overcome this problem. In this article, we survey these adaptation techniques, including maximum a posteriori (MAP) estimation, maximum likelihood linear regression (MLLR), and eigenvoice. We also present a schematic view called the adaptation pyramid to illustrate how these methods relate to each other.

Content from these authors

Favorites & Alerts

Corresponding author

Register with J-STAGE for free!