Improvement of Lecture Speech Recognition by Using Unsupervised Adaptation

Improvement of Lecture Speech Recognition by Using Unsupervised Adaptation

Author: 
Kosaka, Tetsuo
Place: 
Hershey, PA
Publisher: 
IGI Global
Date published: 
2011
Record type: 
Responsibility: 
Kusama, Takashi, jt. author
Kato, Masaharu, jt. author
Editor: 
Matsuo, Tokuro
Journal Title: 
E-Activity and Intelligent Web Construction
Source: 
E-Activity and Intelligent Web Construction
Subject: 
Abstract: 

The aim of this work is to improve the recognition performance of spontaneous speech. In order to achieve the purpose, the authors of this chapter propose new approaches of unsupervised adaptation for spontaneous speech and evaluate the methods by using diagonal-covariance and full-covariance hidden Markov models. In the adaptation procedure, both methods of language model (LM) adaptation and acoustic model (AM) adaptation are used iteratively. Several combination methods are tested to find the optimal approach. In the LM adaptation, a word trigram model and a part-of-speech (POS) trigram model are combined to build a more task-specific LM. In addition, the authors propose an unsupervised speaker adaptation technique based on adaptation data weighting. The weighting is performed depending on POS class. In Japan, a large-scale spontaneous speech database “Corpus of Spontaneous Japanese (CSJ)” has been used as the common evaluation database for spontaneous speech and the authors used it for their recognition experiments. From the results, the proposed methods demonstrated a significant advantage in that task.

Series: 
Advances in Web Technologies and Engineering

CITATION: Kosaka, Tetsuo. Improvement of Lecture Speech Recognition by Using Unsupervised Adaptation edited by Matsuo, Tokuro . Hershey, PA : IGI Global , 2011. E-Activity and Intelligent Web Construction - Available at: https://library.au.int/improvement-lecture-speech-recognition-using-unsupervised-adaptation