Anomaly Detection based Pronunciation Verification Approach Using Speech Attribute Features

Publication date: Available online 11 June 2019Source: Speech CommunicationAuthor(s): Mostafa Shahin, Beena AhmedAbstractComputer aided pronunciation training tools require accurate automatic pronunciation error detection algorithms to identify errors made by their users. However, the performance of these algorithms is highly dependent on the amount of mispronounced speech data used to train them and the reliability of its manual annotation. To overcome this problem, we turned the mispronunciation detection into an anomaly detection problem, which utilize algorithms trained with only correctly pronounced speech data. In this work we adopted the One-Class SVM as our anomaly detection model, with a specific model built for each phoneme. Each model was fed with a set of speech attribute features, namely the manners and places of articulation, extracted from a bank of binary DNN speech attribute detectors. We also applied multi-task learning and dropout approaches to alleviate the overfitting problem in the DNN speech attribute detectors. We trained the system using the WSJ0 and TIMIT standard data sets which contain only native English speech data and then evaluated it using three different data sets, a native English speaker corpus with artificial errors, a foreign-accented speech corpus and a children's disordered speech corpus. Finally, we compared our system with the conventional Goodness-of-Pronunciation (GOP) algorithm to demonstrate the effectiveness of our method. The re...
Source: Speech Communication - Category: Speech-Language Pathology Source Type: research