Non-native children's automatic speech recognition: The INTERSPEECH 2020 shared task ALTA systems

Knill, KM; Wang, L; Wang, Y; Wu, X; Gales, MJF

Non-native children's automatic speech recognition: The INTERSPEECH 2020 shared task ALTA systems

Accepted version

Peer-reviewed

Repository URI

https://www.repository.cam.ac.uk/handle/1810/308723

Repository DOI

https://doi.org/10.17863/CAM.55812

Files

Accepted version (119.92 KB)

Type

Conference Object

Authors

Knill, KM

Wang, L

Wang, Y

Wu, X

Gales, MJF

Abstract

Automatic spoken language assessment (SLA) is a challenging problem due to the large variations in learner speech combined with limited resources. These issues are even more problematic when considering children learning a language, with higher levels of acoustic and lexical variability, and of code-switching compared to adult data. This paper describes the ALTA system for the INTERSPEECH 2020 Shared Task on Automatic Speech Recognition for Non-Native Children’s Speech. The data for this task consists of examination recordings of Italian school children aged 9-16, ranging in ability from minimal, to basic, to limited but effective command of spoken English. A variety of systems were developed using the limited training data available, 49 hours. State-of-the-art acoustic models and language models were evaluated, including a diversity of lexical representations, handling code-switching and learner pronunciation errors, and grade specific models. The best single system achieved a word error rate (WER) of 16.9% on the evaluation data. By combining multiple diverse systems, including both grade independent and grade specific models, the error rate was reduced to 15.7%. This combined system was the best performing submission for both the closed and open tasks.

Keywords

speech recognition, children's speech, language learning

Journal Title

Proceedings of the Annual Conference of the International Speech Communication Association, INTERSPEECH

Conference Name

Interspeech 2020

Journal ISSN

2308-457X
1990-9772

Volume Title

2020-October

Publisher

ISCA

Publisher DOI

https://doi.org/10.21437/Interspeech.2020-2154

Rights

Sponsorship

Cambridge Assessment (Unknown)

Collections

University of Cambridge Research Outputs