Automatic Speech Recognition of Quechua Language Using HMM Toolkit

Zevallos, R.; Cordova, J.; Camacho, L.

doi:https://doi.org/10.1007/978-3-030-46140-9_6

Automatic Speech Recognition of Quechua Language Using HMM Toolkit

Date

2020

Authors

Zevallos, R.

Cordova, J.

Camacho, L.

Publisher

Springer

URI

http://hdl.handle.net/20.500.14657/205921

DOI

https://doi.org/10.1007/978-3-030-46140-9_6

Abstract

In this paper, we present the implementation of an Automatic Speech Recognition system (ASR) for southern Quechua language. The software can recognize both continuous speech and isolated words. The ASR was developed using Hidden Markov Model Toolkit (HTK) and the corpus collected by Siminchikkunarayku. A dictionary provides the system with a mapping of vocabulary words to sequences of phonemes; the audio files were processed to extract the speech feature vectors (MFCC) and then, the acoustic model was trained using the MFCC files until its convergence. The paper also describes a detailed architecture of an ASR system developed using HTK library modules and tools. The ASR was tested using the audios recorded by volunteers obtaining a 12.70% word error rate.

Keywords

Hidden Markov model, Computer science, Speech recognition, Mel-frequency cepstrum, Word error rate, Vocabulary, Artificial intelligence, Natural language processing, Feature (linguistics), Word (group theory), Language model, Software, Feature extraction, Linguistics, Programming language

Collections

Artículos (DFI)

Full item page

Automatic Speech Recognition of Quechua Language Using HMM Toolkit

Date

Authors

Journal Title

Journal ISSN

Volume Title

Publisher

URI

DOI

Acceso al texto completo solo para la Comunidad PUCP

Abstract

Description

Keywords

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By