Automatic Speech Recognition of Quechua Language Using HMM Toolkit

Zevallos, R.; Cordova, J.; Camacho, L.

doi:https://doi.org/10.1007/978-3-030-46140-9_6

Automatic Speech Recognition of Quechua Language Using HMM Toolkit

dc.contributor.affiliation	Pontificia Universidad Católica del Perú
dc.contributor.author	Zevallos, R.
dc.contributor.author	Cordova, J.
dc.contributor.author	Camacho, L.
dc.date.accessioned	2026-03-13T16:58:26Z
dc.date.issued	2020
dc.description.abstract	In this paper, we present the implementation of an Automatic Speech Recognition system (ASR) for southern Quechua language. The software can recognize both continuous speech and isolated words. The ASR was developed using Hidden Markov Model Toolkit (HTK) and the corpus collected by Siminchikkunarayku. A dictionary provides the system with a mapping of vocabulary words to sequences of phonemes; the audio files were processed to extract the speech feature vectors (MFCC) and then, the acoustic model was trained using the MFCC files until its convergence. The paper also describes a detailed architecture of an ASR system developed using HTK library modules and tools. The ASR was tested using the audios recorded by volunteers obtaining a 12.70% word error rate.
dc.description.sponsorship	Funding: This project was supported by CONCYTEC CIENCIACTIVA of the Peruvión government through grant 164-2015-FONDECYT and by PUCP through grant 2017-3-0039/436.
dc.identifier.doi	https://doi.org/10.1007/978-3-030-46140-9_6
dc.identifier.uri	http://hdl.handle.net/20.500.14657/205921
dc.language.iso	eng
dc.publisher	Springer
dc.relation.conferencename	Communicatións in Computer and Information Science; Vol. 1070 CCIS (2020)
dc.relation.ispartof	urn:isbn:978-3-030-46140-9
dc.rights	info:eu-repo/semantics/closedAccess
dc.subject	Hidden Markov model
dc.subject	Computer science
dc.subject	Speech recognition
dc.subject	Mel-frequency cepstrum
dc.subject	Word error rate
dc.subject	Vocabulary
dc.subject	Artificial intelligence
dc.subject	Natural language processing
dc.subject	Feature (linguistics)
dc.subject	Word (group theory)
dc.subject	Language model
dc.subject	Software
dc.subject	Feature extraction
dc.subject	Linguistics
dc.subject	Programming language
dc.subject.ocde	https://purl.org/pe-repo/ocde/ford#1.02.01
dc.title	Automatic Speech Recognition of Quechua Language Using HMM Toolkit
dc.type	http://purl.org/coar/resource_type/c_5794
dc.type.other	Comunicación de congreso
dc.type.version	https://vocabularies.coar-repositories.org/version_types/c_970fb48d4fbd8a85/

Collections

Artículos (DFI)

Automatic Speech Recognition of Quechua Language Using HMM Toolkit

Files

Collections