Automatic Speech Recognition of Quechua Language Using HMM Toolkit

dc.contributor.affiliationPontificia Universidad Católica del Perú
dc.contributor.authorZevallos, R.
dc.contributor.authorCordova, J.
dc.contributor.authorCamacho, L.
dc.date.accessioned2026-03-13T16:58:26Z
dc.date.issued2020
dc.description.abstractIn this paper, we present the implementation of an Automatic Speech Recognition (ASR) system for the Southern Quechua language. The software can recognize both continuous speech and isolated words. The ASR system was developed using the Hidden Markov Model Toolkit (HTK) and the corpus collected by Siminchikkunarayku. A dictionary provides the system with a mapping from vocabulary words to sequences of phonemes. The audio files were processed to extract speech feature vectors (MFCC), and the acoustic model was then trained on the MFCC files until convergence. The paper also describes in detail the architecture of an ASR system built with HTK library modules and tools. The system was tested on audio recorded by volunteers, obtaining a word error rate of 12.70%.
dc.description.sponsorshipFunding: This project was supported by CONCYTEC CIENCIACTIVA of the Peruvian government through grant 164-2015-FONDECYT and by PUCP through grant 2017-3-0039/436.
dc.identifier.doihttps://doi.org/10.1007/978-3-030-46140-9_6
dc.identifier.urihttp://hdl.handle.net/20.500.14657/205921
dc.language.isoeng
dc.publisherSpringer
dc.relation.conferencenameCommunications in Computer and Information Science; Vol. 1070 CCIS (2020)
dc.relation.ispartofurn:isbn:978-3-030-46140-9
dc.rightsinfo:eu-repo/semantics/closedAccess
dc.subjectHidden Markov model
dc.subjectComputer science
dc.subjectSpeech recognition
dc.subjectMel-frequency cepstrum
dc.subjectWord error rate
dc.subjectVocabulary
dc.subjectArtificial intelligence
dc.subjectNatural language processing
dc.subjectFeature (linguistics)
dc.subjectWord (group theory)
dc.subjectLanguage model
dc.subjectSoftware
dc.subjectFeature extraction
dc.subjectLinguistics
dc.subjectProgramming language
dc.subject.ocdehttps://purl.org/pe-repo/ocde/ford#1.02.01
dc.titleAutomatic Speech Recognition of Quechua Language Using HMM Toolkit
dc.typehttp://purl.org/coar/resource_type/c_5794
dc.type.otherConference paper
dc.type.versionhttps://vocabularies.coar-repositories.org/version_types/c_970fb48d4fbd8a85/
