Spell-checking based on syllabification and character-level graphs for a Peruvian agglutinative language

dc.contributor.affiliation: Pontificia Universidad Católica del Perú. Facultad de Ciencias e Ingeniería
dc.contributor.affiliation: Pontificia Universidad Católica del Perú. Departamento de Ingeniería
dc.contributor.author: Alva, C.
dc.contributor.author: Oncevay, A.
dc.date.accessioned: 2026-03-13T16:59:39Z
dc.date.issued: 2017
dc.description.abstract: Peru has several native languages, most of which are agglutinative. These languages are transmitted from generation to generation mainly in oral form, which has led to different written forms across communities. For this reason, there are recent efforts to standardize spelling in written texts, and an automatic tool such as a spell-checker would support these efforts. To that end, a spelling corrector is being developed in two steps: an automatic rule-based syllabification method, and a character-level graph used to detect the degree of error in a misspelled word. The experiments were carried out on Shipibo-konibo, a highly agglutinative Amazonian language, and the results obtained on a dataset built for this purpose are promising.
dc.description.sponsorship: Funding: For this study, the authors acknowledge the support of the “Consejo Nacional de Ciencia, Tecnología e Innovación Tecnológica” (CONCYTEC Perú) under contract 225-2015-FONDECYT.
dc.identifier.doi: https://doi.org/10.18653/v1/w17-4116
dc.identifier.uri: http://hdl.handle.net/20.500.14657/206391
dc.language.iso: eng
dc.publisher: Association for Computational Linguistics (ACL)
dc.relation.conferencename: EMNLP 2017 - 1st Workshop on Subword and Character Level Models in NLP, SCLeM 2017 - Proceedings of the Workshop (2017)
dc.relation.ispartof: urn:isbn:9781945626913
dc.rights: info:eu-repo/semantics/closedAccess
dc.subject: Agglutinative language
dc.subject: Spelling
dc.subject: Syllabification
dc.subject: Computer science
dc.subject: Spell
dc.subject: Natural language processing
dc.subject: Artificial intelligence
dc.subject: Proofreading
dc.subject: Character (mathematics)
dc.subject: Graph
dc.subject: Language model
dc.subject: Word (group theory)
dc.subject: Speech recognition
dc.subject: Linguistics
dc.subject: Mathematics
dc.subject: Theoretical computer science
dc.subject: Syllable
dc.subject.ocde: https://purl.org/pe-repo/ocde/ford#1.02.00
dc.title: Spell-checking based on syllabification and character-level graphs for a Peruvian agglutinative language
dc.type: http://purl.org/coar/resource_type/c_5794
dc.type.other: Conference paper
dc.type.version: https://vocabularies.coar-repositories.org/version_types/c_970fb48d4fbd8a85/
