Spell-checking based on syllabification and character-level graphs for a peruvian agglutinative language

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

Association for Computational Linguistics (ACL)

Acceso al texto completo solo para la Comunidad PUCP

Abstract

There are several native languages in Peru which are mostly agglutinative. These languages are transmitted from generation to generation mainly in oral form, causing different forms of writing across different communities. For this reason, there are recent efforts to standardize the spelling in the written texts, and it would be beneficial to support these tasks with an automatic tool such as a spell-checker. In this way, this spelling corrector is being developed based on two steps: an automatic rule-based syllabification method and a character-level graph to detect the degree of error in a misspelled word. The experiments were realized on Shipibo-konibo, a highly agglutinative and Amazonian language, and the results obtained have been promising in a dataset built for the purpose.

Description

Keywords

Agglutinative language, Spelling, Syllabification, Computer science, Spell, Natural language processing, Artificial intelligence, Proofreading, Character (mathematics), Graph, Language model, Word (group theory), Speech recognition, Linguistics, Mathematics, Theoretical computer science, Syllable

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By