Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo
Loading...
Date
Journal Title
Journal ISSN
Volume Title
Publisher
European Language Resources Association (ELRA)
DOI
Acceso al texto completo solo para la Comunidad PUCP
Abstract
In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective to create a treebank in the context of a Computational Linguistic course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. We finally conduct some experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a Shipibo-Konibo treebank, another Panoan language resource.
Description
Keywords
Computation and Language
