Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

Loading...
Thumbnail Image

Date

Journal Title

Journal ISSN

Volume Title

Publisher

European Language Resources Association (ELRA)

DOI

Acceso al texto completo solo para la Comunidad PUCP

Abstract

In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective to create a treebank in the context of a Computational Linguistic course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. We finally conduct some experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a Shipibo-Konibo treebank, another Panoan language resource.

Description

Keywords

Computation and Language

Citation

Collections

Endorsement

Review

Supplemented By

Referenced By