Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo

dc.contributor.affiliation Pontificia Universidad Católica del Perú
dc.contributor.author Zariquiey, R.
dc.contributor.author Alvarado, C.
dc.contributor.author Echevarria, X.
dc.contributor.author Saltachin, L.
dc.contributor.author Gonzales, R.
dc.contributor.author Illescas, M.
dc.contributor.author Oporto, S.
dc.contributor.author Blum, F.
dc.contributor.author Oncevay, A.
dc.contributor.author Vera, J.
dc.date.accessioned 2026-03-13T17:01:05Z
dc.date.issued 2022
dc.description.abstract In this paper, we launch a new Universal Dependencies treebank for an endangered language from Amazonia: Kakataibo, a Panoan language spoken in Peru. We first discuss the collaborative methodology implemented, which proved effective for creating a treebank in the context of a Computational Linguistics course for undergraduates. Then, we describe the general details of the treebank and the language-specific considerations implemented for the proposed annotation. Finally, we conduct experiments on part-of-speech tagging and syntactic dependency parsing. We focus on monolingual and transfer learning settings, where we study the impact of a Shipibo-Konibo treebank, another Panoan language resource.
dc.description.sponsorship Funding: The first author acknowledges the support of CONCYTEC-ProCiencia, Peru, under the contract 183-2018-FONDECYT-BM-IADT-MU from the funding call E041-2018-01-BM.
dc.identifier.uri http://hdl.handle.net/20.500.14657/206828
dc.language.iso eng
dc.publisher European Language Resources Association (ELRA)
dc.relation.conferencename 2022 Language Resources and Evaluation Conference, LREC 2022
dc.relation.uri https://aclanthology.org/2022.lrec-1.409/
dc.rights info:eu-repo/semantics/closedAccess
dc.subject Computation and Language
dc.subject.ocde https://purl.org/pe-repo/ocde/ford#1.02.01
dc.title Building an Endangered Language Resource in the Classroom: Universal Dependencies for Kakataibo
dc.type http://purl.org/coar/resource_type/c_5794
dc.type.other Conference paper
dc.type.version https://vocabularies.coar-repositories.org/version_types/c_970fb48d4fbd8a85/
