08/12/2020

Parsing in the absence of related languages: Evaluating low-resource dependency parsers on Tagalog

Angelina Aquino, Franz de Leon

Keywords:

Abstract: Cross-lingual and multilingual methods have been widely suggested as options for dependency parsing of low-resource languages; however, these typically require the use of annotated data in related high-resource languages. In this paper, we evaluate the performance of these methods versus monolingual parsing of Tagalog, an Austronesian language which shares little typological similarity with any existing high-resource languages. We show that a monolingual model developed on minimal target language data consistently outperforms all cross-lingual and multilingual models when no closely-related sources exist for a low-resource language.

The video of this talk cannot be embedded. You can watch it here:
https://underline.io/lecture/6546-parsing-in-the-absence-of-related-languages-evaluating-low-resource-dependency-parsers-on-tagalog
(Link will open in new window)
 0
 0
 0
 0
This is an embedded video. Talk and the respective paper are published at COLING Workshops 2020 virtual conference. If you are one of the authors of the paper and want to manage your upload, see the question "My papertalk has been externally embedded..." in the FAQ section.

Comments

Post Comment
no comments yet
code of conduct: tbd

Similar Papers