Repository logo
 
Loading...
Thumbnail Image
Publication

CINTIL DependencyBank PREMIUM. A corpus of grammatical dependencies for Portuguese

Use this identifier to reference this record.
Name:Description:Size:Format: 
2016CarvalhoQueridoCamposEtAl.pdf336.59 KBAdobe PDF Download

Advisor(s)

Abstract(s)

This paper presents a new linguistic resource for the study and computational processing of Portuguese. CINTIL DependencyBank PREMIUM is a corpus of Portuguese news text, accurately manually annotated with a wide range of linguistic information (morpho-syntax, named-entities, syntactic function and semantic roles), making it an invaluable resource specially for the development and evaluation of data-driven natural language processing tools. The corpus is under active development, reaching 4,000 sentences in its current version. The paper also reports on the training and evaluation of a dependency parser over this corpus. CINTIL DependencyBank PREMIUM is freely-available for research purposes through META-SHARE.

Description

Keywords

Dependency bank Corpora Dependency parsing

Pedagogical Context

Citation

de Carvalho, R., Andreia Querido, Rita Valadas Pereira, Marisa Campos, João Silva, & António Branco. “CINTIL DependencyBank PREMIUM. A corpus of grammatical dependencies for Portuguese”. In Proceedings of the 10th Language Resources and Evaluation Conference (LREC 2016), Portoroz, Eslovénia, 23-28 de maio de 2016.

Organizational Units

Journal Issue

Publisher

European Language Resources Association

CC License