| Name: | Description: | Size: | Format: | |
|---|---|---|---|---|
| 336.59 KB | Adobe PDF |
Advisor(s)
Abstract(s)
This paper presents a new linguistic resource for the study and computational processing of Portuguese. CINTIL DependencyBank PREMIUM is a corpus of Portuguese news text, accurately manually annotated with a wide range of linguistic information (morpho-syntax, named-entities, syntactic function and semantic roles), making it an invaluable resource specially for the development and evaluation of data-driven natural language processing tools. The corpus is under active development, reaching 4,000 sentences in its current version. The paper also reports on the training and evaluation of a dependency parser over this corpus. CINTIL DependencyBank PREMIUM is freely-available for research purposes through META-SHARE.
Description
Keywords
Dependency bank Corpora Dependency parsing
Pedagogical Context
Citation
de Carvalho, R., Andreia Querido, Rita Valadas Pereira, Marisa Campos, João Silva, & António Branco. “CINTIL DependencyBank PREMIUM. A corpus of grammatical dependencies for Portuguese”. In Proceedings of the 10th Language Resources and Evaluation Conference (LREC 2016), Portoroz, Eslovénia, 23-28 de maio de 2016.
