Repository logo
 
Loading...
Thumbnail Image
Publication

Lexical semantics annotation for enriched Portuguese corpora

Use this identifier to reference this record.
Name:Description:Size:Format: 
2016NealePereiraSilvaBranco.pdf137.49 KBAdobe PDF Download

Advisor(s)

Abstract(s)

The semantic annotation of corpora has an important role to play in ensuring that sentences occurring in natural language texts are correctly understood based on their intended context. Two examples of lexical semantic units that contribute to this knowledge are word senses – which allow words with multiple meanings to be understood based on the context in which they are used – and named entities – which can be disambiguated and linked back to the specific encyclopedic resources that describe them. In this paper, we describe the construction of lexical semanticallyannotated corpora for Portuguese, annotated with both word senses linked to senses in a Portuguese wordnet and named entities linked to Portuguese Wikipedia entries using DBpedia. The result is a goldstandard lexical semantically-annotated resource that is useful in supporting the training and evaluation of tools for the disambiguation of these lexical units in Portuguese.

Description

Keywords

Annotated corpora Lexical semantics Word senses Named entities Portuguese

Pedagogical Context

Citation

Neale, S., Rita Valadas Pereira, João Silva, & António Branco. "Lexical semantics annotation for enriched Portuguese corpora". In Lecture Notes in Artificial Intelligence, 9727, Berlim: Springer, pp. 296-305.

Organizational Units

Journal Issue

Publisher

Springer

CC License

Altmetrics