Please use this identifier to cite or link to this record: http://hdl.handle.net/10451/37489
Title: Open Resources and Tools for the Shallow Processing of Portuguese: the TagShare project
Author: Barreto, Florbela
Branco, António
Ferreira, Eduardo
Mendes, Amália
Bacelar do Nascimento, Maria Fernanda
Nunes, Filipe
Silva, João Ricardo
Date: 2006
Publisher: European Language Resources Association
Citation: Barreto, F., Branco, A., Ferreira, E., Mendes, A., Bacelar do Nascimento, M. F., Nunes, F. & Silva, J. R. (2006): "Open Resources and Tools for the Shallow Processing of Portuguese: the TagShare project", in Proceedings of the V International Conference on Language Resources and Evaluation - LREC2006, Genoa, May 22-28, 2006.
Abstract: This paper presents the TagShare project and the linguistic resources and tools for the shallow processing of Portuguese developed in its scope. These resources include a 1 million token corpus that has been accurately hand-annotated with a variety of linguistic information, as well as several state-of-the-art shallow processing tools capable of automatically producing that type of annotation. At present, the linguistic annotations in the corpus are sentence and paragraph boundaries, token boundaries, morphosyntactic POS categories, values of inflection features, lemmas and named entities. Hence, the set of tools comprises a sentence chunker, a tokenizer, a POS tagger, nominal and verbal analyzers and lemmatizers, a verbal conjugator, a nominal "inflector", and a named-entity recognizer, some of which underlie several online services.
URI: http://hdl.handle.net/10451/37489
Publisher Version: http://www.lrec-conf.org/proceedings/lrec2006/
Appears in Collections: FL - CLUL - Livros de Actas

Files in this record:
File: lrec2006_final.pdf
Size: 835.02 kB
Format: Adobe PDF

All records in the repository are protected by copyright law, with all rights reserved.