Utilize este identificador para referenciar este registo:
http://hdl.handle.net/10451/37489
Título: | Open Resources and Tools for the Shallow Processing of Portuguese: the TagShare project |
Autor: | Barreto, Florbela Branco, António Ferreira, Eduardo Mendes, Amália Bacelar do Nascimento, Maria Fernanda Nunes, Filipe Silva, João Ricardo |
Data: | 2006 |
Editora: | European Language Resources Association |
Citação: | Barreto, F., Branco, A., Ferreira, E., Mendes, A., Bacelar do Nascimento, M. F., Nunes, F. & Silva, J. R. (2006): "Open Resources and Tools for the Shallow Processing of Portuguese: the TagShare project", in Proceedings of the V International Conference on Language Resources and Evaluation - LREC2006, Genoa, May 22-28, 2006. |
Resumo: | This paper presents the TagShare project and the linguistic resources and tools for the shallow processing of Portuguese developed in its scope. These resources include a 1 million token corpus that has been accurately hand annotated with a variety of linguistic information, as well as several state-ofthe-art shallow processing tools capable of automatically producing that type of annotation. At present, the linguistic annotations in the corpus are sentence and paragraph boundaries, token boundaries, morphosyntactic POScategories, values of inflection features, lemmas and named entities. Hence, the set of tools comprise a sentence chunker, a tokenizer, a POS tagger, nominal and verbal analyzers and lemmatizers, a verbal conjugator, a nominal “inflector”, and a named-entity recognizer, some of which underline several online services. |
URI: | http://hdl.handle.net/10451/37489 |
Versão do Editor: | http://www.lrec-conf.org/proceedings/lrec2006/ |
Aparece nas colecções: | FL - CLUL - Livros de Actas |
Ficheiros deste registo:
Ficheiro | Descrição | Tamanho | Formato | |
---|---|---|---|---|
lrec2006_final.pdf | 835,02 kB | Adobe PDF | Ver/Abrir |
Todos os registos no repositório estão protegidos por leis de copyright, com todos os direitos reservados.