Repository logo
 
Publication

COMBINA-PT: a Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions

dc.contributor.authorMendes, Amália
dc.contributor.authorAntunes, Sandra
dc.contributor.authorBacelar do Nascimento, Maria Fernanda
dc.contributor.authorCasteleiro, João Miguel
dc.contributor.authorPereira, Luísa
dc.contributor.authorSá, Tiago
dc.date.accessioned2019-03-13T15:05:58Z
dc.date.available2019-03-13T15:05:58Z
dc.date.issued2006
dc.description.abstractThis paper presents the COMBINA-PT project, a study of corpus-extracted Portuguese Multiword (MW) expressions. The objective of this on-going project is to compile a large lexical database of multiword (MW) units of the Portuguese language, automatically extracted from a balanced 50 million word corpus, interpreted with lexical association measures and manually validated. MW expressions considered in the database include named entities and lexical associations with different degrees of cohesion, ranging from frozen groups, which undergo little or no variation, to lexical collocations composed of words that tend to occur together and that constitute syntactic dependencies, although with a low degree of fixedness. This new resource has a two-fold objective: (i) to be an important research tool which supports the development of MW expressions typologies and their lexicographic treatment; (ii) to be of major help in developing and evaluating language processing tools able of dealing with MW expressionspt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationMendes, A., Antunes, S., Bacelar do Nascimento, M. F., Casteleiro, J. M., Pereira, L. & Sá, T. (2006): "COMBINA-PT: a Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressions", in Proceedings of the V International Conference on Language Resources and Evaluation - LREC2006, Genoa, May 22-28, 2006.pt_PT
dc.identifier.urihttp://hdl.handle.net/10451/37490
dc.language.isoengpt_PT
dc.publisherEuropean Language Resources Associationpt_PT
dc.relationWord combinations in portuguese language COMBINA-PT
dc.relation.publisherversionhttp://www.lrec-conf.org/proceedings/lrec2006/pt_PT
dc.titleCOMBINA-PT: a Large Corpus-extracted and Hand-checked Lexical Database of Portuguese Multiword Expressionspt_PT
dc.typeconference object
dspace.entity.typePublication
oaire.awardTitleWord combinations in portuguese language COMBINA-PT
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/POCI/POCTI%2FLIN%2F48465%2F2002/PT
oaire.citation.conferencePlaceGenoapt_PT
oaire.citation.titleProceedings of the V International Conference on Language Resources and Evaluation - LREC2006pt_PT
oaire.fundingStreamPOCI
person.familyNameMendes
person.givenNameAmália
person.identifier.ciencia-id4018-7A6F-1873
person.identifier.orcid0000-0001-6815-2674
person.identifier.scopus-author-id14035817100
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typeconferenceObjectpt_PT
relation.isAuthorOfPublication94be597b-a42a-42f4-8f1d-822fa454b910
relation.isAuthorOfPublication.latestForDiscovery94be597b-a42a-42f4-8f1d-822fa454b910
relation.isProjectOfPublication72c89123-b496-47ac-928f-b20b984f092a
relation.isProjectOfPublication.latestForDiscovery72c89123-b496-47ac-928f-b20b984f092a

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
poster_combina_LREC2006_final.pdf
Size:
447.46 KB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.2 KB
Format:
Item-specific license agreed upon to submission
Description: