Logo do repositório
 
Publicação

A Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language.

dc.contributor.authorSequeira, João
dc.contributor.authorGonçalves, Teresa
dc.contributor.authorQuaresma, Paulo
dc.contributor.authorMendes, Amália
dc.contributor.authorHendrickx, Iris
dc.date.accessioned2019-03-07T14:22:05Z
dc.date.available2019-03-07T14:22:05Z
dc.date.issued2018
dc.description.abstractThis work presents a comparative study between two different approaches to build an automatic classification system for Modalityvalues in the Portuguese language. One approach uses a single multi-class classifier with the full dataset that includes eleven modal verbs; the other builds different classifiers, one for each verb. The performance is measured using precision, recall and F1. Due to the unbalanced nature of the dataset a weighted average approach was calculated for each metric. We use support vector machines as ourclassifier and experimented with various SVM kernels to find the optimal classifier for the task at hand. We experimented with several different types of feature attributes representing parse tree information and compare these complex feature representation against a simple bag-of-words feature representation as baseline. The best obtained F1values are above 0.60 and from the results it is possible to conclude that there is no significant difference between both approaches.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationSequeira, João, Teresa Gonçalves, Paulo Quaresma, Amália Mendes, Iris Hendrickx (2018) A Multi-versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language. In Proceedings of the 11th Language Resources and Evaluation Conference - LREC’2018, 7-12 May 2018, Miyazaki, Japan, pp. 1000-1005.pt_PT
dc.identifier.isbn979-10-95546-00-9
dc.identifier.urihttp://hdl.handle.net/10451/37352
dc.language.isoengpt_PT
dc.publisherEuropean Language Resources Associationpt_PT
dc.relation.publisherversionhttp://www.lrec-conf.org/proceedings/lrec2018/pdf/616.pdfpt_PT
dc.subjectNatural language processingpt_PT
dc.subjectModalitypt_PT
dc.subjectFeature selectionpt_PT
dc.subjectSupport Vector Machinespt_PT
dc.titleA Multi- versus a Single-classifier Approach for the Identification of Modality in the Portuguese Language.pt_PT
dc.typejournal article
dspace.entity.typePublication
oaire.citation.conferencePlaceMiyazakipt_PT
oaire.citation.endPage1005pt_PT
oaire.citation.startPage1000pt_PT
oaire.citation.titleProceedings of the 11th Language Resources and Evaluation Conference - LREC’2018pt_PT
person.familyNameMendes
person.givenNameAmália
person.identifier.ciencia-id4018-7A6F-1873
person.identifier.orcid0000-0001-6815-2674
person.identifier.scopus-author-id14035817100
rcaap.rightsopenAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublication94be597b-a42a-42f4-8f1d-822fa454b910
relation.isAuthorOfPublication.latestForDiscovery94be597b-a42a-42f4-8f1d-822fa454b910

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
616.pdf
Tamanho:
165.78 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.2 KB
Formato:
Item-specific license agreed upon to submission
Descrição: