Logo do repositório
 
Publicação

Portuguese Native Language Identification

dc.contributor.authorMalmasi, Shervin
dc.contributor.authordel Río, Iria
dc.contributor.authorZampieri, Marcos
dc.date.accessioned2019-01-18T10:18:30Z
dc.date.available2019-01-18T10:18:30Z
dc.date.issued2018
dc.description.abstractThis study presents the first Native Language Identification (NLI) study for L2 Portuguese.We used a sub-set of the NLI-PT dataset, containing texts written by speakers of five different native languages: Chinese, English, German, Italian, and Spanish.We explore the linguistic annotations available in NLI-PT to extract a range of (morpho-)syntactic features and apply NLI classification methods to predict the native language of the authors. The best results were obtained using an ensemble combination of the features, achieving 54:1% accuracy.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationShervin Malmasi, Iria del Río and Marcos Zampieri. 2018. Portuguese Native Language Identification. Proceedings of International Conference on the Computational Processing of Portuguese (PROPOR).pt_PT
dc.identifier.urihttp://hdl.handle.net/10451/36514
dc.language.isoengpt_PT
dc.subjectNative language identificationpt_PT
dc.subjectLearner corpuspt_PT
dc.subjectPortuguesept_PT
dc.titlePortuguese Native Language Identificationpt_PT
dc.typebook part
dspace.entity.typePublication
oaire.citation.conferencePlaceCanela, Brasilpt_PT
oaire.citation.titleInternational Conference on the Computational Processing of Portuguese (PROPOR)pt_PT
person.familyNamedel Río Gayo
person.givenNameIria
person.identifier.ciencia-idB01F-ECC3-7AAB
person.identifier.orcid0000-0002-4187-6485
rcaap.rightsopenAccesspt_PT
rcaap.typebookPartpt_PT
relation.isAuthorOfPublication6248a8c6-c073-47eb-a6f0-30e39bfa5c86
relation.isAuthorOfPublication.latestForDiscovery6248a8c6-c073-47eb-a6f0-30e39bfa5c86

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
portuguese-native-language-identification-2.pdf
Tamanho:
248.98 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.2 KB
Formato:
Item-specific license agreed upon to submission
Descrição: