Logo do repositório
 
A carregar...
Miniatura
Publicação

Portuguese Native Language Identification

Utilize este identificador para referenciar este registo.
Nome:Descrição:Tamanho:Formato: 
portuguese-native-language-identification-2.pdf248.98 KBAdobe PDF Ver/Abrir

Orientador(es)

Resumo(s)

This study presents the first Native Language Identification (NLI) study for L2 Portuguese.We used a sub-set of the NLI-PT dataset, containing texts written by speakers of five different native languages: Chinese, English, German, Italian, and Spanish.We explore the linguistic annotations available in NLI-PT to extract a range of (morpho-)syntactic features and apply NLI classification methods to predict the native language of the authors. The best results were obtained using an ensemble combination of the features, achieving 54:1% accuracy.

Descrição

Palavras-chave

Native language identification Learner corpus Portuguese

Contexto Educativo

Citação

Shervin Malmasi, Iria del Río and Marcos Zampieri. 2018. Portuguese Native Language Identification. Proceedings of International Conference on the Computational Processing of Portuguese (PROPOR).

Projetos de investigação

Unidades organizacionais

Fascículo

Editora

Licença CC