Logo do repositório
 
Publicação

Error annotation in the COPLE2 corpus

dc.contributor.authordel Río, Iria
dc.contributor.authorMendes, Amália
dc.date.accessioned2019-01-18T10:11:44Z
dc.date.available2019-01-18T10:11:44Z
dc.date.issued2018-09-22
dc.description.abstractWe present the general architecture of the error annotation system applied to the COPLE2 corpus, a learner corpus of Portuguese implemented on the TEITOK platform. We give a general overview of the corpus and of the TEITOK functionalities and describe how the error annotation is structured in a two-level system: first, a fully manual token-based and coarse-grained annotation is applied and produces a rough classification of the errors in three categories, paired with multi-level information for POS and lemma; second, a multi-word and fine-grained annotation in standoff is then semi-automatically produced based on the first level of annotation. The token-based level has been applied to 47% of the total corpus. We compare our system with other proposals of error annotation, and discuss the fine-grained tag set and the experiments to validate its applicability. An inter-annotator (IAA) experiment was performed on the two stages of our system using Cohen’s kappa and it achieved good results on both levels. We explore the possibilities offered by the token-level error annotation, POS and lemma to automatically generate the fine-grained error tags by applying conversion scripts. The model is planned in such a way as to reduce manual effort and rapidly increase the coverage of the error annotation over the full corpus. As the first learner corpus of Portuguese with error annotation, we expect COPLE2 to support new research in different fields connected with Portuguese as second/foreign language, like Second Language Acquisition/Teaching or Computer Assisted Learning.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationdel Río, I., & Mendes, A. (2018). Error annotation in the COPLE2 corpus. Revista Da Associação Portuguesa De Linguí­stica, (4), 225-239. https://doi.org/10.26334/2183-9077/rapln4ano2018a42pt_PT
dc.identifier.doihttps://doi.org/10.26334/2183-9077/rapln4ano2018a42pt_PT
dc.identifier.issn2183-9077
dc.identifier.urihttp://hdl.handle.net/10451/36512
dc.language.isoengpt_PT
dc.peerreviewedyespt_PT
dc.publisherAssociação Portuguesa de Linguí­sticapt_PT
dc.relationFundação Calouste Gulbenkian (Proc. nr. 134655)pt_PT
dc.relationDETECÇÃO E CORREÇÃO AUTOMÁTICA DE ERROS EM PORTUGUÊS SEGUNDA LÍNGUA/LÍNGUA ESTRANGEIRA
dc.relation.publisherversionhttps://ojs.apl.pt/index.php/rapl/article/view/42/31pt_PT
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/pt_PT
dc.subjectLearner corpuspt_PT
dc.subjectError annotationpt_PT
dc.subjectSecond language acquisitionpt_PT
dc.subjectNatural language processingpt_PT
dc.subjectCorpus de aprendentespt_PT
dc.subjectAnotação do erropt_PT
dc.subjectAquisição de língua segundapt_PT
dc.subjectProcessamento de língua naturalpt_PT
dc.titleError annotation in the COPLE2 corpuspt_PT
dc.typejournal article
dspace.entity.typePublication
oaire.awardNumberPEst-OE/LIN/UI0214/2013
oaire.awardNumberSFRH/BPD/109914/2015
oaire.awardTitleDETECÇÃO E CORREÇÃO AUTOMÁTICA DE ERROS EM PORTUGUÊS SEGUNDA LÍNGUA/LÍNGUA ESTRANGEIRA
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/3599-PPCDT/PEst-OE%2FLIN%2FUI0214%2F2013/PT
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/OE/SFRH%2FBPD%2F109914%2F2015/PT
oaire.citation.endPage239pt_PT
oaire.citation.startPage225pt_PT
oaire.citation.titleRevista Da Associação Portuguesa De Linguí­sticapt_PT
oaire.citation.volume4pt_PT
oaire.fundingStream3599-PPCDT
oaire.fundingStreamOE
person.familyNamedel Río Gayo
person.familyNameMendes
person.givenNameIria
person.givenNameAmália
person.identifier.ciencia-idB01F-ECC3-7AAB
person.identifier.ciencia-id4018-7A6F-1873
person.identifier.orcid0000-0002-4187-6485
person.identifier.orcid0000-0001-6815-2674
person.identifier.scopus-author-id14035817100
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublication6248a8c6-c073-47eb-a6f0-30e39bfa5c86
relation.isAuthorOfPublication94be597b-a42a-42f4-8f1d-822fa454b910
relation.isAuthorOfPublication.latestForDiscovery6248a8c6-c073-47eb-a6f0-30e39bfa5c86
relation.isProjectOfPublicationf3d1337e-88ba-480e-b974-1f9d806b7efc
relation.isProjectOfPublicationa663ff8b-f624-4c4d-af6f-69ab41fcbe3b
relation.isProjectOfPublication.latestForDiscoveryf3d1337e-88ba-480e-b974-1f9d806b7efc

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
ErrorannotationintheCOPLE2corpus.pdf
Tamanho:
987.93 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.2 KB
Formato:
Item-specific license agreed upon to submission
Descrição: