Logo do repositório
 
Publicação

The COPLE2 Corpus: a Learner Corpus for Portuguese

dc.contributor.authorMendes, Amália
dc.contributor.authorAntunes, Sandra
dc.contributor.authorJansseen, Maarten
dc.contributor.authorGonçalves, Anabela
dc.date.accessioned2018-01-17T16:46:22Z
dc.date.available2018-01-17T16:46:22Z
dc.date.issued2016
dc.description.abstractWe present the COPLE2 corpus, a learner corpus of Portuguese that includes written and spoken texts produced by learners of Portuguese as a second or foreign language. The corpus includes at the moment a total of 182,474 tokens and 978 texts, classified according to the CEFR scales. The original handwritten productions are transcribed in TEI compliant XML format and keep record of all the original information, such as reformulations, insertions and corrections made by the teacher, while the recordings are transcribed and aligned with EXMARaLDA. The TEITOK environment enables different views of the same document (XML, student version, corrected version), a CQP-based search interface, the POS, lemmatization and normalization of the tokens, and will soon be used for error annotation in stand-off format. The corpus has already been a source of data for phonological, lexical and syntactic interlanguage studies and will be used for a data-informed selection of language features for each proficiency level.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationMendes, Amália, Sandra Antunes, Maarten Janssen & Anabela Gonçalves (2016) The COPLE2 Corpus: A Learner Corpus for Portuguese. In: Proceedings of the Tenth Language Resources and Evaluation Conference – LREC’16, 23-28 May 2016, Portoroz, Slovenia, 3207-3214pt_PT
dc.identifier.isbn978-2-9517408-9-1
dc.identifier.urihttp://hdl.handle.net/10451/30692
dc.language.isoengpt_PT
dc.publisherEuropean Language Resources Associationpt_PT
dc.relation.publisherversionhttp://www.lrec-conf.org/proceedings/lrec2016/pdf/439_Paper.pdf
dc.subjectLearner corpuspt_PT
dc.subjectCorpus compilationpt_PT
dc.subjectLanguage learningpt_PT
dc.subjectLanguage teachingpt_PT
dc.titleThe COPLE2 Corpus: a Learner Corpus for Portuguesept_PT
dc.typejournal article
dspace.entity.typePublication
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/3599-PPCDT/PEst-OE%2FLIN%2FUI0214%2F2013/PT
oaire.citation.conferencePlacePortorozpt_PT
oaire.citation.endPage3214pt_PT
oaire.citation.startPage3207pt_PT
oaire.citation.titleProceedings of the Tenth Language Resources and Evaluation Conference – LREC’16pt_PT
oaire.fundingStream3599-PPCDT
person.familyNameMendes
person.familyNameGonçalves
person.givenNameAmália
person.givenNameAnabela
person.identifier.ciencia-id4018-7A6F-1873
person.identifier.orcid0000-0001-6815-2674
person.identifier.orcid0000-0003-2161-176X
person.identifier.ridN-7336-2013
person.identifier.scopus-author-id14035817100
person.identifier.scopus-author-id55797224900
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typearticlept_PT
relation.isAuthorOfPublication94be597b-a42a-42f4-8f1d-822fa454b910
relation.isAuthorOfPublication4c0e3a20-eaac-4702-afa0-d62dc134863c
relation.isAuthorOfPublication.latestForDiscovery94be597b-a42a-42f4-8f1d-822fa454b910
relation.isProjectOfPublicationf3d1337e-88ba-480e-b974-1f9d806b7efc
relation.isProjectOfPublication.latestForDiscoveryf3d1337e-88ba-480e-b974-1f9d806b7efc

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
Mendes_et_al_COPLE2_LREC_2016.pdf
Tamanho:
194.88 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.2 KB
Formato:
Item-specific license agreed upon to submission
Descrição: