Logo do repositório
 
Publicação

Towards error annotation in a learner corpus of Portuguese

dc.contributor.authordel Río, Iria
dc.contributor.authorAntunes, Sandra
dc.contributor.authorMendes, Amália
dc.contributor.authorJanssen, Maarten
dc.date.accessioned2018-01-29T16:50:28Z
dc.date.available2018-01-29T16:50:28Z
dc.date.issued2016
dc.description.abstractIn this article, we present COPLE2, a new corpus of Portuguese that encompasses written and spoken data produced by foreign learners of Portuguese as a foreign or second language (FL/L2). Following the trend towards learner corpus research applied to less commonly taught languages, it is our aim to enhance the learning data of Portuguese L2. These data may be useful not only for educational purposes (design of learning materials, curricula, etc.) but also for the development of NLP tools to support students in their learning process. The corpus is available online using TEITOK environment, a web-based framework for corpus treatment that provides several built-in NLP tools and a rich set of functionalities (multiple orthographic transcription layers, lemmatization and POS, normalization of the tokens, error annotation) to automatically process and annotate texts in xml format. A CQP-based search interface allows searching the corpus for different fields, such as words, lemmas, POS tags or error tags. We will describe the work in progress regarding the constitution and linguistic annotation of this corpus, particularly focusing on error annotation.pt_PT
dc.description.versioninfo:eu-repo/semantics/publishedVersionpt_PT
dc.identifier.citationRío, Iria del; Antunes, Sandra; Mendes, Amália & Janssen, Maarten (2016). Towards error annotation in a learner corpus of Portuguese. 5th NLP4CALL and 1st NLP4LA workshop in Sixth Swedish Language Technology Conference (SLTC). Umeå University, Sweden, 17-18 November.pt_PT
dc.identifier.issn1650-3740
dc.identifier.issn1650-3686
dc.identifier.urihttp://hdl.handle.net/10451/31214
dc.language.isoengpt_PT
dc.publisherLinköping University Electronic Presspt_PT
dc.relation.publisherversionhttp://www.ep.liu.se/ecp/130/002/ecp16130002.pdf
dc.titleTowards error annotation in a learner corpus of Portuguesept_PT
dc.typebook part
dspace.entity.typePublication
oaire.awardNumberUID/LIN/00214/2013
oaire.awardURIinfo:eu-repo/grantAgreement/FCT/5876/UID%2FLIN%2F00214%2F2013/PT
oaire.citation.conferencePlaceUmeå University, Swedenpt_PT
oaire.citation.endPage17pt_PT
oaire.citation.startPage8pt_PT
oaire.citation.title5th NLP4CALL and 1st NLP4LA workshop in Sixth Swedish Language Technology Conference (SLTC)pt_PT
oaire.citation.volume130pt_PT
oaire.fundingStream5876
person.familyNamedel Río Gayo
person.familyNameMendes
person.givenNameIria
person.givenNameAmália
person.identifier.ciencia-idB01F-ECC3-7AAB
person.identifier.ciencia-id4018-7A6F-1873
person.identifier.orcid0000-0002-4187-6485
person.identifier.orcid0000-0001-6815-2674
person.identifier.scopus-author-id14035817100
project.funder.identifierhttp://doi.org/10.13039/501100001871
project.funder.nameFundação para a Ciência e a Tecnologia
rcaap.rightsopenAccesspt_PT
rcaap.typebookPartpt_PT
relation.isAuthorOfPublication6248a8c6-c073-47eb-a6f0-30e39bfa5c86
relation.isAuthorOfPublication94be597b-a42a-42f4-8f1d-822fa454b910
relation.isAuthorOfPublication.latestForDiscovery94be597b-a42a-42f4-8f1d-822fa454b910
relation.isProjectOfPublication5b032e24-9bd9-47cd-b7da-2c0849fe148b
relation.isProjectOfPublication.latestForDiscovery5b032e24-9bd9-47cd-b7da-2c0849fe148b

Ficheiros

Principais
A mostrar 1 - 1 de 1
A carregar...
Miniatura
Nome:
ecp16130002.pdf
Tamanho:
446.65 KB
Formato:
Adobe Portable Document Format
Licença
A mostrar 1 - 1 de 1
Miniatura indisponível
Nome:
license.txt
Tamanho:
1.2 KB
Formato:
Item-specific license agreed upon to submission
Descrição: