SweLL-pilot

Standard reference

Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg, Monica Sandell (2016): SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies, in Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC 2016), May 23-28, 2016, Portorož, Slovenia.

Data citation

Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg, & Monica Sandell (2016). SweLL-pilot (updated: 2016-01-01). [Data set]. Språkbanken Text. https://doi.org/10.23695/d3bg-8g24

Essays written by adult learners of Swedish, annotated with CEFR levels (a European scale of proficiency levels in language learning). The essays were collected during the period 2006-2015.

Annotation

The essays were graded manually by teachers on the CEFR scale. They are also linguistically annotated (POS tagging, lemmatization, dependency annotation) using Sparv.

Caveats

Data collection is limited to a small geographical area and a short period of time. Although several language backgrounds are represented, the corpus is very unbalanced in this respect and is consequently not well suited for native language identification tasks. The corpus consists of three subcorpora, each coming from a different source, with different proficiency levels and different collection periods. While the three subcorpora can be used together, care should be taken to ensure that artifacts from these differences do not leak into models.
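One way to check for the leakage risk described above is to look at how CEFR levels are distributed across the subcorpora, and to evaluate models with one subcorpus held out at a time. The following is a minimal sketch only: the record layout (the `subcorpus` and `cefr` keys), the subcorpus names, and the toy records are assumptions made for illustration and do not reflect the actual SweLL-pilot file format or contents.

```python
from collections import Counter, defaultdict


def level_counts_by_subcorpus(essays):
    """Count CEFR labels per subcorpus to expose source/level confounds."""
    counts = defaultdict(Counter)
    for essay in essays:
        counts[essay["subcorpus"]][essay["cefr"]] += 1
    return {name: dict(levels) for name, levels in counts.items()}


def leave_one_subcorpus_out(essays):
    """Yield (held_out_name, train, test), holding out one subcorpus per fold."""
    names = sorted({essay["subcorpus"] for essay in essays})
    for held_out in names:
        train = [e for e in essays if e["subcorpus"] != held_out]
        test = [e for e in essays if e["subcorpus"] == held_out]
        yield held_out, train, test


# Invented toy records; field names and values are hypothetical.
essays = [
    {"subcorpus": "source_a", "cefr": "A1"},
    {"subcorpus": "source_a", "cefr": "A2"},
    {"subcorpus": "source_b", "cefr": "B1"},
    {"subcorpus": "source_b", "cefr": "B1"},
    {"subcorpus": "source_c", "cefr": "C1"},
]

# If a level occurs in only one subcorpus, a model may learn the source, not the level.
for name, levels in level_counts_by_subcorpus(essays).items():
    print(name, levels)

for held_out, train, test in leave_one_subcorpus_out(essays):
    print(held_out, len(train), len(test))
```

If performance drops sharply on a held-out subcorpus, that suggests the model relies on source-specific artifacts rather than on proficiency-related features.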

Intended use

Automated grading using the CEFR scale, anonymization, second language acquisition studies

Available via

Access Platform License
CLARIN-ID, -PRIV, -NORED, -BY (https://www.kielipankki.fi/support/clarin-eula/#res)

Part of collection

Type

  • Corpus
  • Collection

Language

Swedish

Size

Resources: 3

Keywords

  • L2
  • language learning
  • essays

Created by

  • Elena Volodina
  • Ildikó Pilán
  • Ingegerd Enström
  • Lorena Llozhi
  • Peter Lundkvist
  • Gunlög Sundberg
  • Monica Sandell

Updated

2016-01-01

Contact

sb-info@svenska.gu.se