SweLL-gold corpus is a corpus of essays written by adult learners of Swedish. It was collected during the period of 2017-2020 in the SweLL project, and contains 502 essays that have been pseudonymized, normalized and correction annotated.
Two SweLL-gold versions -- original and corrected
The corpus consists of two versions that are parallel in nature, the original learner-written texts, SweLL-gold-original and corrected versions of these, SweLL-gold-target. Statistics is slightly different between these two versions due to the introduced corrections, which is why we offer download of the statistics separately for SweLL-gold-original and SweLL-gold-target.
To get access to SweLL-gold full texts, fill in an Application for Acess (see under Links).
Links
Cite as
Elena Volodina, Lena Granstedt, Arild Matsson, Beáta Megyesi, Ildikó Pilán, Julia Prentice, Dan Rosén, Lisa Rudebeck, Carl-Johan Schenström, Gunlög Sundberg, Mats Wirén
(2019):
The SweLL Language Learner Corpus: From Design to Annotation, in
Northern European Journal of Language Technology, volume
6, pages
67-104