SweLL-gold corpus is a corpus of essays written by adult learners of Swedish. It was collected during the period of 2017-2020 in the SweLL project, and contains 502 essays that have been pseudonymized, normalized and correction annotated.
Two SweLL-gold versions -- original and corrected
The corpus consists of two versions that are parallel in nature, the original learner-written texts, SweLL-gold-original and corrected versions of these, SweLL-gold-target. Statistics is slightly different between these two versions due to the introduced corrections, which is why we offer download of the statistics separately for SweLL-gold-original and SweLL-gold-target.
To get access to SweLL-gold full texts, fill in an Application for Acess (see under Links).
Links
Cite as
<a href="/om/personal/elena">Elena Volodina</a>, Lena Granstedt, <a href="/en/about/staff/arild">Arild Matsson</a>, Beáta Megyesi, Ildikó Pilán, Julia Prentice, <a href="/om/personal/dan">Dan Rosén</a>, Lisa Rudebeck, <a href="/om/personal/carl-johan">Carl-Johan Schenström</a>, Gunlög Sundberg, Mats Wirén
(2019):
<a href="https://gup.ub.gu.se/publication/285609?lang=en">The SweLL Language Learner Corpus: From Design to Annotation</a>, in
<em>Northern European Journal of Language Technology</em>, volume
<em>6</em>, pages
<em>67-104</em>
<a href="https://spraakbanken.gu.se/en/research/publications/bibtex/285609">
<img src="https://spraakbanken.gu.se/modules/custom/sb_publications/assets/bibtex.png" alt="BibTeX">
</a>