SweLL is an collection of tools and data (essays) for research on Swedish as a Second Language. There are several corpora in the collection:
The SweLL corpus is pseudonymized, normalized and correction annotated. SpIn, Tisus-texts and SW1203 contain pseudonymized versions of the original essays. More data can eventually be added to the SweLL collection (maybe by you?).
There are two types of users in the SweLL infrastructure: SweLL end-users and SweLL contributors, see below and read on that here: checklist.
To get access to the data and the tools, you need to select which type of access you need. We describe below the two groups. In both cases, please start by reading the Introduction document about the way SweLL data has been handled during the preparation period and how it needs to be handled by the new users.
... are users that are interested in getting access to the data (corpus/corpora and metadata) available through the SweLL infrastructure module. This includes a possibility for Korp searches in L2 data and download of the data in several available formats. To become a SweLL end-user, please, apply here: https://sweclarin.se/eng/application-access-swell-data
Note that the SweLL data contains personal information and therefore each Application for access is personal and does not give the right to spread/share the data with non-approved users.
Following license applies to the SweLL dataset: CLARIN-RES, -PRIV, -ID, -AFFIL, -PERM, -BY. Explanations of the license types can be found here.
Once you have filled in an application and it is approved, you may find useful the following links describing how to work with the data.
- personal data processing (acc to GU data protection regulations)
- manuals and instructional videos
- Korp L2 searches (Korp access)
- SVALA annotation tool - demo version (Korp & Full data access)
- GU-box file download (Full data access)