Skip to main content

SweLL - Documentation

Handling of sensitive data

  • Data handling flow (English only): link
  • Consent forms (NOTE! The forms in the other languages should be used only as support, and NOT instead of the Swedish consent form. The Swedish form has been modified at a later stage and is thus not totally identical to the form that was used for translation): Swedish, English, Arabic, BKS (Serbo-Croatian), Dari, Farsi, Greek, North Kurdish, Somali, Sorani, Spanish, Tigrinya
  • Personal metadata form (NOTE! The forms in the other languages should be used only as support, and NOT instead of the Swedish metadata form. The Swedish form has been modified at a later stage and is thus not totally identical to the form that was used for translation): Swedish, English, Arabic, BKS (serbokroatiska), Dari, Farsi (farsi), Greek, North Kurdish, Somali, Sorani, Spanish, Tigrinya
  • Task metadata form (Swedish only): link
  • Teacher instructions & checklists (Swedish only): link

Most frequent immigrant languages

  • Swedish, English, Arabic, BKS, Dari, Farsi, Greek, North Kurdish, Somali, Sorani, Spanish, Tigrinya

Based on

  • Mikael Parkvall Sveriges språk i siffror (Morfem och Språkrådet 2016)
  • Statistik från SCB (2018) Kommunal vuxenutbildning i SFI på Riksnivå

Working on data annotation: Guidelines

  • Transcription guidelines (In English with Swedish examples): link
  • Pseudonymization guidelines (In English with Swedish examples): link
  • Normalization guidelines (In English with Swedish examples): link
  • Correction annotation guidelines (In English with Swedish examples): link

Working on data: Tools and platforms

Manuals

Metadata overview: SweLL-gold corpus

Working with data: Tools and platforms