Språkbanken Text is a part of Språkbanken.

Language resources

On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
Resource Number of tokens Language Access
Somali: Cilmiga Bulshada 1980-89
4,951 Somali
Somali: Cilmiga Bulshada 2001 Somaliland
30,258 Somali
Somali: Cilmiga Bulshada 2001-03 Soomaaliya
48,234 Somali
Somali: Cilmiga Bulshada 2010 Somaliland
11,713 Somali
Somali: Cilmiga Bulshada 2011 Itoobiya
30,124 Somali
Somali: Cilmiga Bulshada 2016 Somaliland
54,498 Somali
Somali: Cilmiga Bulshada 2018 Soomaaliya
42,557 Somali
Somali: Cilmiga Deegaanka 2012 Itoobiya
56,874 Somali
Somali: Golaha Wakiillada Somaliland
539,206 Somali
Somali: Haatuf News 2002
1,495,343 Somali
Somali: Haatuf News 2003
2,359,710 Somali
Somali: Haatuf News 2004
1,813,484 Somali
Somali: Haatuf News 2005
2,003,060 Somali
Somali: Haatuf News 2006
2,125,632 Somali
Somali: Haatuf News 2007
1,758,810 Somali
Somali: Haatuf News 2008
1,286,309 Somali
Somali: Haatuf News 2009
393,199 Somali
Somali: Kitaabka Quduuska Ah
841,187 Somali
Somali: Maaddooyinka Kale 1972–79
14,908 Somali
Somali: Ogaden Online
98,454 Somali
Somali: Qoraallo 1956-1970
14,153 Somali
Somali: Qur’aan
141,555 Somali
Somali: Raadiyaha Denmark 2014
199,173 Somali
Somali: Raadiyaha Iswiidhan 2014
235,911 Somali
Somali: Radio Muqdisho
22,801 Somali
Somali: Saynis 1972–77
112,845 Somali
Somali: Saynis 1980–89
33,034 Somali
Somali: Saynis 1994–96
60,787 Somali
Somali: Saynis 2001 Somaliland
29,988 Somali
Somali: Saynis 2001 Soomaaliya
4,659 Somali
Somali: Saynis 2010 Somaliland
30,471 Somali
Somali: Saynis 2011 Soomaaliya
45,689 Somali
Somali: Saynis 2016 Somaliland
31,196 Somali
Somali: Saynis 2018 Soomaaliya
30,786 Somali
Somali: Sheekooyin Carruureed
26,003 Somali
Somali: Sheekooyin Carruureed (Turjuman)
13,865 Somali
Somali: Sheekooyin Gaagaaban
180,852 Somali
Somali: Somali Faces
51,440 Somali
Somali: Suugaan
156,288 Somali
Somali: Suugaan (Turjuman)
8,796 Somali
Somali: Suugaan 2
2,827,328 Somali
Somali: Taariikh iyo Dhaqan (Turjuman)
35,479 Somali
Somali: Warbixin Ku Saabsan Iswiidhan
59,823 Somali
Somali: Warbixin Ku Saabsan Kanada
24,039 Somali
Somali: Wardheer News
499,037 Somali
Somali: Xeerar Somaliland
450,142 Somali
Somali: Xisaab 1971-79
1,875 Somali
Somali: Xisaab 1994-97
713 Somali
Somali: Xisaab 2001 Somaliland
32,676 Somali
Somali: Xisaab 2001 Soomaaliya
50,361 Somali
Somali: Xisaab 2011 Itoobiya
43,977 Somali
Somali: Xisaab 2016 Somaliland
41,922 Somali
Somali: Xisaab 2018 Soomaaliya
35,262 Somali
SpIn v1
256 essays collected from a Language Introduction course (mid-term exams) for newly arrived refugees. Some students appear more than once.
46,911 Swedish
Språkprov SO 2009
The just over 94,000 language examples are taken from Svensk ordbok, published by the Swedish Academy (Svenska Akademien) in 2009. The examples serve to support the dictionary definitions and to give information about the phraseology of the headwords. For access, contact Emma Sköldberg (emma.skoldberg@svenska.gu.se).
541,568 Swedish
SUC 2.0
Stockholm-Umeå corpus 2.0
1,166,593 Swedish
SUC 3.0
Stockholm-Umeå corpus 3.0
1,166,593 Swedish
SUC Novels (StorSUC)
Stockholm-Umeå corpus
4,651,200 Swedish
SUCX 2.0
Stockholm-Umeå corpus 2.0 scrambled
1,166,593 Swedish
SUCX 3.0
Stockholm-Umeå corpus 3.0 scrambled
1,166,593 Swedish
Collection
SuperLim 2
A standardized suite for evaluation and analysis of Swedish natural language understanding systems.
Swedish
SuperSim (repackaged for Superlim) 2.0
A dataset for word similarity and relatedness in Swedish
Swedish
sv-COVID-19
A compilation of various articles related to the COVID-19 pandemic
8,130,201 Swedish
Svensk Tidskrift
27 annual volumes of the conservative journal Svensk Tidskrift, from 1891 to 1940
7,202,567 Swedish
Collection
SVT news
News texts from svt.se
Swedish
SVT news 2004
News texts from svt.se
447,189 Swedish
SVT news 2005
News texts from svt.se
3,300,646 Swedish
SVT news 2006
News texts from svt.se
4,172,111 Swedish
SVT news 2007
News texts from svt.se
5,533,682 Swedish
SVT news 2008
News texts from svt.se
7,693,570 Swedish
SVT news 2009
News texts from svt.se
8,860,985 Swedish
SVT news 2010
News texts from svt.se
9,873,332 Swedish
SVT news 2011
News texts from svt.se
9,327,078 Swedish
SVT news 2012
News texts from svt.se
9,544,671 Swedish
SVT news 2013
News texts from svt.se
13,961,829 Swedish
SVT news 2014
News texts from svt.se
16,077,222 Swedish
SVT news 2015
News texts from svt.se
19,205,040 Swedish
SVT news 2016
News texts from svt.se
21,729,542 Swedish
SVT news 2017
News texts from svt.se
21,184,642 Swedish
SVT news 2018
News texts from svt.se
18,817,638 Swedish
SVT news 2019
News texts from svt.se
18,274,785 Swedish
SVT news 2020
News texts from svt.se
16,025,766 Swedish
SVT news 2021
News texts from svt.se
14,978,995 Swedish
SVT news 2022
News texts from svt.se
13,996,419 Swedish
SVT news 2023
News texts from svt.se
7,501,502 Swedish
SVT news unknown date
News texts from svt.se
36,783 Swedish
SW1203-essays
Essays written by L2 Swedish language learners, university courses
51,972 Swedish
Swe-NERC
A resource for training and evaluation of Named Entity Recognition for Swedish.
140,914 Swedish
SweDiagnostics
Swedish version of (Super)GLUE Diagnostic
Swedish
Swedish ABSAbank
An annotated Swedish corpus for aspect-based sentiment analysis
1,574,226 Swedish
Swedish ABSAbank-Imm 1.1
An annotated Swedish corpus for aspect-based sentiment analysis (a version of ABSAbank)
Swedish
Swedish analogy 2.0
Swedish semantic and syntactic similarity
Swedish
Swedish Bible 1873
Swedish translation of the Bible from 1873
811,321 Swedish
Swedish Bible 1917
Official Swedish translation of the Bible from 1917
894,720 Swedish
Swedish Code of Statutes
Swedish Code of Statutes 1880-01-01 – 2023-12-15
19,748,312 Swedish
Swedish EAT: question classification
A translated version of the QAQC dataset for expected-answer-type classification.
Swedish
Swedish fraktur 1626-1816
A selection of fraktur texts printed between 1626 and 1816 from the collections of the University Library of the University of Gothenburg (UB). For OCR analysis.
47,924 Swedish
Swedish framenet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This part of the resource contains the corpus examples, automatically enriched with linguistic information.
137,770 Swedish
Swedish newspapers 1818-1870
A selection of Swedish newspapers printed between 1818 and 1870 from the collections of Kungliga biblioteket (KB). For OCR analysis.
186,013 Swedish
Swedish newspapers 1871-1906
A selection of Swedish newspapers printed between 1871 and 1906 from the collections of Kungliga biblioteket (KB). For OCR analysis.
337,635 Swedish