Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1369)
Collections (31)
Corpora (1224)
Lexicons (70)
Training and evaluation data (27)
Models (48)
Resource
Number of tokens
Language
Access
SpIn
44,996
Swedish
Word statistics:
stats_SPIN-SOURCE.txt.zip
2025-04-22 – 53.36 KB – CC BY 4.0
Explore in:
SpIn v1
256 essays collected from mid-term exams in a Language Introduction course for newly arrived refugees. Some students contributed more than one essay.
46,911
Swedish
Explore in:
Språkprov SO 2009
The just over 94,000 language examples are taken from Svensk ordbok, published by the Swedish Academy (Svenska Akademien) in 2009. The examples serve to support the dictionary definitions and to provide information about the phraseology of the headwords.
541,568
Swedish
Explore in:
SUC 2.0
Stockholm-Umeå corpus 2.0
1,166,593
Swedish
Word statistics:
stats_SUC2.txt.zip
2025-04-22 – 1.34 MB – CC BY 4.0
SUC 3.0
Stockholm-Umeå corpus 3.0
1,166,593
Swedish
Dataset:
suc3.xml.bz2
2024-06-03 – 84.44 MB – CC BY 4.0
Word statistics:
stats_suc3.csv.zip
2025-04-22 – 1.43 MB – CC BY 4.0
Explore in:
SUC Novels (StorSUC)
Stockholm-Umeå corpus
4,651,200
Swedish
Dataset:
storsuc.xml.bz2
2017-04-26 – 68.28 MB – CC BY 4.0
Word statistics:
stats_STORSUC.txt.zip
2025-04-22 – 2.25 MB – CC BY 4.0
Explore in:
SUCX 2.0
Stockholm-Umeå corpus 2.0 scrambled
1,166,593
Swedish
Dataset:
suc2.xml.bz2
2017-05-19 – 17.68 MB – CC BY-SA 4.0
Word statistics:
stats_SUC2.txt.zip
2025-04-22 – 1.34 MB – CC BY-SA 4.0
Explore in:
SUCX 3.0
Stockholm-Umeå corpus 3.0 scrambled
1,166,593
Swedish
Dataset:
suc3.xml.bz2
2024-06-03 – 84.44 MB – CC BY-SA 4.0
Word statistics:
stats_suc3.csv.zip
2025-04-22 – 1.43 MB – CC BY 4.0
Explore in:
Collection
SuperLim 2
A standardized suite for evaluation and analysis of Swedish natural language understanding systems.
Swedish
Dataset:
SuperLim-2-2.0.4.zip
2024-01-25 – 156.63 MB – CC BY 4.0
Dataset:
SuperLim_maintenance.odt
2024-01-25 – 16.96 KB
SuperSim (repackaged for Superlim) 2.0
A dataset for word similarity and relatedness in Swedish
Swedish
Dataset:
supersim-superlim.zip
2023-03-30 – 70.45 KB – CC BY 4.0
sv-COVID-19
A compilation of various articles related to the COVID-19 pandemic
8,130,201
Swedish
Dataset:
sv-covid-19.xml.bz2
2025-02-20 – 216.31 MB – CC BY 4.0
Word statistics:
stats_sv-covid-19.csv.zip
2025-04-22 – 2.45 MB – CC BY 4.0
Explore in:
Svensk Tidskrift
27 annual volumes of the conservative journal Svensk Tidskrift, from 1891 to 1940
7,202,567
Swedish
Dataset:
runeberg-svtidskr.xml.bz2
2014-12-08 – 93.06 MB – CC BY 4.0
Word statistics:
stats_RUNEBERG-SVTIDSKR.txt.zip
2025-04-22 – 4.24 MB – CC BY 4.0
Explore in:
Collection
SVT news
News texts from svt.se
Swedish
See 21 collected resources
Explore in:
SVT news 2004
News texts from svt.se
447,189
Swedish
Dataset:
svt-2004.xml.bz2
2022-12-06 – 12.54 MB – CC BY 4.0
Word statistics:
stats_svt-2004.csv.zip
2025-04-22 – 2.17 MB – CC BY 4.0
Explore in:
SVT news 2005
News texts from svt.se
3,300,646
Swedish
Dataset:
svt-2005.xml.bz2
2022-12-06 – 94.29 MB – CC BY 4.0
Word statistics:
stats_svt-2005.csv.zip
2025-04-22 – 15.38 MB – CC BY 4.0
Explore in:
SVT news 2006
News texts from svt.se
4,172,111
Swedish
Dataset:
svt-2006.xml.bz2
2022-12-06 – 120.32 MB – CC BY 4.0
Word statistics:
stats_svt-2006.csv.zip
2025-04-22 – 18.43 MB – CC BY 4.0
Explore in:
SVT news 2007
News texts from svt.se
5,533,682
Swedish
Dataset:
svt-2007.xml.bz2
2022-12-06 – 159.96 MB – CC BY 4.0
Word statistics:
stats_svt-2007.csv.zip
2025-04-22 – 22.97 MB – CC BY 4.0
Explore in:
SVT news 2008
News texts from svt.se
7,693,570
Swedish
Dataset:
svt-2008.xml.bz2
2022-12-06 – 221.24 MB – CC BY 4.0
Word statistics:
stats_svt-2008.csv.zip
2025-04-22 – 29.19 MB – CC BY 4.0
Explore in:
SVT news 2009
News texts from svt.se
8,860,985
Swedish
Dataset:
svt-2009.xml.bz2
2022-12-06 – 254.45 MB – CC BY 4.0
Word statistics:
stats_svt-2009.csv.zip
2025-04-22 – 32.25 MB – CC BY 4.0
Explore in:
SVT news 2010
News texts from svt.se
9,873,332
Swedish
Dataset:
svt-2010.xml.bz2
2022-12-06 – 284.46 MB – CC BY 4.0
Word statistics:
stats_svt-2010.csv.zip
2025-04-22 – 34.85 MB – CC BY 4.0
Explore in:
SVT news 2011
News texts from svt.se
9,327,078
Swedish
Dataset:
svt-2011.xml.bz2
2022-12-06 – 268.69 MB – CC BY 4.0
Word statistics:
stats_svt-2011.csv.zip
2025-04-22 – 33.23 MB – CC BY 4.0
Explore in:
SVT news 2012
News texts from svt.se
9,544,671
Swedish
Dataset:
svt-2012.xml.bz2
2022-12-06 – 273.87 MB – CC BY 4.0
Word statistics:
stats_svt-2012.csv.zip
2025-04-22 – 32.61 MB – CC BY 4.0
Explore in:
SVT news 2013
News texts from svt.se
13,961,829
Swedish
Dataset:
svt-2013.xml.bz2
2022-12-06 – 397.91 MB – CC BY 4.0
Word statistics:
stats_svt-2013.csv.zip
2025-04-22 – 43.4 MB – CC BY 4.0
Explore in:
SVT news 2014
News texts from svt.se
16,077,222
Swedish
Dataset:
svt-2014.xml.bz2
2022-12-07 – 454.63 MB – CC BY 4.0
Word statistics:
stats_svt-2014.csv.zip
2025-04-22 – 47.99 MB – CC BY 4.0
Explore in:
SVT news 2015
News texts from svt.se
19,205,040
Swedish
Dataset:
svt-2015.xml.bz2
2022-12-07 – 539.73 MB – CC BY 4.0
Word statistics:
stats_svt-2015.csv.zip
2025-04-22 – 54.12 MB – CC BY 4.0
Explore in:
SVT news 2016
News texts from svt.se
21,729,542
Swedish
Dataset:
svt-2016.xml.bz2
2022-12-07 – 613.63 MB – CC BY 4.0
Word statistics:
stats_svt-2016.csv.zip
2025-04-22 – 58.8 MB – CC BY 4.0
Explore in:
SVT news 2017
News texts from svt.se
21,184,642
Swedish
Dataset:
svt-2017.xml.bz2
2022-12-07 – 601.37 MB – CC BY 4.0
Word statistics:
stats_svt-2017.csv.zip
2025-04-22 – 56.78 MB – CC BY 4.0
Explore in:
SVT news 2018
News texts from svt.se
18,817,638
Swedish
Dataset:
svt-2018.xml.bz2
2022-12-07 – 533.68 MB – CC BY 4.0
Word statistics:
stats_svt-2018.csv.zip
2025-04-22 – 52.65 MB – CC BY 4.0
Explore in:
SVT news 2019
News texts from svt.se
18,274,785
Swedish
Dataset:
svt-2019.xml.bz2
2022-12-07 – 515.99 MB – CC BY 4.0
Word statistics:
stats_svt-2019.csv.zip
2025-04-22 – 51.21 MB – CC BY 4.0
Explore in:
SVT news 2020
News texts from svt.se
16,025,766
Swedish
Dataset:
svt-2020.xml.bz2
2022-12-07 – 453.02 MB – CC BY 4.0
Word statistics:
stats_svt-2020.csv.zip
2025-04-22 – 45.42 MB – CC BY 4.0
Explore in:
SVT news 2021
News texts from svt.se
14,978,995
Swedish
Dataset:
svt-2021.xml.bz2
2022-12-07 – 424.19 MB – CC BY 4.0
Word statistics:
stats_svt-2021.csv.zip
2025-04-22 – 43.7 MB – CC BY 4.0
Explore in:
SVT news 2022
News texts from svt.se
13,996,419
Swedish
Dataset:
svt-2022.xml.bz2
2023-08-30 – 395.67 MB – CC BY 4.0
Word statistics:
stats_svt-2022.csv.zip
2025-04-22 – 16.01 MB – CC BY 4.0
Explore in:
SVT news 2023
News texts from svt.se
7,501,502
Swedish
Dataset:
svt-2023.xml.bz2
2023-08-29 – 211.47 MB – CC BY 4.0
Word statistics:
stats_svt-2023.csv.zip
2025-04-22 – 9.7 MB – CC BY 4.0
Explore in:
SVT news unknown date
News texts from svt.se
36,783
Swedish
Dataset:
svt-nodate.xml.bz2
2023-02-08 – 862.74 KB – CC BY 4.0
Word statistics:
stats_svt-nodate.csv.zip
2025-04-22 – 105.07 KB – CC BY 4.0
Explore in:
SW1203 v1
52,518
Swedish
Word statistics:
stats_SW1203V1.txt.zip
2025-04-22 – 69.06 KB – CC BY 4.0
Explore in:
SW1203-essays
Essays written by L2 Swedish language learners, university courses
51,972
Swedish
Word statistics:
stats_SW1203.txt.zip
2025-04-22 – 71.13 KB – CC BY 4.0
Explore in:
SW1203-uppsatser version 2
51,956
Swedish
Word statistics:
stats_SW1203V2.txt.zip
2025-04-22 – 70.02 KB – CC BY 4.0
Explore in:
Swe-NERC
A resource for training and evaluation of Named Entity Recognition for Swedish.
140,914
Swedish
Dataset:
Swe-NERC-v1.0.tar.gz
2024-03-05 – 5.74 MB – CC BY 4.0
SweDiagnostics
Swedish version of (Super)GLUE Diagnostic
Swedish
Dataset:
swediagnostics.zip
2023-04-04 – 72.89 KB – CC BY 4.0
Swedish ABSAbank
An annotated Swedish corpus for aspect-based sentiment analysis
1,574,226
Swedish
Dataset:
swe-absa-bank.zip
2020-03-04 – 128.55 MB – CC BY 4.0
Dataset:
absabankimm-combined.zip
2023-02-20 – 15.87 MB – CC BY 4.0
Swedish ABSAbank-Imm 1.1
An annotated Swedish corpus for aspect-based sentiment analysis (a version of Absabank)
Swedish
Dataset:
absabank-imm.zip
2023-03-30 – 1.03 MB – CC BY 4.0
Swedish analogy 2.0
Swedish semantic and syntactic similarity
Swedish
Dataset:
sweanalogy.zip
2023-03-30 – 178.63 KB – CC BY 4.0
Swedish Bible 1873
Swedish translation of the Bible from 1873
811,321
Swedish
Dataset:
bibel1873dalin.xml.bz2
2015-05-20 – 5.84 MB – CC BY 4.0
Word statistics:
stats_BIBEL1873DALIN.txt.zip
2025-04-22 – 306.18 KB – CC BY 4.0
Explore in:
Swedish Bible 1917
Official Swedish translation of the Bible from 1917
894,720
Swedish
Dataset:
bibel1917.xml.bz2
2015-05-19 – 7.5 MB – CC BY 4.0
Word statistics:
stats_BIBEL1917.txt.zip
2025-04-22 – 359.83 KB – CC BY 4.0
Explore in:
Swedish Code of Statutes
Swedish Code of Statutes 1880-01-01 – 2023-12-15
19,748,312
Swedish
Dataset:
sfs.xml.bz2
2024-05-13 – 325.85 MB – CC BY 4.0
Word statistics:
stats_sfs.csv.zip
2025-04-22 – 3.04 MB – CC BY 4.0
Explore in:
Swedish EAT: question classification
A translated version of the QAQC dataset for expected-answer-type classification.
Swedish
Dataset:
swe_qaqc_train.csv
2023-06-08 – 361.34 KB – CC BY 4.0
Dataset:
Swedish_EAT_v1.0.tsv
2023-06-08 – 2.05 KB – CC BY 4.0
Swedish fraktur 1626-1816
A selection of fraktur texts printed between 1626 and 1816 from the collections of the University Library of the University of Gothenburg (UB). For OCR analysis.
47,924
Swedish
Dataset:
svensk-fraktur-1626-1816.tar.gz
2021-11-26 – 757.73 MB – CC BY 4.0
Swedish framenet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This part of the resource contains the corpus examples, automatically enriched with linguistic information.
137,770
Swedish
Dataset:
swefn-ex.xml.bz2
2021-11-25 – 3.62 MB – CC BY 4.0
Word statistics:
stats_swefn-ex.csv.zip
2025-04-22 – 465.83 KB – CC BY 4.0
Explore in:
Swedish newspapers 1818-1870
A selection of Swedish newspapers printed between 1818 and 1870 from the collections of Kungliga biblioteket (KB). For OCR analysis.
186,013
Swedish
Dataset:
svenska-tidningar-1818-1870.tar.gz
2020-05-26 – 458.22 MB – CC BY 4.0
Swedish newspapers 1871-1906
A selection of Swedish newspapers printed between 1871 and 1906 from the collections of Kungliga biblioteket (KB). For OCR analysis.
337,635
Swedish
Dataset:
svenska-tidningar-1871-1906.tar.gz
2022-05-03 – 831.74 MB – CC BY 4.0
Swedish party programs and election manifestos
Swedish political party programs and election manifestos 1887–2024
2,234,400
Swedish
Dataset:
vivill.xml.bz2
2024-06-10 – 165.57 MB – CC BY 4.0
Word statistics:
stats_vivill.csv.zip
2025-04-22 – 1.16 MB – CC BY 4.0
Explore in:
Swedish Prose Fiction 1800–1900
All Swedish fiction published for the first time during the years 1800, 1820, 1840, 1860, 1880 and 1900
16,275,130
Swedish
Dataset:
spf.xml.bz2
2017-05-19 – 231.69 MB – CC BY 4.0
Word statistics:
stats_SPF.txt.zip
2025-04-22 – 3.68 MB – CC BY 4.0
Explore in:
Swedish treebank
A Swedish treebank built from recycled language resources
Swedish
Swedish Twitter 2015
Material collected from a selection of Swedish-speaking Twitter users in 2015
412,663,140
Swedish
Word statistics:
stats_TWITTER-2015.txt.zip
2025-04-22 – 145.29 MB – CC BY 4.0
Explore in:
Swedish Twitter 2016
Material collected from a selection of Swedish-speaking Twitter users in 2016
694,515,420
Swedish
Word statistics:
stats_TWITTER-2016.txt.zip
2025-04-22 – 185.83 MB – CC BY 4.0
Explore in:
Swedish Twitter 2017
Material collected from a selection of Swedish-speaking Twitter users in 2017
505,017,012
Swedish
Word statistics:
stats_TWITTER-2017.txt.zip
2025-04-22 – 149.19 MB – CC BY 4.0
Explore in:
Swedish Wikipedia
Corpus of Swedish Wikipedia
190,149,497
Swedish
Dataset:
wikipedia-sv.xml.bz2
2023-05-12 – 3.59 GB – CC BY 4.0
Word statistics:
stats_wikipedia-sv.csv.zip
2025-04-22 – 196.44 MB – CC BY 4.0
Explore in:
SweDN 1.0
A Swedish text summarization corpus
Swedish
SweFAQ 2.0
Frequently asked questions from Swedish authorities' websites with shuffled answers
Swedish
Dataset:
swefaq.zip
2023-03-30 – 89.81 MB – CC BY 4.0
SweFraCas 1.0
Textual inference/entailment problem set
Swedish
Dataset:
swefracas.tsv
2021-06-10 – 100.92 KB – CC BY 4.0
Dataset:
swefracas_documentation_sheet.tsv
2021-06-15 – 4.23 KB – CC BY 4.0
Collection
SweLL
SweLL (Swedish Learner Language) is a collection of SweLL corpora and resources derived from them. The SweLL corpora consist of texts written by learners whose mother tongue is not Swedish. All texts were collected in test situations; none come from tasks written at home.
Swedish, Multiple languages
See 10 collected resources
SweLL-gold
Essays written by adult learners of Swedish, manually pseudonymized and correction annotated. The corpus contains both the original learner text and a corrected version of each essay. Collection period 2017-2020.
Swedish
Word statistics:
stats_SWELLV1-ORIGINAL.txt.zip
2025-04-22 – 147.52 KB – CC BY 4.0
Word statistics:
stats_SWELLV1-TARGET.txt.zip
2025-04-22 – 132.13 KB – CC BY 4.0
Explore in:
Collection
SweLL-pilot
Essays written by adult learners of Swedish, manually labeled with the CEFR levels (a European scale of language proficiency levels within language learning). Collection period 2006-2015.
Swedish
See 3 collected resources
Explore in:
SweNLI 1.0
A Swedish NLI dataset
Swedish
Dataset:
swenli.zip
2023-03-30 – 55.13 MB – CC BY 4.0
SweParaphrase 2.0
Semantic Textual Similarity reference data (STS Benchmark).
Swedish
Dataset:
sweparaphrase.zip
2023-03-30 – 750.9 KB – CC BY 4.0
SweWiC 2.0
A Swedish Word-in-Context dataset
Swedish
Dataset:
swewic.zip
2023-03-30 – 587.65 KB – CC BY 4.0
SweWinogender 2.0
A Swedish dataset for coreference and gender bias
Swedish
Dataset:
swewinogender.zip
2023-03-30 – 28.3 KB – CC BY 4.0
SweWinograd 2.0
A Swedish dataset for pronoun resolution
Swedish
Dataset:
swewinograd.zip
2023-03-30 – 33.41 KB – CC BY 4.0
Syntag treebank
A Swedish treebank with syntactic analysis of 158 articles from Press-65.
101,329
Swedish
Dataset:
syntag.txt
2010-02-08 – 4.45 MB – CC BY 4.0
Dataset:
syntag.html
2010-05-24 – 10.15 MB – CC BY 4.0
Sæmundaredda
Ancient Icelandic poetry collection also known as The King's Book
46,726
Old Norse
Dataset:
eddan.xml.bz2
2015-01-21 – 87.55 KB – CC BY 4.0
Word statistics:
stats_EDDAN.txt.zip
2025-04-22 – 40.11 KB – CC BY 4.0
Explore in:
TalbankenSBX
Talbanken is a Swedish treebank. This is the Språkbanken Text version of Talbanken.
96,346
Swedish
Dataset:
talbanken.xml.bz2
2017-06-07 – 1.54 MB – CC BY 4.0
Word statistics:
stats_TALBANKEN.txt.zip
2025-04-22 – 206.82 KB – CC BY 4.0
Dataset:
changelog.txt
2020-06-11 – 316 bytes – CC BY 4.0
Dataset:
TalbankenSBX_morphsplit20200610.zip
2020-06-11 – 3.64 MB – CC BY 4.0
Dataset:
TalbankenSBX_syntsplit20200610.zip
2020-06-11 – 807.09 KB – CC BY 4.0
Explore in:
TalbankenSTB
Talbanken is a Swedish treebank.
96,346
Swedish
Dataset:
TalbankenSTB.zip
2020-08-11 – 2.6 MB – CC BY 4.0
Dataset:
TalbankenSTB_README.txt
2020-08-11 – 1.05 KB – CC BY 4.0
Dataset:
TalbankenSTB_documentation.zip
2020-08-11 – 62.23 KB – CC BY 4.0
Dataset:
TalbankenSTB_datasplit.zip
2020-08-11 – 2.6 MB – CC BY 4.0
Dataset:
TalbankenSTB_original_parts.zip
2020-08-11 – 2.95 MB – CC BY 4.0
The Arabic E-Book Corpus
A collection of 1,745 books in Arabic.
76,486,597
Arabic
Dataset:
arabic-ebooks.xml.bz2
2025-09-12 – 142.88 MB – CC BY 4.0
Explore in:
The English-Swedish Parallel Corpus (ESPC)
ESPC is a combined comparable and parallel corpus suitable for different types of cross-language research.
1,518,759
Swedish, English
Explore in:
The noise of the miners in Falun 1743 – Court records
Transcribed and annotated protocols from trials against 79 miners in Falun in 1743. The transcribed texts are partly annotated with place and person names, titles and definitions of archaic words mainly belonging to the mining trade.
156,087
Swedish
Dataset:
gruvdrangarna.xml.bz2
2025-10-17 – 1.96 MB – CC BY 4.0
Word statistics:
stats_gruvdrangarna.csv.zip
2025-10-17 – 175.22 KB – CC BY 4.0
Explore in:
The Riksdag's open data - Debates
Debates from the Swedish parliament in the period 1993/94-2017/18
121,987,537
Swedish
Dataset:
rd-anf-1993-2018.xml.bz2
2020-03-30 – 2.22 GB – CC BY 4.0
Word statistics:
stats_RD-ANF-1993-2018.txt.zip
2025-04-22 – 8.63 MB – CC BY 4.0
The Swedish Culturomics Gigaword Corpus
One billion Swedish words from 1950 and onwards. Code to extract data from the corpus, as well as usage instructions, can be downloaded from https://svn.spraakbanken.gu.se/sb-arkiv/tools/gigaword/
1,015,635,151
Swedish
Dataset:
gigaword-1950-59.tar
2016-06-07 – 92.69 MB – CC BY 4.0
Dataset:
gigaword-1960-69.tar
2016-06-07 – 107.78 MB – CC BY 4.0
Dataset:
gigaword-1970-79.tar
2016-06-07 – 175.03 MB – CC BY 4.0
Dataset:
gigaword-1980-89.tar
2016-06-07 – 217.9 MB – CC BY 4.0
Dataset:
gigaword-1990-99.tar
2016-06-07 – 1.05 GB – CC BY 4.0
Dataset:
gigaword-2000-09.tar
2016-06-07 – 5.48 GB – CC BY 4.0
Dataset:
gigaword-2010-15.tar
2016-06-07 – 4.32 GB – CC BY 4.0
The Swedish Literature Bank: Free Works
E-texts and searchable facsimiles from the Swedish Literature Bank (litteraturbanken.se)
344,688,445
Swedish
Dataset:
lb-open.xml.bz2
2023-11-13 – 5.75 GB – CC BY 4.0
Word statistics:
stats_lb-open.csv.zip
2025-04-22 – 43.69 MB – CC BY 4.0
Explore in:
The Swedish Literature Bank: Restricted Works
E-texts and searchable facsimiles from the Swedish Literature Bank (litteraturbanken.se)
128,261,903
Swedish
Dataset:
lb-restricted.xml.bz2
2023-10-28 – 2.25 GB – CC BY 4.0
Word statistics:
stats_lb-restricted.csv.zip
2025-04-22 – 26.24 MB – CC BY 4.0
Explore in:
Tiden
30 annual volumes of the socialist journal Tiden, 1909–1940
7,106,662
Swedish
Dataset:
runeberg-tiden.xml.bz2
2014-12-08 – 89.33 MB – CC BY 4.0
Word statistics:
stats_RUNEBERG-TIDEN.txt.zip
2025-04-22 – 4.11 MB – CC BY 4.0
Explore in:
TISUS texts
Essays written by L2 Swedish learners as part of a TISUS exam
59,639
Swedish
Explore in:
TISUS v1
60,632
Swedish
Word statistics:
stats_TISUSV1.txt.zip
2025-04-22 – 77.62 KB – CC BY 4.0
Explore in:
TISUS-texter v2
60,036
Swedish
Word statistics:
stats_TISUSV2.txt.zip
2025-04-22 – 77.18 KB – CC BY 4.0
Explore in:
Twitter Mix
Material from a selection of Swedish Twitter users. Updated regularly.
499,986,353
Swedish
Word statistics:
stats_twitter.csv.zip
2025-04-22 – 218.25 MB – CC BY 4.0
Explore in:
Twitter: Party Leader Debate June 2013
Material from Twitter, collected during the party leader debate on June 12th 2013 and a few days before and after
38,959,102
Swedish
Word statistics:
stats_TWITTER-PLDEBATT-130612.txt.zip
2025-04-22 – 17.53 MB – CC BY 4.0
Explore in:
Twitter: Party Leader Debate May 2014
Material from Twitter, collected during the party leader debate on May 4th 2014 and a few days before and after
34,228,521
Swedish
Word statistics:
stats_TWITTER-PLDEBATT-140504.txt.zip
2025-04-22 – 13.81 MB – CC BY 4.0
Explore in:
Twitter: Party Leader Debate October 2013
Material from Twitter, collected during the party leader debate on October 6th 2013 and a few days before and after
25,736,586
Swedish
Word statistics:
stats_TWITTER-PLDEBATT-131006.txt.zip
2025-04-22 – 11.73 MB – CC BY 4.0
Explore in:
Ur Dagens Krönika
Eight annual volumes of the cultural journal Ur Dagens Krönika, 1881–1890
1,995,149
Swedish
Dataset:
runeberg-urdagkron.xml.bz2
2014-12-08 – 24.43 MB – CC BY 4.0
Word statistics:
stats_RUNEBERG-URDAGKRON.txt.zip
2025-04-22 – 1.95 MB – CC BY 4.0
Explore in:
Collection
Web News
News from Swedish newspapers' websites
Swedish
See 13 collected resources
Explore in:
Web News 2001
News from Swedish newspapers' websites
614,151
Swedish
Dataset:
webbnyheter2001.xml.bz2
2024-01-04 – 17.13 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2001.csv.zip
2025-04-22 – 929.87 KB – CC BY 4.0
Explore in:
Web News 2002
News from Swedish newspapers' websites
17,426,173
Swedish
Dataset:
webbnyheter2002.xml.bz2
2022-11-30 – 506.49 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2002.csv.zip
2025-04-22 – 8.19 MB – CC BY 4.0
Explore in:
Web News 2003
News from Swedish newspapers' websites
12,217,288
Swedish
Dataset:
webbnyheter2003.xml.bz2
2022-11-30 – 357.9 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2003.csv.zip
2025-04-22 – 6.44 MB – CC BY 4.0
Explore in:
Web News 2004
News from Swedish newspapers' websites
13,806,323
Swedish
Dataset:
webbnyheter2004.xml.bz2
2022-11-30 – 403.31 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2004.csv.zip
2025-04-22 – 6.68 MB – CC BY 4.0
Explore in:
Web News 2005
News from Swedish newspapers' websites
29,503,647
Swedish
Dataset:
webbnyheter2005.xml.bz2
2024-01-05 – 849.81 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2005.csv.zip
2025-04-22 – 7.6 MB – CC BY 4.0
Explore in:
Web News 2006
News from Swedish newspapers' websites
22,563,792
Swedish
Dataset:
webbnyheter2006.xml.bz2
2022-12-01 – 654.61 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2006.csv.zip
2025-04-22 – 9.77 MB – CC BY 4.0
Explore in:
Web News 2007
News from Swedish newspapers' websites
24,630,443
Swedish
Dataset:
webbnyheter2007.xml.bz2
2022-12-01 – 715.52 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2007.csv.zip
2025-04-22 – 10.25 MB – CC BY 4.0
Explore in:
Web News 2008
News from Swedish newspapers' websites
27,561,804
Swedish
Dataset:
webbnyheter2008.xml.bz2
2022-12-01 – 796.9 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2008.csv.zip
2025-04-22 – 11.17 MB – CC BY 4.0
Explore in:
Web News 2009
News from Swedish newspapers' websites
25,888,779
Swedish
Dataset:
webbnyheter2009.xml.bz2
2024-01-05 – 747.74 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2009.csv.zip
2025-04-22 – 7.33 MB – CC BY 4.0
Explore in:
Web News 2010
News from Swedish newspapers' websites
23,803,577
Swedish
Dataset:
webbnyheter2010.xml.bz2
2022-12-02 – 691.09 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2010.csv.zip
2025-04-22 – 9.17 MB – CC BY 4.0
Explore in:
Web News 2011
News from Swedish newspapers' websites
26,268,603
Swedish
Dataset:
webbnyheter2011.xml.bz2
2022-12-02 – 764.57 MB – CC BY 4.0
Word statistics:
stats_webbnyheter2011.csv.zip
2025-04-22 – 10.12 MB – CC BY 4.0
Explore in: