Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1395)
Collections (32)
Corpora (1234)
Lexicons (85)
Training and evaluation data (27)
Models (49)
Resource
Number of tokens
Language
Access
Kubord 2: Word relations Expressen 2016
Part of the collection Kubord 2
40,171,544
Swedish
Word statistics:
stats_kubord2-exp-2016.csv.zip
2025-04-22 – 449.94 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2017
Part of the collection Kubord 2
38,788,832
Swedish
Word statistics:
stats_kubord2-exp-2017.csv.zip
2025-04-22 – 434.32 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2018
Part of the collection Kubord 2
39,171,386
Swedish
Word statistics:
stats_kubord2-exp-2018.csv.zip
2025-04-22 – 443.33 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2019
Part of the collection Kubord 2
38,699,090
Swedish
Word statistics:
stats_kubord2-exp-2019.csv.zip
2025-04-22 – 441.2 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2020
Part of the collection Kubord 2
34,996,401
Swedish
Word statistics:
stats_kubord2-exp-2020.csv.zip
2025-04-22 – 399.6 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2021
Part of the collection Kubord 2
34,415,510
Swedish
Word statistics:
stats_kubord2-exp-2021.csv.zip
2025-04-22 – 391.13 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2022
Part of the collection Kubord 2
32,207,321
Swedish
Word statistics:
stats_kubord2-exp-2022.csv.zip
2025-04-22 – 230.65 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2023
Part of the collection Kubord 2
30,703,605
Swedish
Word statistics:
stats_kubord2-exp-2023.csv.zip
2025-04-30 – 219.48 MB – CC-BY-4.0
Kubord 2: Word relations Expressen 2024
Part of the collection Kubord 2
28,680,208
Swedish
Word statistics:
stats_kubord2-exp-2024.csv.zip
2025-05-15 – 215.98 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2013
Part of the collection Kubord 2
35,172,089
Swedish
Word statistics:
stats_kubord2-gp-2013.csv.zip
2025-04-22 – 405.67 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2014
Part of the collection Kubord 2
41,140,687
Swedish
Word statistics:
stats_kubord2-gp-2014.csv.zip
2025-04-22 – 469.77 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2015
Part of the collection Kubord 2
37,781,416
Swedish
Word statistics:
stats_kubord2-gp-2015.csv.zip
2025-04-22 – 439.79 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2016
Part of the collection Kubord 2
30,840,113
Swedish
Word statistics:
stats_kubord2-gp-2016.csv.zip
2025-04-22 – 367.27 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2017
Part of the collection Kubord 2
29,777,289
Swedish
Word statistics:
stats_kubord2-gp-2017.csv.zip
2025-04-22 – 357.82 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2018
Part of the collection Kubord 2
31,021,011
Swedish
Word statistics:
stats_kubord2-gp-2018.csv.zip
2025-04-22 – 370.45 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2019
Part of the collection Kubord 2
30,900,679
Swedish
Word statistics:
stats_kubord2-gp-2019.csv.zip
2025-04-22 – 367.91 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2020
Part of the collection Kubord 2
27,093,412
Swedish
Word statistics:
stats_kubord2-gp-2020.csv.zip
2025-04-22 – 318.35 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2021
Part of the collection Kubord 2
26,926,176
Swedish
Word statistics:
stats_kubord2-gp-2021.csv.zip
2025-04-22 – 317.55 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2022
Part of the collection Kubord 2
27,506,663
Swedish
Word statistics:
stats_kubord2-gp-2022.csv.zip
2025-04-22 – 204.54 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2023
Part of the collection Kubord 2
27,208,215
Swedish
Word statistics:
stats_kubord2-gp-2023.csv.zip
2025-04-30 – 200.99 MB – CC-BY-4.0
Kubord 2: Word relations Göteborgsposten 2024
Part of the collection Kubord 2
25,111,037
Swedish
Word statistics:
stats_kubord2-gp-2024.csv.zip
2025-05-15 – 196.28 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2010
Part of the collection Kubord 2
35,569,612
Swedish
Word statistics:
stats_kubord2-svd-2010.csv.zip
2025-04-22 – 426.55 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2011
Part of the collection Kubord 2
37,267,222
Swedish
Word statistics:
stats_kubord2-svd-2011.csv.zip
2025-04-22 – 439.28 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2012
Part of the collection Kubord 2
34,337,038
Swedish
Word statistics:
stats_kubord2-svd-2012.csv.zip
2025-04-22 – 404.65 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2013
Part of the collection Kubord 2
31,293,680
Swedish
Word statistics:
stats_kubord2-svd-2013.csv.zip
2025-04-22 – 367.59 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2014
Part of the collection Kubord 2
32,195,799
Swedish
Word statistics:
stats_kubord2-svd-2014.csv.zip
2025-04-22 – 377.88 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2015
Part of the collection Kubord 2
31,155,970
Swedish
Word statistics:
stats_kubord2-svd-2015.csv.zip
2025-04-22 – 373.54 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2016
Part of the collection Kubord 2
30,769,788
Swedish
Word statistics:
stats_kubord2-svd-2016.csv.zip
2025-04-22 – 366.57 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2017
Part of the collection Kubord 2
30,140,228
Swedish
Word statistics:
stats_kubord2-svd-2017.csv.zip
2025-04-22 – 358.3 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2018
Part of the collection Kubord 2
30,465,422
Swedish
Word statistics:
stats_kubord2-svd-2018.csv.zip
2025-04-22 – 364.22 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2019
Part of the collection Kubord 2
27,607,664
Swedish
Word statistics:
stats_kubord2-svd-2019.csv.zip
2025-04-22 – 330.14 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2020
Part of the collection Kubord 2
26,943,992
Swedish
Word statistics:
stats_kubord2-svd-2020.csv.zip
2025-04-22 – 323.01 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2021
Part of the collection Kubord 2
27,265,483
Swedish
Word statistics:
stats_kubord2-svd-2021.csv.zip
2025-04-22 – 326.98 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2022
Part of the collection Kubord 2
27,524,626
Swedish
Word statistics:
stats_kubord2-svd-2022.csv.zip
2025-04-22 – 208.67 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2023
Part of the collection Kubord 2
26,862,480
Swedish
Word statistics:
stats_kubord2-svd-2023.csv.zip
2025-04-30 – 202.05 MB – CC-BY-4.0
Kubord 2: Word relations Svenska Dagbladet 2024
Part of the collection Kubord 2
25,027,873
Swedish
Word statistics:
stats_kubord2-svd-2024.csv.zip
2025-05-15 – 199.27 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2013
Part of the collection Kubord 2
33,470,566
Swedish
Word statistics:
stats_kubord2-ss-2013.csv.zip
2025-04-22 – 400.01 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2014
Part of the collection Kubord 2
42,644,505
Swedish
Word statistics:
stats_kubord2-ss-2014.csv.zip
2025-04-22 – 502.42 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2015
Part of the collection Kubord 2
43,896,523
Swedish
Word statistics:
stats_kubord2-ss-2015.csv.zip
2025-04-22 – 512.83 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2016
Part of the collection Kubord 2
45,168,823
Swedish
Word statistics:
stats_kubord2-ss-2016.csv.zip
2025-04-22 – 525.66 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2017
Part of the collection Kubord 2
43,865,362
Swedish
Word statistics:
stats_kubord2-ss-2017.csv.zip
2025-04-22 – 509.49 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2018
Part of the collection Kubord 2
43,758,009
Swedish
Word statistics:
stats_kubord2-ss-2018.csv.zip
2025-04-22 – 509.45 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2019
Part of the collection Kubord 2
42,338,675
Swedish
Word statistics:
stats_kubord2-ss-2019.csv.zip
2025-04-22 – 490.49 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2020
Part of the collection Kubord 2
38,070,847
Swedish
Word statistics:
stats_kubord2-ss-2020.csv.zip
2025-04-22 – 439.53 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2021
Part of the collection Kubord 2
40,391,762
Swedish
Word statistics:
stats_kubord2-ss-2021.csv.zip
2025-04-22 – 458.65 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2022
Part of the collection Kubord 2
38,939,336
Swedish
Word statistics:
stats_kubord2-ss-2022.csv.zip
2025-04-22 – 250.69 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2023
Part of the collection Kubord 2
34,317,593
Swedish
Word statistics:
stats_kubord2-ss-2023.csv.zip
2025-04-30 – 224.55 MB – CC-BY-4.0
Kubord 2: Word relations Sydsvenskan 2024
Part of the collection Kubord 2
32,757,325
Swedish
Word statistics:
stats_kubord2-ss-2024.csv.zip
2025-05-15 – 225.42 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2013
Part of the collection Kubord 2
21,696,541
Swedish
Word statistics:
stats_kubord2-ogc-2013.csv.zip
2025-04-22 – 266.29 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2014
Part of the collection Kubord 2
21,993,409
Swedish
Word statistics:
stats_kubord2-ogc-2014.csv.zip
2025-04-22 – 268.75 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2015
Part of the collection Kubord 2
20,145,730
Swedish
Word statistics:
stats_kubord2-ogc-2015.csv.zip
2025-04-22 – 249.21 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2016
Part of the collection Kubord 2
20,957,064
Swedish
Word statistics:
stats_kubord2-ogc-2016.csv.zip
2025-04-22 – 257.95 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2017
Part of the collection Kubord 2
21,384,420
Swedish
Word statistics:
stats_kubord2-ogc-2017.csv.zip
2025-04-22 – 261.92 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2018
Part of the collection Kubord 2
21,207,648
Swedish
Word statistics:
stats_kubord2-ogc-2018.csv.zip
2025-04-22 – 260.19 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2019
Part of the collection Kubord 2
19,785,748
Swedish
Word statistics:
stats_kubord2-ogc-2019.csv.zip
2025-04-22 – 242.76 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2020
Part of the collection Kubord 2
17,826,910
Swedish
Word statistics:
stats_kubord2-ogc-2020.csv.zip
2025-04-22 – 218.59 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2021
Part of the collection Kubord 2
17,046,977
Swedish
Word statistics:
stats_kubord2-ogc-2021.csv.zip
2025-04-22 – 212.3 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2022
Part of the collection Kubord 2
16,602,847
Swedish
Word statistics:
stats_kubord2-ogc-2022.csv.zip
2025-04-22 – 134.14 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2023
Part of the collection Kubord 2
14,005,481
Swedish
Word statistics:
stats_kubord2-ogc-2023.csv.zip
2025-04-30 – 113.56 MB – CC-BY-4.0
Kubord 2: Word relations Östgöta Correspondenten 2024
Part of the collection Kubord 2
11,886,483
Swedish
Word statistics:
stats_kubord2-ogc-2024.csv.zip
2025-05-15 – 103.55 MB – CC-BY-4.0
KVAH
Kungl. Vetenskapsakademiens Handlingar
97,717
Swedish
Dataset:
kvah.xml.bz2
2024-01-05 – 1.29 MB – CC-BY-4.0
Word statistics:
stats_kvah.csv.zip
2025-04-22 – 154.5 KB – CC-BY-4.0
Collection
Kvinnotidningar
Material from historical women's periodicals
Swedish
See 7 collected resources
Kvinnotidningar: Dagny
A corpus with texts from the women's newspaper Dagny
8,124,256
Swedish
Dataset:
ub-kvt-dagny.xml.bz2
2015-04-01 – 81.93 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-DAGNY.txt.zip
2025-04-22 – 7.17 MB – CC-BY-4.0
Kvinnotidningar: Hertha
A corpus with texts from the women's newspaper Hertha
3,842,984
Swedish
Dataset:
ub-kvt-hertha.xml.bz2
2015-04-01 – 40.16 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-HERTHA.txt.zip
2025-04-22 – 3.61 MB – CC-BY-4.0
Kvinnotidningar: Idun
A corpus with texts from the women's newspaper Idun
44,944,172
Swedish
Dataset:
ub-kvt-idun.xml.bz2
2015-04-01 – 417.65 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-IDUN.txt.zip
2025-04-22 – 10.34 MB – CC-BY-4.0
Kvinnotidningar: Kvinnornas Tidning
A corpus with texts from the women's newspaper Kvinnornas Tidning
5,468,918
Swedish
Dataset:
ub-kvt-kvt.xml.bz2
2015-04-01 – 65.01 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-KVT.txt.zip
2025-04-22 – 2.08 MB – CC-BY-4.0
Kvinnotidningar: Morgonbris
A corpus with texts from the women's newspaper Morgonbris
3,551,943
Swedish
Dataset:
ub-kvt-morgonbris.xml.bz2
2015-04-01 – 36.72 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-MORGONBRIS.txt.zip
2025-04-22 – 3.67 MB – CC-BY-4.0
Kvinnotidningar: Rösträtt för Kvinnor
A corpus with texts from the women's newspaper Rösträtt för Kvinnor
2,202,776
Swedish
Dataset:
ub-kvt-rostratt.xml.bz2
2015-04-01 – 24.16 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-ROSTRATT.txt.zip
2025-04-22 – 1.9 MB – CC-BY-4.0
Kvinnotidningar: Tidevarvet
A corpus with texts from the women's newspaper Tidevarvet
6,813,909
Swedish
Dataset:
ub-kvt-tidevarvet.xml.bz2
2015-04-01 – 81.49 MB – CC-BY-4.0
Word statistics:
stats_UB-KVT-TIDEVARVET.txt.zip
2025-04-22 – 3.65 MB – CC-BY-4.0
Lawline
Questions and answers about legal counseling.
12,002,288
Swedish
Word statistics:
stats_LAWLINE.txt.zip
2025-04-22 – 2.04 MB – CC-BY-4.0
Laws from the 1800s
The 1809 Instrument of Government (Regeringsformen), with amendments 1809-1974
446,438
Swedish
Dataset:
lag1800.xml.bz2
2015-05-20 – 3.97 MB – CC-BY-4.0
Word statistics:
stats_LAG1800.txt.zip
2025-04-22 – 467.96 KB – CC-BY-4.0
Laws of 1734
The material consists of the codes (balkar) of the law text itself, the preface, and the rules for judges (domarreglerna). The material was typed in by hand and proofread, but some errors still remain.
98,120
Swedish
Dataset:
lag1734.xml.bz2
2014-09-29 – 565.61 KB – CC-BY-4.0
Word statistics:
stats_LAG1734.txt.zip
2025-04-22 – 98.62 KB – CC-BY-4.0
Collection
Learner Language
Learner Language is a collection of corpora and lexicons that describe learner language. The corpora include texts and audio produced by language learners, as well as texts and language that learners are exposed to through reading or listening (e.g. course book texts). Some derivative resources based on these corpora are also included in this collection.
Swedish, Multiple languages
See 22 collected resources
Linguistic Survey of India (LSI)
1,193,437
English
Dataset:
lsi.xml.bz2
2020-08-25 – 6.23 MB – CC-BY-4.0
Collection
Läkartidningen medical journal
Corpus of technical language used in health care
Swedish
See 11 collected resources
Läkartidningen medical journal 1996
Articles published in Läkartidningen during 1996.
2,016,356
Swedish
Dataset:
lt1996.xml.bz2
2017-04-05 – 38.78 MB – CC-BY-4.0
Word statistics:
stats_LT1996.txt.zip
2025-04-22 – 1.5 MB – CC-BY-4.0
Läkartidningen medical journal 1997
Articles published in Läkartidningen during 1997.
1,977,051
Swedish
Dataset:
lt1997.xml.bz2
2017-03-30 – 37.88 MB – CC-BY-4.0
Word statistics:
stats_LT1997.txt.zip
2025-04-22 – 1.48 MB – CC-BY-4.0
Läkartidningen medical journal 1998
Articles published in Läkartidningen during 1998.
2,195,964
Swedish
Dataset:
lt1998.xml.bz2
2017-03-30 – 41.76 MB – CC-BY-4.0
Word statistics:
stats_LT1998.txt.zip
2025-04-22 – 1.61 MB – CC-BY-4.0
Läkartidningen medical journal 1999
Articles published in Läkartidningen during 1999.
2,075,532
Swedish
Dataset:
lt1999.xml.bz2
2017-03-30 – 39.43 MB – CC-BY-4.0
Word statistics:
stats_LT1999.txt.zip
2025-04-22 – 1.53 MB – CC-BY-4.0
Läkartidningen medical journal 2000
Articles published in Läkartidningen during 2000.
2,000,393
Swedish
Dataset:
lt2000.xml.bz2
2017-03-30 – 37.98 MB – CC-BY-4.0
Word statistics:
stats_LT2000.txt.zip
2025-04-22 – 1.47 MB – CC-BY-4.0
Läkartidningen medical journal 2001
Articles published in Läkartidningen during 2001.
2,094,491
Swedish
Dataset:
lt2001.xml.bz2
2017-03-30 – 39.76 MB – CC-BY-4.0
Word statistics:
stats_LT2001.txt.zip
2025-04-22 – 1.5 MB – CC-BY-4.0
Läkartidningen medical journal 2002
Articles published in Läkartidningen during 2002.
2,009,521
Swedish
Dataset:
lt2002.xml.bz2
2017-03-30 – 38.19 MB – CC-BY-4.0
Word statistics:
stats_LT2002.txt.zip
2025-04-22 – 1.41 MB – CC-BY-4.0
Läkartidningen medical journal 2003
Articles published in Läkartidningen during 2003.
1,748,780
Swedish
Dataset:
lt2003.xml.bz2
2017-03-30 – 33.5 MB – CC-BY-4.0
Word statistics:
stats_LT2003.txt.zip
2025-04-22 – 1.29 MB – CC-BY-4.0
Läkartidningen medical journal 2004
Articles published in Läkartidningen during 2004.
1,831,732
Swedish
Dataset:
lt2004.xml.bz2
2017-03-30 – 34.7 MB – CC-BY-4.0
Word statistics:
stats_LT2004.txt.zip
2025-04-22 – 1.38 MB – CC-BY-4.0
Läkartidningen medical journal 2005
Articles published in Läkartidningen during 2005.
1,505,574
Swedish
Dataset:
lt2005.xml.bz2
2017-03-30 – 28.5 MB – CC-BY-4.0
Word statistics:
stats_LT2005.txt.zip
2025-04-22 – 1.2 MB – CC-BY-4.0
Läkartidningen medical journal 2006
Articles published in Läkartidningen during 2006.
1,586,627
Swedish
Dataset:
lt2006.xml.bz2
2017-04-03 – 29.75 MB – CC-BY-4.0
Word statistics:
stats_LT2006.txt.zip
2025-04-22 – 1.23 MB – CC-BY-4.0
LäSBarT
Easy-to-read Swedish and texts from children's books
1,129,083
Swedish
Dataset:
lasbart.xml.bz2
2017-03-30 – 16.74 MB – CC-BY-4.0
Word statistics:
stats_LASBART.txt.zip
2025-04-22 – 597.31 KB – CC-BY-4.0
MARB
A dataset for studying Marked Attribute Reporting Bias
English
Dataset:
marb_data.tar.bz2
2025-09-08 – 12.42 MB – MIT
Dataset:
marb_code.tar.bz2
2025-09-08 – 15.99 KB – MIT
MAÞiR Trees
An Old Swedish treebank, with lemmas, parts of speech, and PROIEL-style dependency syntax.
33,721
Swedish
Dataset:
mathir_trees_v0.1.tgz
2024-04-17 – 5.49 MB – CC-BY-NC-4.0
Collection
Medieval letters
Swedish medieval charters from Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Latin, German, Norwegian, Swedish
See 5 collected resources
Medieval letters: German
Charters written in German, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
177,806
German
Dataset:
sdhk-tyska.xml.bz2
2015-05-20 – 335.84 KB – CC-BY-4.0
Word statistics:
stats_SDHK-TYSKA.txt.zip
2025-04-22 – 97.14 KB – CC-BY-4.0
Medieval letters: Latin
Charters written in Latin, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
2,249,923
Latin
Dataset:
sdhk-latin.xml.bz2
2015-05-20 – 4.71 MB – CC-BY-4.0
Word statistics:
stats_SDHK-LATIN.txt.zip
2025-04-22 – 699.58 KB – CC-BY-4.0
Medieval letters: Norwegian
Charters written in Norwegian, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
27,718
Norwegian
Dataset:
sdhk-norska.xml.bz2
2015-05-20 – 58.95 KB – CC-BY-4.0
Word statistics:
stats_SDHK-NORSKA.txt.zip
2025-04-22 – 36.4 KB – CC-BY-4.0
Medieval letters: Other languages
Charters written in other languages, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
39,430
Swedish
Dataset:
sdhk-ovrigt.xml.bz2
2015-05-20 – 91.05 KB – CC-BY-4.0
Word statistics:
stats_SDHK-OVRIGT.txt.zip
2025-04-22 – 60.19 KB – CC-BY-4.0
Medieval letters: Swedish
Charters written in Swedish, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
967,228
Swedish
Dataset:
sdhk-svenska.xml.bz2
2014-12-09 – 1.77 MB – CC-BY-4.0
Word statistics:
stats_SDHK-SVENSKA.txt.zip
2025-04-22 – 495.31 KB – CC-BY-4.0
Memory books of the city of Stockholm
Minutes and memorial notes from Stockholm City Hall, 1626.
121,366
Swedish
Dataset:
tankebok.xml.bz2
2014-12-08 – 661.68 KB – CC-BY-4.0
Word statistics:
stats_TANKEBOK.txt.zip
2025-04-22 – 102.77 KB – CC-BY-4.0
MEPAC bloggar
2,738,428
Swedish
Word statistics:
stats_MEPAC.txt.zip
2025-04-22 – 1.25 MB – CC-BY-4.0
MEPAC intervjuer
331,998
Swedish
Word statistics:
stats_MEPAC-I.txt.zip
2025-04-22 – 169.61 KB – CC-BY-4.0
MuClaGED
MuClaGED is a dataset for multi-class Grammatical Error Detection for Swedish. The dataset is based on the SweLL-gold corpus.
155,415
Swedish
MultiGEC
MultiGEC is a dataset for Grammatical Error Correction containing parallel data for 12 languages and 17 subcorpora. Each subcorpus contains two or more parallel versions of the same texts (typically, full learner essays), where one version (orig) is the one that the author originally wrote, and the others (ref1, ref2, ...) are corrected versions of the same text. Languages included: Czech, English, Estonian, German, Greek, Icelandic, Italian, Latvian, Russian, Slovene, Swedish and Ukrainian (English and Russian are available on request). Texts come from different original corpora, but are reformatted to a unified format.
Czech, German, Modern Greek (1453-), English, Estonian, Icelandic, Italian, Latvian, Russian, Slovenian, Swedish, Ukrainian