Skip to main content

Error message

Warning: Undefined array key "license_other" in Drupal\sb_resources\Plugin\Field\FieldFormatter\SbResourcesDownloadsFormatter->viewElements() (line 42 of modules/custom/sb_resources/src/Plugin/Field/FieldFormatter/SbResourcesDownloadsFormatter.php).

Language resources

On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
Resurs Antal tokens Språk Åtkomst
Kubord 2: Word relations Expressen 2017
Part of the collection Kubord 2
38,788,832 Swedish
Kubord 2: Word relations Expressen 2018
Part of the collection Kubord 2
39,171,386 Swedish
Kubord 2: Word relations Expressen 2019
Part of the collection Kubord 2
38,699,090 Swedish
Kubord 2: Word relations Expressen 2020
Part of the collection Kubord 2
34,996,401 Swedish
Kubord 2: Word relations Expressen 2021
Part of the collection Kubord 2
34,415,510 Swedish
Kubord 2: Word relations Expressen 2022
Part of the collection Kubord 2
32,207,321 Swedish
Kubord 2: Word relations Expressen 2023
Part of the collection Kubord 2
30,703,605 Swedish
Kubord 2: Word relations Expressen 2024
Part of the collection Kubord 2
28,680,208 Swedish
Kubord 2: Word relations Göteborgsposten 2013
Part of the collection Kubord 2
35,172,089 Swedish
Kubord 2: Word relations Göteborgsposten 2014
Part of the collection Kubord 2
41,140,687 Swedish
Kubord 2: Word relations Göteborgsposten 2015
Part of the collection Kubord 2
37,781,416 Swedish
Kubord 2: Word relations Göteborgsposten 2016
Part of the collection Kubord 2
30,840,113 Swedish
Kubord 2: Word relations Göteborgsposten 2017
Part of the collection Kubord 2
29,777,289 Swedish
Kubord 2: Word relations Göteborgsposten 2018
Part of the collection Kubord 2
31,021,011 Swedish
Kubord 2: Word relations Göteborgsposten 2019
Part of the collection Kubord 2
30,900,679 Swedish
Kubord 2: Word relations Göteborgsposten 2020
Part of the collection Kubord 2
27,093,412 Swedish
Kubord 2: Word relations Göteborgsposten 2021
Part of the collection Kubord 2
26,926,176 Swedish
Kubord 2: Word relations Göteborgsposten 2022
Part of the collection Kubord 2
27,506,663 Swedish
Kubord 2: Word relations Göteborgsposten 2023
Part of the collection Kubord 2
27,208,215 Swedish
Kubord 2: Word relations Göteborgsposten 2024
Part of the collection Kubord 2
25,111,037 Swedish
Kubord 2: Word relations Svenska Dagbladet 2010
Part of the collection Kubord 2
35,569,612 Swedish
Kubord 2: Word relations Svenska Dagbladet 2011
Part of the collection Kubord 2
37,267,222 Swedish
Kubord 2: Word relations Svenska Dagbladet 2012
Part of the collection Kubord 2
34,337,038 Swedish
Kubord 2: Word relations Svenska Dagbladet 2013
Part of the collection Kubord 2
31,293,680 Swedish
Kubord 2: Word relations Svenska Dagbladet 2014
Part of the collection Kubord 2
32,195,799 Swedish
Kubord 2: Word relations Svenska Dagbladet 2015
Part of the collection Kubord 2
31,155,970 Swedish
Kubord 2: Word relations Svenska Dagbladet 2016
Part of the collection Kubord 2
30,769,788 Swedish
Kubord 2: Word relations Svenska Dagbladet 2017
Part of the collection Kubord 2
30,140,228 Swedish
Kubord 2: Word relations Svenska Dagbladet 2018
Part of the collection Kubord 2
30,465,422 Swedish
Kubord 2: Word relations Svenska Dagbladet 2019
Part of the collection Kubord 2
27,607,664 Swedish
Kubord 2: Word relations Svenska Dagbladet 2020
Part of the collection Kubord 2
26,943,992 Swedish
Kubord 2: Word relations Svenska Dagbladet 2021
Part of the collection Kubord 2
27,265,483 Swedish
Kubord 2: Word relations Svenska Dagbladet 2022
Part of the collection Kubord 2
27,524,626 Swedish
Kubord 2: Word relations Svenska Dagbladet 2023
Part of the collection Kubord 2
26,862,480 Swedish
Kubord 2: Word relations Svenska Dagbladet 2024
Part of the collection Kubord 2
25,027,873 Swedish
Kubord 2: Word relations Sydsvenskan 2013
Part of the collection Kubord 2
33,470,566 Swedish
Kubord 2: Word relations Sydsvenskan 2014
Part of the collection Kubord 2
42,644,505 Swedish
Kubord 2: Word relations Sydsvenskan 2015
Part of the collection Kubord 2
43,896,523 Swedish
Kubord 2: Word relations Sydsvenskan 2016
Part of the collection Kubord 2
45,168,823 Swedish
Kubord 2: Word relations Sydsvenskan 2017
Part of the collection Kubord 2
43,865,362 Swedish
Kubord 2: Word relations Sydsvenskan 2018
Part of the collection Kubord 2
43,758,009 Swedish
Kubord 2: Word relations Sydsvenskan 2019
Part of the collection Kubord 2
42,338,675 Swedish
Kubord 2: Word relations Sydsvenskan 2020
Part of the collection Kubord 2
38,070,847 Swedish
Kubord 2: Word relations Sydsvenskan 2021
Part of the collection Kubord 2
40,391,762 Swedish
Kubord 2: Word relations Sydsvenskan 2022
Part of the collection Kubord 2
38,939,336 Swedish
Kubord 2: Word relations Sydsvenskan 2023
Part of the collection Kubord 2
34,317,593 Swedish
Kubord 2: Word relations Sydsvenskan 2024
Part of the collection Kubord 2
32,757,325 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2013
Part of the collection Kubord 2
21,696,541 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2014
Part of the collection Kubord 2
21,993,409 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2015
Part of the collection Kubord 2
20,145,730 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2016
Part of the collection Kubord 2
20,957,064 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2017
Part of the collection Kubord 2
21,384,420 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2018
Part of the collection Kubord 2
21,207,648 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2019
Part of the collection Kubord 2
19,785,748 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2020
Part of the collection Kubord 2
17,826,910 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2021
Part of the collection Kubord 2
17,046,977 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2022
Part of the collection Kubord 2
16,602,847 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2023
Part of the collection Kubord 2
14,005,481 Swedish
Kubord 2: Word relations Östgöta Correspondenten 2024
Part of the collection Kubord 2
11,886,483 Swedish
KVAH
Kungl. Vetenskapsakademiens Handlingar
97,717 Swedish
Collection
Kvinnotidningar
Material from historical women's periodicals
Swedish
Kvinnotidningar: Dagny
A corpus with texts from the women's newspaper Dagny
8,124,256 Swedish
Kvinnotidningar: Hertha
A corpus with texts from the women's newspaper Hertha
3,842,984 Swedish
Kvinnotidningar: Idun
A corpus with texts from the women's newspaper Idun
44,944,172 Swedish
Kvinnotidningar: Kvinnornas Tidning
A corpus with texts from the women's newspaper Kvinnornas Tidning
5,468,918 Swedish
Kvinnotidningar: Morgonbris
A corpus with texts from the women's newspaper Morgonbris
3,551,943 Swedish
Kvinnotidningar: Rösträtt för Kvinnor
A corpus with texts from the women's newspaper Rösträtt för Kvinnor
2,202,776 Swedish
Kvinnotidningar: Tidevarvet
A corpus with texts from the women's newspaper Tidevarvet
6,813,909 Swedish
Lawline
Questions and answers about law counseling.
12,002,288 Swedish
Laws from the 1800's
Regeringsformen 1809 med ändringar 1809-1974
446,438 Swedish
Laws of 1734
Materialet utgörs av balkarna i själva lagtexten, förordet samt domarreglerna. Materialet är inskrivet för hand och korrekturläst, men en del fel finns fortfarande kvar.
98,120 Swedish
Collection
Learner Language
Learner Language is a collection of corpor and lexicons that describe learner language. Corpora include both texts/audio produced by language learners, as well as texts/language they are exposed to (reading or listening to, e.g. course book texts). Even some derivative resources based on these corpora are included in this collection.
Swedish, Multiple languages
Linguistic Survey of India (LSI)
1,193,437 English
Collection
Läkartidningen medical journal
Corpus for health care technical language
Swedish
Läkartidningen medical journal 1996
Läkartidningens publicerade artiklar under 1996.
2,016,356 Swedish
Läkartidningen medical journal 1997
Läkartidningens publicerade artiklar under 1997.
1,977,051 Swedish
Läkartidningen medical journal 1998
Läkartidningens publicerade artiklar under 1998.
2,195,964 Swedish
Läkartidningen medical journal 1999
Läkartidningens publicerade artiklar under 1999.
2,075,532 Swedish
Läkartidningen medical journal 2000
Läkartidningens publicerade artiklar under 2000.
2,000,393 Swedish
Läkartidningen medical journal 2001
Läkartidningens publicerade artiklar under 2001.
2,094,491 Swedish
Läkartidningen medical journal 2002
Läkartidningens publicerade artiklar under 2002.
2,009,521 Swedish
Läkartidningen medical journal 2003
Läkartidningens publicerade artiklar under 2003.
1,748,780 Swedish
Läkartidningen medical journal 2004
Läkartidningens publicerade artiklar under 2004.
1,831,732 Swedish
Läkartidningen medical journal 2005
Läkartidningens publicerade artiklar under 2005.
1,505,574 Swedish
Läkartidningen medical journal 2006
Läkartidningens publicerade artiklar under 2006.
1,586,627 Swedish
LäSBarT
Easy to read Swedish and texts from children's books
1,129,083 Swedish
MARB
A dataset for studying Marked Attribute Reporting Bias
English
MAÞiR Trees
An Old Swedish treebank, with lemma, parts-of-speech, and PROIEL-style dependency syntax.
33,721 Swedish
Collection
Medieval letters
Swedish medieval charters from Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Latin, German, Norwegian, Swedish
Medieval letters: German
Charters written in German, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
177,806 German
Medieval letters: Latin
Charters written in Latin, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
2,249,923 Latin
Medieval letters: Norwegian
Charters written in Norwegian, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
27,718 Norwegian
Medieval letters: Other languages
Charters written in other languages, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
39,430 Swedish
Medieval letters: Swedish
Charters written in Swedish, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
967,228 Swedish
Memory books of the city of Stockholm
Minutes and memorial notes from Stockholm City Hall, 1626.
121,366 Swedish
MEPAC bloggar
2,738,428 Swedish
MEPAC intervjuer
331,998 Swedish
MuClaGED
MuClaGED is a dataset for multi-class Grammatical Error Detection for Swedish. The dataset is based on the SweLL-gold corpus.
155,415 Swedish
MultiGEC
MultiGEC is a dataset for Grammatical Error Correction containing parallel data for 12 languages and 17 subcorpora. Each subcorpus contains two or more parallel versions of the same texts (typically, full learner essays), where one version (orig) is the one that the author originally wrote, and the others (ref1, ref2, ...) are corrected versions of the same text. Languages included: Czech, English, Estonian, German, Greek, Icelandic, Italian, Latvian, Russian, Slovene, Swedish and Ukrainian (English and Russian are available on request). Texts come from different original corpora, but are reformatted to a unified format.
Czech, German, Modern Greek (1453-), English, Estonian, Icelandic, Italian, Latvian, Russian, Slovenian, Swedish, Ukrainian
MultiGED
MultiGEC is a dataset for Grammatical Error Detection (a task within NLP) containing data for 5 languages (Czech, English, German, Italian and Swedish).
Czech, German, English, Italian, Swedish
BibTeX list