Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
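Many of the datasets listed below are distributed as files with the extension .xml.bz2 (for example romg.xml.bz2), which suggests bz2-compressed XML. As a minimal sketch of how such a file might be read without unpacking it first, the Python snippet below streams the archive and counts the elements with a given tag. The function name `count_elements` and the default tag name `token` are assumptions made for illustration; the actual element names should be checked in the downloaded file.

```python
import bz2
import xml.etree.ElementTree as ET


def count_elements(path, tag="token"):
    """Count elements named `tag` in a bz2-compressed XML file.

    `tag` defaults to "token", which is an assumed element name,
    not one confirmed by the catalog.
    """
    count = 0
    # bz2.open decompresses on the fly, so the archive is never unpacked to disk.
    with bz2.open(path, "rb") as stream:
        # iterparse reports each element once its closing tag has been read.
        for _event, elem in ET.iterparse(stream, events=("end",)):
            if elem.tag == tag:
                count += 1
            # Release the element's text and children to limit memory use.
            elem.clear()
    return count
```

For instance, `count_elements("romg.xml.bz2")` would count the elements named `token` in a local copy of that file, provided the file actually uses that element name.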
All (1282)
Collections (28)
Corpora (1161)
Lexicons (61)
Training and evaluation data (15)
Models (45)
Resource
Number of tokens
Language
Access
Old Finland Swedish: Non-fiction 1900–1959
Part of the Finland Swedish language bank of Swedish in Finland, past and present
346,718
Swedish
Word statistics:
stats_SAKPROSA1900-1959.txt
2017-06-18 – 2.43 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Protokoll vid lantdagen i Borgå år 1809
Part of the Finland Swedish language bank of Swedish in Finland, past and present
472,093
Swedish
Word statistics:
stats_FSBMYNDIGHET1800TAL.txt
2017-06-18 – 1.77 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Spanska Flugan 1839–1841
Part of the Finland Swedish language bank of Swedish in Finland, past and present
41,935
Swedish
Word statistics:
stats_SPANSKAFLUGAN.txt
2017-06-25 – 527.61 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Tidningar Utgifne af et Sällskap i Åbo 1771–1783
Part of the Finland Swedish language bank of Swedish in Finland, past and present
6,532
Swedish
Word statistics:
stats_TIDNINGARSALLSKAPETIABO.txt
2017-06-18 – 123.03 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Typografiskt minnesblad 1891
Part of the Finland Swedish language bank of Swedish in Finland, past and present
10,234
Swedish
Word statistics:
stats_TYPOGRAFISKTMINNESBLAD.txt
2017-06-18 – 196.54 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Typograftidning 1889–1890
Part of the Finland Swedish language bank of Swedish in Finland, past and present
63,394
Swedish
Word statistics:
stats_TYPOGRAFTIDNING.txt
2017-06-18 – 753.61 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Uleåborgs Tidning 1877–1887
Part of the Finland Swedish language bank of Swedish in Finland, past and present
13,474
Swedish
Word statistics:
stats_ULEABORGSTIDNING.txt
2017-06-18 – 273.23 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Wasabladet 1866–1896
Part of the Finland Swedish language bank of Swedish in Finland, past and present
69,695
Swedish
Word statistics:
stats_WASABLADET.txt
2017-06-18 – 894 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Wiborgs Tidning 1867–1877
Part of the Finland Swedish language bank of Swedish in Finland, past and present
19,086
Swedish
Word statistics:
stats_WIBORGSTIDNING.txt
2017-06-18 – 343.96 KB – CC BY 4.0
Explore in:
Older Swedish novels
A collection of more than 50 older Swedish novels by 14 different authors
4,347,047
Swedish
Dataset:
romg.xml.bz2
2017-03-17 – 60.15 MB – CC BY 4.0
Word statistics:
stats_ROMG.txt
2017-03-19 – 8.91 MB – CC BY 4.0
Explore in:
OpenEDGeS
The public license subset of the EDGeS Diachronic Bible Corpus, a diachronically and synchronically parallel corpus of Bible translations in Dutch, English, German and Swedish, with texts from the 14th century until today.
19,399,149
Swedish, English, German, Dutch
Dataset:
OpenEDGeS_v1.01.zip
2024-01-25 – 121.17 MB – CC BY-NC-SA 4.0
Dataset:
OpenEDGeS_v1.0.0.zip
2024-01-25 – 72.89 MB – For license details of the previous versions, see the 'Read me.txt' file in the download.
Oral Corpus for Reference of Contemporary Spanish
Corpus with transcriptions from recorded audio tapes from 1991 to 1992. Part of SOL - Spanish Online
1,200,830
Spanish
Dataset:
cor92.xml.bz2
2017-11-10 – 2.33 MB – CC BY 4.0
Explore in:
ORDAT
Yearbook of Svenska Dagbladet 1923–1958
1,528,935
Swedish
Dataset:
ordat.xml.bz2
2017-05-16 – 28.07 MB – CC BY 4.0
Word statistics:
stats_ORDAT.txt
2017-05-21 – 7.13 MB – CC BY 4.0
Explore in:
PAROLE
A corpus annotated with morphological and syntactic information
24,303,096
Swedish
Dataset:
parole.xml.bz2
2017-05-17 – 425.19 MB – CC BY 4.0
Dataset:
parole.zip
2024-01-25 – 67.62 MB – CC BY 4.0
Word statistics:
stats_PAROLE.txt
2017-05-21 – 39.18 MB – CC BY 4.0
Explore in:
Podiet
Articles from the concert magazine Podiet
651,150
Swedish
Dataset:
podiet.xml.bz2
2024-01-02 – 14.4 MB – CC BY 4.0
Word statistics:
stats_podiet.csv
2024-01-03 – 11.99 MB – CC BY 4.0
Explore in:
Poeter.se
Poetry from Poeter.se
106,196,502
Swedish
Dataset:
poeter.xml.bz2
2017-04-20 – 1.65 GB – CC BY 4.0
Word statistics:
stats_POETER.txt
2017-04-23 – 71.35 MB – CC BY 4.0
Explore in:
Preparatory work 1734
Material from the law commission for the Law of 1734
1,603,126
Swedish
Dataset:
forarbeten1734.xml.bz2
2014-12-08 – 9.11 MB – CC BY 4.0
Word statistics:
stats_FORARBETEN1734.txt
2015-06-25 – 4.11 MB – CC BY 4.0
Explore in:
Collection
Press
Swedish press
Swedish
See 6 collected resources
Explore in:
Press 65
Swedish press 1965
1,119,449
Swedish
Dataset:
press65.xml.bz2
2017-03-14 – 20.88 MB – CC BY 4.0
Word statistics:
stats_PRESS65.txt
2017-03-19 – 6.71 MB – CC BY 4.0
Explore in:
Press 76
Swedish press 1976
1,348,122
Swedish
Dataset:
press76.xml.bz2
2017-03-17 – 24.45 MB – CC BY 4.0
Word statistics:
stats_PRESS76.txt
2017-03-19 – 7.37 MB – CC BY 4.0
Explore in:
Press 95
Swedish press 1995
7,671,700
Swedish
Dataset:
press95.xml.bz2
2017-03-15 – 139.65 MB – CC BY 4.0
Word statistics:
stats_PRESS95.txt
2017-03-19 – 19.6 MB – CC BY 4.0
Explore in:
Press 96
Swedish press 1996
6,516,030
Swedish
Dataset:
press96.xml.bz2
2017-03-15 – 117.54 MB – CC BY 4.0
Word statistics:
stats_PRESS96.txt
2017-03-19 – 18.49 MB – CC BY 4.0
Explore in:
Press 97
Swedish press 1997
13,703,279
Swedish
Dataset:
press97.xml.bz2
2017-03-17 – 241.09 MB – CC BY 4.0
Word statistics:
stats_PRESS97.txt
2017-03-19 – 28.62 MB – CC BY 4.0
Explore in:
Press 98
Swedish press 1998
10,740,849
Swedish
Dataset:
press98.xml.bz2
2017-03-17 – 187.08 MB – CC BY 4.0
Word statistics:
stats_PRESS98.txt
2017-03-19 – 24.51 MB – CC BY 4.0
Explore in:
Psalm book (1937)
The Swedish psalm book from 1937
163,574
Swedish
Dataset:
psalmboken.xml.bz2
2017-05-18 – 1.72 MB – CC BY 4.0
Word statistics:
stats_PSALMBOKEN.txt
2016-03-20 – 670.48 KB – CC BY 4.0
Explore in:
Questions and answers about the Swedish language
Advisory emails from the Language Council of Sweden
20,083,415
Swedish
Explore in:
Collection
Riksdag of the Estates
Collection of textual documents from the Swedish Riksdag of the Estates
Swedish
See 7 collected resources
Explore in:
Riksdag of the Estates: Adelsståndet
Part of the data set "Riksdag of the Estates"
64,915,391
Swedish
Dataset:
standsriksdagen-adelsstandet.xml.bz2
2024-06-17 – 852.82 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-adelsstandet.csv
2024-08-05 – 75.66 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Bihang m.m.
Part of the data set "Riksdag of the Estates"
66,201,274
Swedish
Dataset:
standsriksdagen-bihang.xml.bz2
2024-06-18 – 841.09 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-bihang.csv
2024-08-05 – 88.43 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Bondeståndet
Part of the data set "Riksdag of the Estates"
32,884,985
Swedish
Dataset:
standsriksdagen-bondestandet.xml.bz2
2024-06-18 – 411.81 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-bondestandet.csv
2024-08-05 – 54.4 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Borgarståndet
Part of the data set "Riksdag of the Estates"
35,604,839
Swedish
Dataset:
standsriksdagen-borgarstandet.xml.bz2
2024-06-18 – 477.72 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-borgarstandet.csv
2024-08-05 – 50.25 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Prästeståndet
Part of the data set "Riksdag of the Estates"
30,653,241
Swedish
Dataset:
standsriksdagen-prastestandet.xml.bz2
2024-06-19 – 422.82 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-prastestandet.csv
2024-08-05 – 45.59 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Riksdagsakter
Part of the data set "Riksdag of the Estates"
4,052,160
Swedish
Dataset:
standsriksdagen-riksdagsakter.xml.bz2
2024-06-17 – 44.69 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-riksdagsakter.csv
2024-08-05 – 12.27 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Riksdagsbeslut
Part of the data set "Riksdag of the Estates"
355,722
Swedish
Dataset:
standsriksdagen-riksdagsbeslut.xml.bz2
2024-06-18 – 3.44 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-riksdagsbeslut.csv
2024-08-05 – 2.96 MB – CC BY 4.0
Explore in:
Riksdag's open data: Documents from Committees
Documents from the committees, including reports to the Committee on the Constitution (KU-anmälningar), minutes, annual activity reports, and the old document series Utredningar från riksdagen
65,746
Swedish
Dataset:
segreg-rd-utsk.xml.bz2
2024-11-18 – 1.08 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-utsk.csv
CC BY 4.0
Explore in:
Riksdag's open data: Statements of opinion
Statements of opinion from the committees
669,769
Swedish
Dataset:
segreg-rd-yttr.xml.bz2
2024-11-18 – 13.04 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-yttr.csv
CC BY 4.0
Explore in:
Collection
Riksdagens öppna data
Data from the Swedish parliament collected from data.riksdagen.se
Swedish
See 21 collected resources
Explore in:
Riksdagens öppna data: Betänkande
Committee reports and statements, including the Riksdag's decisions, a summary of the voting results, and Beslut i korthet (decisions in brief)
203,229,298
Swedish
Dataset:
rd-bet.xml.bz2
2022-10-11 – 3.84 GB – CC BY 4.0
Word statistics:
stats_rd-bet.csv
2022-10-11 – 290.32 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Departementsserien
Inquiries from the government ministries
50,678,547
Swedish
Dataset:
rd-ds.xml.bz2
2022-09-06 – 928.31 MB – CC BY 4.0
Word statistics:
stats_rd-ds.csv
2022-09-06 – 48.44 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: EUN
Documents from the Committee on EU Affairs (EU-nämnden), including meeting notices, agendas, minutes, and written consultations with the government
722,016
Swedish
Dataset:
rd-eun.xml.bz2
2023-02-03 – 8.72 MB – CC BY 4.0
Word statistics:
stats_rd-eun.csv
2023-02-04 – 1.01 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Faktapromemoria
The government's explanatory memoranda on proposals from the European Commission
3,373,261
Swedish
Dataset:
rd-fpm.xml.bz2
2024-01-08 – 68.67 MB – CC BY 4.0
Word statistics:
stats_rd-fpm.csv
2023-01-26 – 7.05 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Föredragningslista
Agendas for the meetings of the Chamber
842,042
Swedish
Dataset:
rd-flista.xml.bz2
2023-02-03 – 11.77 MB – CC BY 4.0
Word statistics:
stats_rd-flista.csv
2023-02-04 – 1.9 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Framställning/redogörelse
Submissions and reports from bodies appointed by the Riksdag
18,044,760
Swedish
Dataset:
rd-frsrdg.xml.bz2
2022-09-06 – 350.59 MB – CC BY 4.0
Word statistics:
stats_rd-frsrdg.csv
2022-09-06 – 20.91 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Interpellation
Interpellations from members to the government
25,969,006
Swedish
Dataset:
rd-ip.xml.bz2
2022-09-06 – 521.92 MB – CC BY 4.0
Word statistics:
stats_rd-ip.csv
2022-09-06 – 22.57 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Kammaraktiviteter
6,298,451
Swedish
Dataset:
rd-kammakt.xml.bz2
2023-02-06 – 129.07 MB – CC BY 4.0
Word statistics:
stats_rd-kammakt.csv
2023-02-07 – 9.48 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: KOM
Proposals and reports from the European Commission, known as KOM documents
44,678,107
Swedish
Dataset:
rd-kom.xml.bz2
2024-01-08 – 621 MB – CC BY 4.0
Word statistics:
stats_rd-kom.csv
2024-01-09 – 36.35 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Motion
Motions from members of the Riksdag
162,923,798
Swedish
Dataset:
rd-mot.xml.bz2
2022-10-11 – 3.4 GB – CC BY 4.0
Word statistics:
stats_rd-mot.csv
2022-10-11 – 162.55 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Övrigt
The document series Riksrevisionens granskningsrapporter, Utredningar från Riksdagsförvaltningen and Rapporter från riksdagen, as well as planning documents, appendices to documents, extracts from the Riksdag's databases, and the old document series Utredningar från riksdag
21,916,385
Swedish
Dataset:
rd-ovr.xml.bz2
2022-09-08 – 417.6 MB – CC BY 4.0
Word statistics:
stats_rd-ovr.csv
2022-09-08 – 29.38 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Proposition
Government bills and written communications from the government
379,103,550
Swedish
Dataset:
rd-prop.xml.bz2
2022-10-12 – 6.98 GB – CC BY 4.0
Word statistics:
stats_rd-prop.csv
2022-10-12 – 432.47 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Protokoll
Minutes from the meetings of the Chamber
247,384,265
Swedish
Dataset:
rd-prot.xml.bz2
2024-01-11 – 4.69 GB – CC BY 4.0
Word statistics:
stats_rd-prot.csv
2024-01-12 – 330.37 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Riksdagsskrivelse
Written communications from the Riksdag to the government
236,335
Swedish
Dataset:
rd-rskr.xml.bz2
2022-09-09 – 2.55 MB – CC BY 4.0
Word statistics:
stats_rd-rskr.csv
2022-09-09 – 633.65 KB – CC BY 4.0
Explore in:
Riksdagens öppna data: Sammanträden
87,453
Swedish
Dataset:
rd-samtr.xml.bz2
2022-09-09 – 1.61 MB – CC BY 4.0
Word statistics:
stats_rd-samtr.csv
2022-09-09 – 603.69 KB – CC BY 4.0
Explore in:
Riksdagens öppna data: Skriftliga frågor
Written questions from members to the government, and the answers to them
14,599,076
Swedish
Dataset:
rd-skfr.xml.bz2
2022-09-09 – 320.97 MB – CC BY 4.0
Word statistics:
stats_rd-skfr.csv
2022-09-09 – 17.85 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Statens offentliga utredningar
Proposals to the government from various inquiries
273,083,646
Swedish
Dataset:
rd-sou.xml.bz2
2024-01-11 – 5.09 GB – CC BY 4.0
Word statistics:
stats_rd-sou.csv
2024-01-12 – 164.59 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Talarlista
Speaker lists for the meetings of the Chamber
320,875
Swedish
Dataset:
rd-tlista.xml.bz2
2022-09-09 – 3.63 MB – CC BY 4.0
Word statistics:
stats_rd-tlista.csv
2022-09-09 – 842.28 KB – CC BY 4.0
Explore in:
Riksdagens öppna data: Utredningar
Committee terms of reference and committee reports for inquiries appointed by the government
1,548,660
Swedish
Dataset:
rd-utr.xml.bz2
2022-09-09 – 25.19 MB – CC BY 4.0
Word statistics:
stats_rd-utr.csv
2022-09-09 – 4.72 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Utskottsdokument
Documents from the committees, including reports to the Committee on the Constitution (KU-anmälningar), minutes, annual activity reports, and the old document series Utredningar från riksdagen
5,865,972
Swedish
Dataset:
rd-utsk.xml.bz2
2022-09-09 – 80.14 MB – CC BY 4.0
Word statistics:
stats_rd-utsk.csv
2022-09-09 – 5.93 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Yttrande
Statements of opinion from the committees
9,511,225
Swedish
Dataset:
rd-yttr.xml.bz2
2024-01-10 – 190.75 MB – CC BY 4.0
Word statistics:
stats_rd-yttr.csv
2024-01-11 – 20.03 MB – CC BY 4.0
Explore in:
Rösträtt för kvinnor
Annual volumes 1912–1918 of the journal Rösträtt för kvinnor
1,873,503
Swedish
Dataset:
runeberg-rost.xml.bz2
2014-12-08 – 23.62 MB – CC BY 4.0
Word statistics:
stats_RUNEBERG-ROST.txt
2015-06-25 – 7.8 MB – CC BY 4.0
Explore in:
SALT – Swedish-Dutch
Dutch-Swedish parallel corpus of 20th century fictional and nonfictional texts.
2,845,857
Swedish, Dutch
Dataset:
saltnld-sv.xml.bz2
2016-05-03 – 18.23 MB – CC BY 4.0
Dataset:
saltnld-nl.xml.bz2
2016-05-03 – 9.81 MB – CC BY 4.0
Word statistics:
stats_SALTNLD-SV.txt
2016-05-08 – 5.42 MB – CC BY 4.0
Word statistics:
stats_SALTNLD-NL.txt
2016-05-08 – 2.79 MB – CC BY 4.0
Explore in:
SAOB1950
Scanned books from 1950 to 2007 that are used as source material for updating SAOB, with a selection that reflects the Swedish vocabulary during the 20th century.
50,285,466
Swedish
Dataset:
saob-bocker.xml.bz2
2023-11-30 – 1006.14 MB – CC BY 4.0
Word statistics:
stats_saob-bocker.csv
2023-12-01 – 367.54 MB – CC BY 4.0
Explore in:
ScandiSent
Sentiment corpus for Swedish, Norwegian, Danish, Finnish and English, crawled from Trustpilot.
Swedish, Norwegian Bokmål, Danish, English, Finnish
Dataset:
ScandiSent.zip
2024-01-25 – 5.16 MB – CC BY 4.0
Dataset:
ScandiSent-mt.zip
2024-01-25 – 3.62 MB – CC BY 4.0
Segregations språk: Riksdagens öppna data: Betänkande
Texts that treat segregation
26,706,890
Swedish
Dataset:
segreg-rd-bet.xml.bz2
2024-11-18 – 496.58 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-bet.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Departementsserien
Texts that treat segregation
6,820,996
Swedish
Dataset:
segreg-rd-ds.xml.bz2
2024-11-18 – 112.8 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-ds.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: EUN
Texts that treat segregation
3,703
Swedish
Dataset:
segreg-rd-eun.xml.bz2
2024-11-18 – 55.98 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-eun.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Faktapromemoria
Texts that treat segregation
18,678
Swedish
Dataset:
segreg-rd-fpm.xml.bz2
2024-11-18 – 383.69 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-fpm.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Föredragningslista
Texts that treat segregation
5,149
Swedish
Dataset:
segreg-rd-flista.xml.bz2
2024-11-18 – 68.35 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-flista.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Framställning/redogörelse
Texts that treat segregation
1,316,348
Swedish
Dataset:
segreg-rd-frsrdg.xml.bz2
2024-11-18 – 23.93 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-frsrdg.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Interpellation
Texts that treat segregation
948,204
Swedish
Dataset:
segreg-rd-ip.xml.bz2
2024-11-18 – 18.59 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-ip.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Kammaraktiviteter
Texts that treat segregation
1,953,686
Swedish
Dataset:
segreg-rd-kammakt.xml.bz2
2024-11-18 – 39.34 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-kammakt.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: KOM
Texts that treat segregation
1,962,097
Swedish
Dataset:
segreg-rd-kom.xml.bz2
2024-11-18 – 13.21 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-kom.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Motion
Texts that treat segregation
16,208,509
Swedish
Dataset:
segreg-rd-mot.xml.bz2
2024-11-18 – 338.26 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-mot.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Övrigt
Texts that treat segregation
1,854,388
Swedish
Dataset:
segreg-rd-ovr.xml.bz2
2024-11-18 – 30.76 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-ovr.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Proposition
Texts that treat segregation
35,480,771
Swedish
Dataset:
segreg-rd-prop.xml.bz2
2024-11-18 – 629.25 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-prop.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Protokoll
Texts that treat segregation
57,270,162
Swedish
Dataset:
segreg-rd-prot.xml.bz2
2024-11-18 – 1.06 GB – CC BY 4.0
Word statistics:
stats_segreg-rd-prot.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Skriftliga frågor
Texts that treat segregation
139,993
Swedish
Dataset:
segreg-rd-skfr.xml.bz2
2024-11-18 – 2.83 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-skfr.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Statens offentliga utredningar
Texts that treat segregation
66,695,400
Swedish
Dataset:
segreg-rd-sou.xml.bz2
2024-11-18 – 1.23 GB – CC BY 4.0
Word statistics:
stats_segreg-rd-sou.csv
CC BY 4.0
Explore in:
Segregations språk: Riksdagens öppna data: Utredningar
Texts that treat segregation
4,121
Swedish
Dataset:
segreg-rd-utr.xml.bz2
2024-11-18 – 77.88 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-utr.csv
CC BY 4.0
Explore in:
SemEval2020 Task 1
Swedish Test Data for SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection (extracts from Kubhist v2)
182,000,000
Swedish
Dataset:
semeval2020_ulscd_swe.zip
2024-01-25 – 956.05 MB – CC BY 4.0
Siberian-German
Transcribed Siberian German, a variety of German spoken by about 36,000 people in the Krasnoyarsk region of Siberia (Russia).
34,205
Swedish
Word statistics:
stats_SIBERIANGERMANDIALOGS.txt
2013-02-23 – 101.72 KB – CC BY 4.0
Explore in:
Sibirientyska kvinnor
Dialogs between four women born between 1927 and 1937 in the Soviet Volga Republic
16,208
Swedish
Word statistics:
stats_SIBERIANGERMANWOMEN.txt
2017-03-19 – 44.62 KB – CC BY 4.0
Explore in:
SIC2 - Stockholm Internet Corpus
The Stockholm Internet Corpus (SIC2) contains Swedish blog posts, annotated with part of speech, morphological features, and named entities.
13,562
Swedish
Dataset:
sic2.xml.bz2
2020-11-25 – 262.36 KB – CC BY 4.0
Word statistics:
stats_sic2.csv
2021-08-12 – 177.44 KB – CC BY 4.0
Dataset:
sic2.zip
CC BY 4.0
Dataset:
readme.txt
2020-11-17 – 2.18 KB – CC BY 4.0
Explore in:
Smittskydd
The newspaper Smittskydd by Smittskyddsinstitutet (Swedish Institute for Communicable Disease Control) 2002–2010
691,716
Swedish
Dataset:
smittskydd.xml.bz2
2017-04-05 – 11.26 MB – CC BY 4.0
Word statistics:
stats_SMITTSKYDD.txt
2017-04-09 – 3 MB – CC BY 4.0
Explore in:
SNP 1978–79
Swedish parliament proceedings 1978–1979
4,865,138
Swedish
Dataset:
snp7879.xml.bz2
2017-04-05 – 81.35 MB – CC BY 4.0
Word statistics:
stats_SNP7879.txt
2017-04-09 – 7.13 MB – CC BY 4.0
Explore in:
Collection
Somali corpora
A collection of Somali corpora
Somali
See 26 collected resources
Explore in:
Somali Wikipedia
Corpus of Somali Wikipedia
869,335
Somali
Dataset:
wikipedia-so.xml.bz2
2016-10-27 – 2.34 MB – CC BY 4.0
Word statistics:
stats_WIKIPEDIA-SO.txt
2020-02-25 – 1.9 MB – CC BY 4.0
Explore in:
Somali: Af Soomaali 1971-79
50,794
Somali
Dataset:
somali-1971-79.xml.bz2
2023-02-17 – 135.14 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2001 Somaliland
35,043
Somali
Dataset:
somali-as-2001.xml.bz2
2021-08-27 – 113.01 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2001 Soomaaliya
129,947
Somali
Dataset:
somali-2001.xml.bz2
2023-02-17 – 288.17 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2006 Itoobiya
64,351
Somali
Dataset:
somali-itoobiya.xml.bz2
2017-06-28 – 125.93 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2010 Somaliland
51,513
Somali
Dataset:
somali-hargeysa-2010.xml.bz2
2017-11-27 – 145.67 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2013 Somaliland
25,247
Somali
Dataset:
somali-as-2013.xml.bz2
2019-02-18 – 59.78 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2018 Soomaaliya
15,677
Somali
Dataset:
somali-as-2018.xml.bz2
2019-10-01 – 32.81 KB – CC BY 4.0
Explore in:
Somali: Afka Hooyo 1992-02 Kanada
706
Somali
Dataset:
somali-ah-1992-02-kanada.xml.bz2
2017-01-30 – 2.99 KB – CC BY 4.0
Explore in:
Somali: Afka Hooyo 2010–19 Iswiidhan
21,542
Somali
Dataset:
somali-ah-2010-19.xml.bz2
2021-08-30 – 65.49 KB – CC BY 4.0
Explore in:
Somali: BBC
82,437
Somali
Dataset:
somali-bbc.xml.bz2
2017-05-31 – 181.65 KB – CC BY 4.0
Explore in:
Somali: Caafimaad 1972–79
13,550
Somali
Dataset:
somali-caafimaad-1972-79.xml.bz2
2021-08-30 – 38.92 KB – CC BY 4.0
Explore in:
Somali: Caafimaad 1994
8,977
Somali
Dataset:
somali-caafimaad-1994.xml.bz2
2017-05-16 – 24.79 KB – CC BY 4.0
Explore in:
Somali: Cilmi-Afeed
190,429
Somali
Dataset:
somali-cilmi.xml.bz2
2021-08-27 – 683.26 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 1971–1980
79,005
Somali
Dataset:
somali-cb.xml.bz2
2018-03-12 – 212.86 KB – CC BY 4.0
Word statistics:
stats_SOMALI-CB.txt
2020-02-25 – 265.81 KB – CC BY 4.0
Explore in: