Menu
News and events
Open submenu
Research
Open submenu
Data
Analyses
Platforms
Open submenu
About us
Open submenu
Contact us
Open submenu
FAQ
Close submenu
News and events
News archive
Conferences and workshops
Open submenu
Blog
Calendar
Open submenu
Close submenu
Conferences and workshops
CLT retreat 2020
AI Trust workshop
Autumn Workshop
Open submenu
CDLC workshop
CLT workshop Spring 2023
EACL 2014
Korp Workshop
Open submenu
NoDaLiDa 2017
RESOURCEFUL
SLTC 2020
Open submenu
Sustainable language representations
Open submenu
Workshop on Profiling second language vocabulary and grammar - 2023
Close submenu
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
Close submenu
Korp Workshop
Korp Workshop 2014
Korpworkshop 2018
Close submenu
SLTC 2020
Programme
Instructions
People
Support
Call for papers
Close submenu
Sustainable language representations
Position statements
Close submenu
Calendar
Previous events
Close submenu
Research
Publications
Doktorandutbildning
Open submenu
Close submenu
Doktorandutbildning
For PhD students and supervisors
Close submenu
Platforms
Korp
Open submenu
Karp
Open submenu
Sparv
Open submenu
Mink
Open submenu
Lärka
Other tools
Open submenu
Close submenu
Korp
User manual
Web API
Distribution and development
Corpus statistics
Sentence sets
Close submenu
Karp
Web API
Close submenu
Sparv
Sparv Pipeline
Sparv's user manual
Annotations by Sparv
Web service (API)
Web Sparv
Close submenu
Mink
User manual
Tutorial
Video: Overview (sv)
Web API
Privacy and data policy
Close submenu
Other tools
Catta
IT-baserad grammatikinlärning
Close submenu
About us
Staff
Organisation
Språkbanken Text i världen
Språkbanken 50 years
Open submenu
A brief history
PhD program
Teaching
How to cite
Alumni
Meetings and workshops
Open submenu
Cookies
Internal
Close submenu
Språkbanken 50 years
Celebration
Close submenu
Meetings and workshops
Kick-off meetings
Open submenu
Workshops
Open submenu
Forskningsmöten
SBX Retreat
Open submenu
Working group meetings
Close submenu
Kick-off meetings
Kick-off H2021
Kick-off V2021
Kick-off H2020
Kick-off V2020
Kick-off H2019
Kick-off V2019
Kick-off H2018
Kick-off V2018
Kick-off H2017
Kick-off V2017
Kick-off H2016
Kick-off V2016
Kick-off H2015
Close submenu
Workshops
End of the year workshop 2024
End of the year workshop 2023
Semester workshop 2022
Semester workshop H2021
Semester workshop V2021
Semester workshop H2020
Semester workshop V2020
Close submenu
SBX Retreat
SBX Retreat 2024
SBX Retreat 2023
SBX Retreat 2022
Close submenu
Contact us
Help desk
Skip to main content
Svenska
English
Språkbanken Text is a part of
Språkbanken
.
News and events
Research
Data
Analyses
Platforms
About us
Contact us
FAQ
Menu
Breadcrumb
Home
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1328)
Collections (31)
Corpora (1199)
Lexicons (66)
Training and evaluation data (15)
Models (48)
Name or description
Language
- Any -
Swedish
Albanian
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Typ
Språk
Åtkomst
Riksdag of the Estates: Prästeståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-prastestandet.xml.bz2
2024-06-19 – 422.82 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-prastestandet.csv
2024-08-05 – 45.59 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Riksdagsakter
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-riksdagsakter.xml.bz2
2024-06-17 – 44.69 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-riksdagsakter.csv
2024-08-05 – 12.27 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Riksdagsbeslut
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-riksdagsbeslut.xml.bz2
2024-06-18 – 3.44 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-riksdagsbeslut.csv
2024-08-05 – 2.96 MB – CC BY 4.0
Explore in:
Collection
Riksdagens öppna data
Data from the Swedish parliament collected from data.riksdagen.se
Corpus
Swedish
See 21 collected resources
Explore in:
Riksdagens öppna data: Betänkande
Utskottens betänkanden och utlåtanden, inklusive rksdagens beslut, en sammanfattning av voteringsresultaten och Beslut i korthet
Corpus
Swedish
Dataset:
rd-bet.xml.bz2
2022-10-11 – 3.84 GB – CC BY 4.0
Word statistics:
stats_rd-bet.csv
2022-10-11 – 290.32 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Departementsserien
Utredningar från regeringens departement
Corpus
Swedish
Dataset:
rd-ds.xml.bz2
2022-09-06 – 928.31 MB – CC BY 4.0
Word statistics:
stats_rd-ds.csv
2022-09-06 – 48.44 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: EUN
Dokument från EU-nämnden, bland annat möteskallelser, föredragningslistor, protokoll och skriftliga samråd med regeringen
Corpus
Swedish
Dataset:
rd-eun.xml.bz2
2023-02-03 – 8.72 MB – CC BY 4.0
Word statistics:
stats_rd-eun.csv
2023-02-04 – 1.01 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Faktapromemoria
Regeringens faktapromemorior om EU-kommissionens förslag
Corpus
Swedish
Dataset:
rd-fpm.xml.bz2
2024-01-08 – 68.67 MB – CC BY 4.0
Word statistics:
stats_rd-fpm.csv
2023-01-26 – 7.05 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Föredragningslista
Föredragningslistor för kammarens sammanträden
Corpus
Swedish
Dataset:
rd-flista.xml.bz2
2023-02-03 – 11.77 MB – CC BY 4.0
Word statistics:
stats_rd-flista.csv
2023-02-04 – 1.9 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Framställning/redogörelse
Framställningar och redogörelser från organ som utsetts av riksdagen
Corpus
Swedish
Dataset:
rd-frsrdg.xml.bz2
2022-09-06 – 350.59 MB – CC BY 4.0
Word statistics:
stats_rd-frsrdg.csv
2022-09-06 – 20.91 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Interpellation
Interpellationer från ledamöterna till regeringen
Corpus
Swedish
Dataset:
rd-ip.xml.bz2
2022-09-06 – 521.92 MB – CC BY 4.0
Word statistics:
stats_rd-ip.csv
2022-09-06 – 22.57 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Kammaraktiviteter
Corpus
Swedish
Dataset:
rd-kammakt.xml.bz2
2023-02-06 – 129.07 MB – CC BY 4.0
Word statistics:
stats_rd-kammakt.csv
2023-02-07 – 9.48 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: KOM
EU-kommissionens förslag och redogörelser, så kallade KOM-dokument
Corpus
Swedish
Dataset:
rd-kom.xml.bz2
2024-01-08 – 621 MB – CC BY 4.0
Word statistics:
stats_rd-kom.csv
2024-01-09 – 36.35 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Motion
Motioner från riksdagens ledamöter
Corpus
Swedish
Dataset:
rd-mot.xml.bz2
2022-10-11 – 3.4 GB – CC BY 4.0
Word statistics:
stats_rd-mot.csv
2022-10-11 – 162.55 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Övrigt
Dokumentserierna Riksrevisionens granskningsrapporter, Utredningar från Riksdagsförvaltningen och Rapporter från riksdagen samt planeringsdokument, bilagor till dokument och uttag ur riksdagens databaser och de gamla dokumentserierna Utredningar från riksdag
Corpus
Swedish
Dataset:
rd-ovr.xml.bz2
2022-09-08 – 417.6 MB – CC BY 4.0
Word statistics:
stats_rd-ovr.csv
2022-09-08 – 29.38 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Proposition
Propositioner och skrivelser från regeringen
Corpus
Swedish
Dataset:
rd-prop.xml.bz2
2022-10-12 – 6.98 GB – CC BY 4.0
Word statistics:
stats_rd-prop.csv
2022-10-12 – 432.47 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Protokoll
Protokoll från kammarens sammanträden
Corpus
Swedish
Dataset:
rd-prot.xml.bz2
2024-01-11 – 4.69 GB – CC BY 4.0
Word statistics:
stats_rd-prot.csv
2024-01-12 – 330.37 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Riksdagsskrivelse
Skrivelser från riksdagen till regeringen
Corpus
Swedish
Dataset:
rd-rskr.xml.bz2
2022-09-09 – 2.55 MB – CC BY 4.0
Word statistics:
stats_rd-rskr.csv
2022-09-09 – 633.65 KB – CC BY 4.0
Explore in:
Riksdagens öppna data: Sammanträden
Corpus
Swedish
Dataset:
rd-samtr.xml.bz2
2022-09-09 – 1.61 MB – CC BY 4.0
Word statistics:
stats_rd-samtr.csv
2022-09-09 – 603.69 KB – CC BY 4.0
Explore in:
Riksdagens öppna data: Skriftliga frågor
Skriftliga frågor från ledamöterna till regeringen och svaren på dessa
Corpus
Swedish
Dataset:
rd-skfr.xml.bz2
2022-09-09 – 320.97 MB – CC BY 4.0
Word statistics:
stats_rd-skfr.csv
2022-09-09 – 17.85 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Statens offentliga utredningar
Olika utredningars förslag till regeringen
Corpus
Swedish
Dataset:
rd-sou.xml.bz2
2024-01-11 – 5.09 GB – CC BY 4.0
Word statistics:
stats_rd-sou.csv
2024-01-12 – 164.59 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Talarlista
Talarlistor för kammarens sammanträden
Corpus
Swedish
Dataset:
rd-tlista.xml.bz2
2022-09-09 – 3.63 MB – CC BY 4.0
Word statistics:
stats_rd-tlista.csv
2022-09-09 – 842.28 KB – CC BY 4.0
Explore in:
Riksdagens öppna data: Utredningar
Kommittédirektiv och kommittéberättelser för utredningar som regeringen tillsätter
Corpus
Swedish
Dataset:
rd-utr.xml.bz2
2022-09-09 – 25.19 MB – CC BY 4.0
Word statistics:
stats_rd-utr.csv
2022-09-09 – 4.72 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Utskottsdokument
Dokument från utskotten, bland annat KU-anmälningar, protokoll, verksamhetsberättelser och den gamla dokumentserien Utredningar från riksdagen
Corpus
Swedish
Dataset:
rd-utsk.xml.bz2
2022-09-09 – 80.14 MB – CC BY 4.0
Word statistics:
stats_rd-utsk.csv
2022-09-09 – 5.93 MB – CC BY 4.0
Explore in:
Riksdagens öppna data: Yttrande
Utskottens yttranden
Corpus
Swedish
Dataset:
rd-yttr.xml.bz2
2024-01-10 – 190.75 MB – CC BY 4.0
Word statistics:
stats_rd-yttr.csv
2024-01-11 – 20.03 MB – CC BY 4.0
Explore in:
Rösträtt för kvinnor
Annual volumes 1912–1918 of the journal Rösträtt för kvinnor
Corpus
Swedish
Dataset:
runeberg-rost.xml.bz2
2014-12-08 – 23.62 MB – CC BY 4.0
Word statistics:
stats_RUNEBERG-ROST.txt
2015-06-25 – 7.8 MB – CC BY 4.0
Explore in:
Russian Constructicon
A Russian Constructicon
Lexicon
Russian, English
Dataset:
konstruktikon-rus.xml
2021-11-09 – 2.72 MB – CC BY 4.0
SALDO
SALDO is an extensive lexicon resource for modern Swedish written language.
Lexicon
Swedish
Dataset:
saldo.xml
2017-09-19 – 70.98 MB – CC BY 4.0
Explore in:
SALDO's morphology
Semantic and morphological lexicon for language technology
Lexicon
Swedish
Dataset:
saldom.xml
2017-09-19 – 242.34 MB – CC BY 4.0
Explore in:
SALDO: examples
Example sentences for senses in SALDO
Lexicon
Swedish
Dataset:
saldoe.xml
2017-09-19 – 1.09 MB – CC BY 4.0
Explore in:
SALT – Swedish-Dutch
Dutch-Swedish parallel corpus of 20th century fictional and nonfictional texts.
Corpus
Swedish, Dutch
Dataset:
saltnld-sv.xml.bz2
2016-05-03 – 18.23 MB – CC BY 4.0
Dataset:
saltnld-nl.xml.bz2
2016-05-03 – 9.81 MB – CC BY 4.0
Word statistics:
stats_SALTNLD-SV.txt
2016-05-08 – 5.42 MB – CC BY 4.0
Word statistics:
stats_SALTNLD-NL.txt
2016-05-08 – 2.79 MB – CC BY 4.0
Explore in:
SAOB1950
Scanned books from 1950 to 2007 that are used as source material for updating SAOB, with a selection that reflects the Swedish vocabulary during the 20th century.
Corpus
Swedish
Dataset:
saob-bocker.xml.bz2
2023-11-30 – 1006.14 MB – CC BY 4.0
Word statistics:
stats_saob-bocker.csv
2023-12-01 – 367.54 MB – CC BY 4.0
Explore in:
ScandiSent
Sentiment Corpus for Swedish, Norwegian, Danish, Finnish and English crawled from trustpilot.
Corpus
Swedish, Norwegian Bokmål, Danish, English, Finnish
Dataset:
ScandiSent.zip
2024-01-25 – 5.16 MB – CC BY 4.0
Dataset:
ScandiSent-mt.zip
2024-01-25 – 3.62 MB – CC BY 4.0
Schlyter
Dictionary of Old Swedish
Lexicon
Swedish
Dataset:
schlyter.xml
2017-09-19 – 6.87 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Activities in the Chamber
Corpus
Swedish
Dataset:
segreg-rd-kammakt.xml.bz2
2025-03-07 – 39.94 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-kammakt.csv.zip
2025-03-06 – 833.54 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Committee on EU Affairs
Documents from the Committee on EU Affairs
Corpus
Swedish
Dataset:
segreg-rd-eun.xml.bz2
2025-03-07 – 58.53 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-eun.csv.zip
2025-03-06 – 12.52 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Committee reports and statements
Utskottens betänkanden och utlåtanden, inklusive riksdagens beslut, en sammanfattning av voteringsresultaten och Beslut i korthet
Corpus
Swedish
Dataset:
segreg-rd-bet.xml.bz2
2025-03-07 – 504.76 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-bet.csv.zip
2025-03-06 – 3.7 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Documents from Committees
Dokument från utskotten, bland annat KU-anmälningar, protokoll, verksamhetsberättelser och den gamla dokumentserien Utredningar från riksdagen
Corpus
Swedish
Dataset:
segreg-rd-utsk.xml.bz2
2025-03-07 – 1.11 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-utsk.csv.zip
2025-03-06 – 117.28 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: EU initiatives
EU initiatives are documents from the European Commission, “COM documents”.
Corpus
Swedish
Dataset:
segreg-rd-kom.xml.bz2
2025-03-07 – 13.62 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-kom.csv.zip
2025-03-06 – 579.99 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Explanatory memorandums on EU proposals
Regeringens faktapromemorior om EU-kommissionens förslag
Corpus
Swedish
Dataset:
segreg-rd-fpm.xml.bz2
2025-03-07 – 388.78 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-fpm.csv.zip
2025-03-06 – 54.98 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Government bills
Propositioner och skrivelser från regeringen
Corpus
Swedish
Dataset:
segreg-rd-prop.xml.bz2
2025-03-07 – 641.52 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-prop.csv.zip
2025-03-06 – 6.69 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Interpellations
Interpellations from members of the Riksdag to the government
Corpus
Swedish
Dataset:
segreg-rd-ip.xml.bz2
2025-03-07 – 18.87 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-ip.csv.zip
2025-03-06 – 526.53 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Ministry Publications Series
Utredningar från regeringens departement
Corpus
Swedish
Dataset:
segreg-rd-ds.xml.bz2
2025-03-07 – 114.52 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-ds.csv.zip
2025-03-06 – 2.71 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Motions
Motions from the members of the Riksdag
Corpus
Swedish
Dataset:
segreg-rd-mot.xml.bz2
2025-03-07 – 343.2 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-mot.csv.zip
2025-03-06 – 3.17 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Order papers
Föredragningslistor för kammarens sammanträden
Corpus
Swedish
Dataset:
segreg-rd-flista.xml.bz2
2025-03-07 – 71.12 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-flista.csv.zip
2025-03-06 – 15.54 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Other documents
Dokumentserierna Riksrevisionens granskningsrapporter, Utredningar från Riksdagsförvaltningen och Rapporter från riksdagen samt planeringsdokument, bilagor till dokument och uttag ur riksdagens databaser och de gamla dokumentserierna Utredningar från riksdag
Corpus
Swedish
Dataset:
segreg-rd-ovr.xml.bz2
2025-03-07 – 31.4 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-ovr.csv.zip
2025-03-06 – 1.19 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Records of proceedings in the Chamber
Records of proceedings in the Chamber
Corpus
Swedish
Dataset:
segreg-rd-prot.xml.bz2
2025-03-07 – 1.08 GB – CC BY 4.0
Word statistics:
stats_segreg-rd-prot.csv.zip
2025-03-06 – 6.44 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Reports
Framställningar och redogörelser från organ som utsetts av riksdagen
Corpus
Swedish
Dataset:
segreg-rd-frsrdg.xml.bz2
2025-03-07 – 24.42 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-frsrdg.csv.zip
2025-03-06 – 825.7 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Statements of opinion
Utskottens yttranden
Corpus
Swedish
Dataset:
segreg-rd-yttr.xml.bz2
2025-03-04 – 13.25 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-yttr.csv.zip
2025-03-06 – 435.64 KB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Swedish Government Official Reports (SOU series)
Olika utredningars förslag till regeringen
Corpus
Swedish
Dataset:
segreg-rd-sou.xml.bz2
2025-03-07 – 1.25 GB – CC BY 4.0
Word statistics:
stats_segreg-rd-sou.csv.zip
2025-03-06 – 10.83 MB – CC BY 4.0
Explore in:
Segregation texts: Riksdag's open data: Written questions
Written questions from members of the Riksdag to the Government and the answer to these
Corpus
Swedish
Dataset:
segreg-rd-skfr.xml.bz2
2025-03-07 – 2.85 MB – CC BY 4.0
Word statistics:
stats_segreg-rd-skfr.csv.zip
2025-03-06 – 175.24 KB – CC BY 4.0
Explore in:
Segregationstexter: Riksdagens öppna data: Utredningar
Kommittédirektiv och kommittéberättelser för utredningar som regeringen tillsätter
Corpus
Swedish
Dataset:
segreg-rd-utr.xml.bz2
2025-03-07 – 78.6 KB – CC BY 4.0
Word statistics:
stats_segreg-rd-utr.csv.zip
2025-03-06 – 11.46 KB – CC BY 4.0
Explore in:
SemEval2020 Task 1
Swedish Test Data for SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection (extracts from Kubhist v2)
Corpus
Swedish
Dataset:
semeval2020_ulscd_swe.zip
2024-01-25 – 956.05 MB – CC BY 4.0
Sen*Lex
Sen*Lex is a sense-based lexicon of vocabulary for Swedish as a second language, featuring frequencies from essays (productive vocabulary, based on SweLL-pilot) and from course books (receptive vocabulary, based on COCTAILL)
Lexicon
Swedish
Dataset:
sen-lex.xlsx
2025-02-19 – 85 bytes – CC BY 4.0
Dataset:
sen-lex.csv
2025-02-19 – 5.08 MB – CC BY-NC-SA 4.0
Explore in:
SenSALDO
SenSALDO, SALDO entries and text word forms with sentiment information (prior polarity)
Lexicon
Swedish
Dataset:
sensaldo-v02.zip
2019-04-04 – 301.78 KB – CC BY 4.0
Dataset:
sensaldo.zip
2018-03-05 – 235.62 KB – CC BY 4.0
Sentiment Lexicon
Sentiment lexicon for Swedish based on SALDO
Lexicon
Swedish
Dataset:
sentimentlex.xml
2017-09-19 – 1.78 MB – CC BY 4.0
Dataset:
sentimentlex.csv
2016-07-11 – 486.9 KB – CC BY 4.0
Sibirian-German
Siberian German is transcribed German spoken of about 36 000 people in the region of Krasnoyarsk in Siberia (Russia).
Corpus
Swedish
Word statistics:
stats_SIBERIANGERMANDIALOGS.txt
2013-02-23 – 101.72 KB – CC BY 4.0
Explore in:
Sibirientyska kvinnor
Dialogs between four women born in 1927 to 1937 in the Soviet Volga Republic
Corpus
Swedish
Word statistics:
stats_SIBERIANGERMANWOMEN.txt
2017-03-19 – 44.62 KB – CC BY 4.0
Explore in:
SIC2 - Stockholm Internet Corpus
The Stockholm Internet Corpus (SIC2) contains Swedish blog posts, annotated with part of speech, morphological features, and named entities.
Corpus
Swedish
Dataset:
sic2.xml.bz2
2020-11-25 – 262.36 KB – CC BY 4.0
Word statistics:
stats_sic2.csv
2021-08-12 – 177.44 KB – CC BY 4.0
Dataset:
sic2.zip
CC BY 4.0
Dataset:
readme.txt
2020-11-17 – 2.18 KB – CC BY 4.0
Explore in:
Simple lexicon
The Swedish SIMPLE Lexicon - A language technology resource with access to semantic information in Swedish
Lexicon
Swedish
Dataset:
simple_n.html
2010-06-10 – 1.61 MB – CC BY 4.0
Dataset:
simple_v.html
2010-06-10 – 559.34 KB – CC BY 4.0
Dataset:
simple_adj.html
2010-06-10 – 104.41 KB – CC BY 4.0
Simple+
The Swedish SIMPLE Lexicon - A language technology resource with access to semantic information in Swedish, connected to SALDO senses
Lexicon
Swedish
Dataset:
simpleplus.xml
2017-09-19 – 6.9 MB – CC BY 4.0
Dataset:
simple_n.html
2010-06-10 – 1.61 MB – CC BY 4.0
Dataset:
simple_v.html
2010-06-10 – 559.34 KB – CC BY 4.0
Dataset:
simple_adj.html
2010-06-10 – 104.41 KB – CC BY 4.0
Explore in:
SKBL
The Biographical Dictionary of Swedish Women
Lexicon
Swedish, English
Dataset:
skbl.json
2025-03-15 – 47.73 MB – CC BY 4.0
Explore in:
Smittskydd
The newspaper Smittskydd by Smittskyddsinstitutet (Swedish Institute for Communicable Disease Control) 2002–2010
Corpus
Swedish
Dataset:
smittskydd.xml.bz2
2017-04-05 – 11.26 MB – CC BY 4.0
Word statistics:
stats_SMITTSKYDD.txt
2017-04-09 – 3 MB – CC BY 4.0
Explore in:
SNP 1978–79
Swedish parliament proceedings 1978–1979
Corpus
Swedish
Dataset:
snp7879.xml.bz2
2017-04-05 – 81.35 MB – CC BY 4.0
Word statistics:
stats_SNP7879.txt
2017-04-09 – 7.13 MB – CC BY 4.0
Explore in:
Söderwall
Dictionary of Old Swedish
Lexicon
Swedish
Dataset:
soederwall.xml
2017-09-19 – 23.42 MB – CC BY 4.0
Explore in:
Söderwall Supplement
Dictionary of Old Swedish
Lexicon
Swedish
Dataset:
soederwall-supp.xml
2017-09-19 – 15.45 MB – CC BY 4.0
Explore in:
Collection
Somali corpora
A collection of Samli corpora
Corpus
Somali
See 26 collected resources
Explore in:
Somali Wikipedia
Corpus of Somali Wikipedia
Corpus
Somali
Dataset:
wikipedia-so.xml.bz2
2016-10-27 – 2.34 MB – CC BY 4.0
Word statistics:
stats_WIKIPEDIA-SO.txt
2020-02-25 – 1.9 MB – CC BY 4.0
Explore in:
Somali: Af Soomaali 1971-79
Corpus
Somali
Dataset:
somali-1971-79.xml.bz2
2023-02-17 – 135.14 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2001 Somaliland
Corpus
Somali
Dataset:
somali-as-2001.xml.bz2
2021-08-27 – 113.01 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2001 Soomaaliya
Corpus
Somali
Dataset:
somali-2001.xml.bz2
2023-02-17 – 288.17 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2006 Itoobiya
Corpus
Somali
Dataset:
somali-itoobiya.xml.bz2
2017-06-28 – 125.93 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2010 Somaliland
Corpus
Somali
Dataset:
somali-hargeysa-2010.xml.bz2
2017-11-27 – 145.67 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2013 Somaliland
Corpus
Somali
Dataset:
somali-as-2013.xml.bz2
2019-02-18 – 59.78 KB – CC BY 4.0
Explore in:
Somali: Af-Soomaali 2018 Soomaaliya
Corpus
Somali
Dataset:
somali-as-2018.xml.bz2
2019-10-01 – 32.81 KB – CC BY 4.0
Explore in:
Somali: Afka Hooyo 1992-02 Kanada
Corpus
Somali
Dataset:
somali-ah-1992-02-kanada.xml.bz2
2017-01-30 – 2.99 KB – CC BY 4.0
Explore in:
Somali: Afka Hooyo 2010–19 Iswiidhan
Corpus
Somali
Dataset:
somali-ah-2010-19.xml.bz2
2021-08-30 – 65.49 KB – CC BY 4.0
Explore in:
Somali: BBC
Corpus
Somali
Dataset:
somali-bbc.xml.bz2
2017-05-31 – 181.65 KB – CC BY 4.0
Explore in:
Somali: Caafimaad 1972–79
Corpus
Somali
Dataset:
somali-caafimaad-1972-79.xml.bz2
2021-08-30 – 38.92 KB – CC BY 4.0
Explore in:
Somali: Caafimaad 1994
Corpus
Somali
Dataset:
somali-caafimaad-1994.xml.bz2
2017-05-16 – 24.79 KB – CC BY 4.0
Explore in:
Somali: Cilmi-Afeed
Corpus
Somali
Dataset:
somali-cilmi.xml.bz2
2021-08-27 – 683.26 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 1971–1980
Corpus
Somali
Dataset:
somali-cb.xml.bz2
2018-03-12 – 212.86 KB – CC BY 4.0
Word statistics:
stats_SOMALI-CB.txt
2020-02-25 – 265.81 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 1980-89
Corpus
Somali
Dataset:
somali-cb-1980-89.xml.bz2
2018-03-12 – 13.05 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 2001 Somaliland
Corpus
Somali
Dataset:
somali-hargeysa.xml.bz2
2017-09-20 – 72.85 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 2001-03 Soomaaliya
Corpus
Somali
Dataset:
somali-cb-2001-03-soomaaliya.xml.bz2
2021-08-27 – 159.48 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 2010 Somaliland
Corpus
Somali
Dataset:
somali-cb-2010.xml.bz2
2019-02-18 – 27.54 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 2011 Itoobiya
Corpus
Somali
Dataset:
somali-cb-2011.xml.bz2
2019-02-18 – 64 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 2016 Somaliland
Corpus
Somali
Dataset:
somali-cb-2016.xml.bz2
2021-08-27 – 179.66 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Bulshada 2018 Soomaaliya
Corpus
Somali
Dataset:
somali-cb-2018.xml.bz2
2019-10-01 – 77.07 KB – CC BY 4.0
Explore in:
Somali: Cilmiga Deegaanka 2012 Itoobiya
Corpus
Somali
Dataset:
somali-cd-2012-itoobiya.xml.bz2
2018-03-12 – 132.13 KB – CC BY 4.0
Explore in:
Somali: Golaha Wakiillada Somaliland
Corpus
Somali
Dataset:
somali-wakiillada.xml.bz2
2017-05-31 – 1.17 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2002
Corpus
Somali
Dataset:
somali-haatuf-news-2002.xml.bz2
2018-06-27 – 3.34 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2003
Corpus
Somali
Dataset:
somali-haatuf-news-2003.xml.bz2
2018-06-27 – 5.29 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2004
Corpus
Somali
Dataset:
somali-haatuf-news-2004.xml.bz2
2018-06-27 – 4.08 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2005
Corpus
Somali
Dataset:
somali-haatuf-news-2005.xml.bz2
2018-06-27 – 4.57 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2006
Corpus
Somali
Dataset:
somali-haatuf-news-2006.xml.bz2
2018-06-27 – 4.69 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2007
Corpus
Somali
Dataset:
somali-haatuf-news-2007.xml.bz2
2018-06-27 – 3.93 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2008
Corpus
Somali
Dataset:
somali-haatuf-news-2008.xml.bz2
2018-06-27 – 2.81 MB – CC BY 4.0
Explore in:
Somali: Haatuf News 2009
Corpus
Somali
Dataset:
somali-haatuf-news-2009.xml.bz2
2018-06-27 – 817.25 KB – CC BY 4.0
Explore in:
Somali: Kitaabka Quduuska Ah
Corpus
Somali
Dataset:
somali-kqa.xml.bz2
2016-09-29 – 1.67 MB – CC BY 4.0
Word statistics:
stats_SOMALI-KQA.txt
2020-02-25 – 957.24 KB – CC BY 4.0
Explore in:
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
Page
10
Page
11
Page
12
Page
13
Next page
Next ›
Last page
Last »
Close menu