Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Type Language Access
The Swedish Culturomics Gigaword Corpus
One billion Swedish words from 1950 onwards. Code for extracting data from the corpus, along with usage instructions, can be downloaded from https://svn.spraakdata.gu.se/sb-arkiv/tools/gigaword/
Corpus Swedish
Familjeliv: Delicate Room
Material from the Familjeliv internet forum.
Corpus Swedish
Flashback: Politics
Material from the Flashback internet forum.
Corpus Swedish
Flashback: Society
Material from the Flashback internet forum.
Corpus Swedish
Swedish Twitter 2016
Material collected from a selection of Swedish-speaking Twitter users in 2016.
Corpus Swedish
Familjeliv: Parent
Material from the Familjeliv internet forum.
Corpus Swedish
Swedish Twitter 2017
Material collected from a selection of Swedish-speaking Twitter users in 2017.
Corpus Swedish
Twitter Mix
Material from a selection of Swedish Twitter users. Updated regularly.
Corpus Swedish
Flashback: Science & Humanities
Material from the Flashback internet forum.
Corpus Swedish
Familjeliv: Pregnant
Material from the Familjeliv internet forum.
Corpus Swedish
Flashback: Culture & Media
Material from the Flashback internet forum.
Corpus Swedish
KBs digitaliserade SOU:er (1922–1996)
Swedish Government Official Reports (Statens offentliga utredningar, SOU) in digitized form from Kungliga biblioteket (https://sou.kb.se). The collection is not complete.
Corpus Swedish
Swedish Twitter 2015
Material collected from a selection of Swedish-speaking Twitter users in 2015.
Corpus Swedish
Riksdagens öppna data: Proposition
Government bills and written communications (propositioner och skrivelser)
Corpus Swedish
Flashback: Home, Housing & Family
Material from the Flashback internet forum.
Corpus Swedish
The Swedish Literature Bank: Free Works
E-texts and searchable facsimiles from the Swedish Literature Bank (litteraturbanken.se)
Corpus Swedish
Bicameral riksdag: Protocols
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Propositions and letters
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Flashback: Computers & IT
Material from the Flashback internet forum.
Corpus Swedish
Familjeliv: General Threads – Society
Material from the Familjeliv internet forum.
Corpus Swedish
Familjeliv: Member Threads – Expecting Children
Material from the Familjeliv internet forum.
Corpus Swedish
Familjeliv: Member Threads – Parents
Material from the Familjeliv internet forum.
Corpus Swedish
Familjeliv: Member Threads – General
Material from the Familjeliv internet forum.
Corpus Swedish
Riksdagens öppna data: Statens offentliga utredningar
Proposals to the government from various inquiries
Corpus Swedish
Flashback: Sports & Fitness
Material from the Flashback internet forum.
Corpus Swedish
Flashback: Drugs
Material from the Flashback internet forum.
Corpus Swedish
Riksdagens öppna data: Protokoll
Minutes from the sessions of the chamber
Corpus Swedish
DReaM-Copyright-Protected
A multilingual corpus of linguistic descriptions of the world's natural languages.
Corpus English
Riksdagens öppna data: Betänkande
Committee reports and statements, including the Riksdag's decisions, a summary of the voting results, and "Beslut i korthet" (decisions in brief)
Corpus Swedish
Familjeliv: Family Planning
Material from the Familjeliv internet forum.
Corpus Swedish
Bicameral riksdag: Reports, memorandums and opinions
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Swedish Wikipedia
Corpus of Swedish Wikipedia
Corpus Swedish
SemEval2020 Task 1
Swedish Test Data for SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection (extracts from Kubhist v2)
Corpus Swedish
Familjeliv: Difficult to get children
Material from the Familjeliv internet forum.
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1880's
Part of the collection Kubhist 2
Corpus Swedish
WordReference
A large corpus of native and non-native written speech in four languages.
Corpus English, Spanish, French, Italian
Familjeliv: Sex & Relationships
Material from the Familjeliv internet forum.
Corpus Swedish
Riksdagens öppna data: Motion
Motions from members of the Riksdag
Corpus Swedish
Flashback: Other
Material from the Flashback internet forum.
Corpus Swedish
Familjeliv: General Threads – Body & Soul
Material from the Familjeliv internet forum.
Corpus Swedish
Flashback: Economy
Material from the Flashback internet forum.
Corpus Swedish
Familjeliv: Member Threads – Family Planning
Material from the Familjeliv internet forum.
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1870's
Part of the collection Kubhist 2
Corpus Swedish
The Swedish Literature Bank: Restricted Works
E-texts and searchable facsimiles from the Swedish Literature Bank (litteraturbanken.se)
Corpus Swedish
Kubhist 2: Nya Dagligt Allehanda 1880's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Göteborgs Handels- och Sjöfartstidning 1880's
Part of the collection Kubhist 2
Corpus Swedish
The Riksdag's open data - Debates
Debates from the Swedish parliament in the period 1993/94-2017/18
Corpus Swedish
Flashback: Lifestyle
Material from the Flashback internet forum.
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1860's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Nya Dagligt Allehanda 1870's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Göteborgsposten 1880's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Göteborgs Handels- och Sjöfartstidning 1870's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1890's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Nya Dagligt Allehanda 1860's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Aftonbladet 1890's
Part of the collection Kubhist 2
Corpus Swedish
Blog mix 2011
Material from a selection of Swedish blogs. Updated regularly.
Corpus Swedish
Kubhist 2: Aftonbladet 1880's
Part of the collection Kubhist 2
Corpus Swedish
Blog mix 2010
Material from a selection of Swedish blogs. Updated regularly.
Corpus Swedish
Kubhist 2: Aftonbladet 1870's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Aftonbladet 1860's
Part of the collection Kubhist 2
Corpus Swedish
Flashback: Sex
Material from the Flashback internet forum.
Corpus Swedish
Familjeliv: General Threads – Entertainment
Material from the Familjeliv internet forum.
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1850's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Göteborgs Handels- och Sjöfartstidning 1890's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Göteborgs Handels- och Sjöfartstidning 1860's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Post- och Inrikes Tidningar 1880's
Part of the collection Kubhist 2
Corpus Swedish
Flashback: Vehicles & Traffic
Material from the Flashback internet forum.
Corpus Swedish
Kubhist 2: Nya Dagligt Allehanda 1890's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Aftonbladet 1850's
Part of the collection Kubhist 2
Corpus Swedish
Blog mix 2012
Material from a selection of Swedish blogs. Updated regularly.
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1840's
Part of the collection Kubhist 2
Corpus Swedish
Flashback: Food, Beverages & Tobacco
Material from the Flashback internet forum.
Corpus Swedish
Kubhist 2: Göteborgsposten 1870's
Part of the collection Kubhist 2
Corpus Swedish
Blog mix 2009
Material from a selection of Swedish blogs. Updated regularly.
Corpus Swedish
DReaM
A multilingual corpus of linguistic descriptions of the world's natural languages.
Corpus English
Kubord 1 - Word frequencies Dagens Nyheter 2000
Part of the collection Kubord 1
Corpus Swedish
Europarl: Swedish-French
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, French
Bicameral riksdag: Motions
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Kubhist 2: Post- och Inrikes Tidningar 1860's
Part of the collection Kubhist 2
Corpus Swedish
Europarl: Swedish-Portuguese
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Portuguese
Europarl: Swedish-Spanish
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Spanish
Familjeliv: General Threads – House & Home
Material from the Familjeliv internet forum.
Corpus Swedish
Kubord 1 - Word frequencies Dagens Nyheter 2001
Part of the collection Kubord 1
Corpus Swedish
Kubhist 2: Stockholms Dagblad 1830's
Part of the collection Kubhist 2
Corpus Swedish
Europarl: Swedish-Dutch
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Dutch
Europarl: Swedish-English
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, English
Europarl: Swedish-Italian
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Italian
Kubhist 2: Post- och Inrikes Tidningar 1870's
Part of the collection Kubhist 2
Corpus Swedish
Kubord 1 - Word frequencies Dagens Nyheter 2002
Part of the collection Kubord 1
Corpus Swedish
Europarl: Swedish-Danish
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Danish
Europarl: Swedish-German
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, German
Kubhist 2: Norrköpings Tidningar 1880's
Part of the collection Kubhist 2
Corpus Swedish
Kubhist 2: Göteborgsposten 1890's
Part of the collection Kubhist 2
Corpus Swedish
Familjeliv: General Threads – Familjeliv.se
Material from the Familjeliv internet forum.
Corpus Swedish
Kubhist 2: Aftonbladet 1840's
Part of the collection Kubhist 2
Corpus Swedish
Kubord 1 - Word frequencies Dagens Nyheter 2003
Part of the collection Kubord 1
Corpus Swedish
Blog mix 2013
Material from a selection of Swedish blogs. Updated regularly.
Corpus Swedish
Kubord 1 - Word frequencies Dagens Nyheter 2005
Part of the collection Kubord 1
Corpus Swedish
Bicameral riksdag: Narratives and accounts
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Kubord 1 - Word frequencies Dagens Nyheter 2004
Part of the collection Kubord 1
Corpus Swedish