Skip to main content

Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Sort ascending Type Language Access
Familjeliv: Adoption
Material from the Familjeliv internet forum.
Corpus Swedish
Collection
Familjeliv
Material from the Familjeliv internet forum
Corpus Swedish
Faluposten 1870's
Part of the collection Kubhist2
Corpus Swedish
Falköpings Tidning 1870's
Part of the collection Kubhist2
Corpus Swedish
Fahlu Weckoblad 1810's
Part of the collection Kubhist2
Corpus Swedish
Europarl: Swedish-Spanish
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Spanish
Europarl: Swedish-Portuguese
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Portuguese
Europarl: Swedish-Italian
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Italian
Europarl: Swedish-Greek
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Modern Greek (1453-)
Europarl: Swedish-German
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, German
Europarl: Swedish-French
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, French
Europarl: Swedish-Finnish
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Finnish
Europarl: Swedish-English
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, English
Europarl: Swedish-Dutch
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Dutch
Europarl: Swedish-Danish
Part of European Parliament Proceedings Parallel Corpus
Corpus Swedish, Danish
Europarl: Swedish
The Swedish part of European Parliament Proceedings Parallel Corpus
Corpus Swedish
Collection
Europarl
European Parliament Proceedings Parallel Corpus
Corpus Swedish, Danish, German, Modern Greek (1453-), English, Spanish, Finnish, French, Italian, Dutch, Portuguese
Eukalyptus Treebank of Written Swedish
A treebank with written Swedish data, with parts-of-speech, TIGER-style syntax, multiword expressions and sense annotation
Corpus Swedish
Ethnological question lists
Question lists of the Nordic Museum
Corpus Swedish
Ekeblad's letters
Ekeblad's letter corpus is based on the digital edition Breven till Claes 1639–1655 by Sture Allén.
Corpus Swedish
DReaM-Copyright-Protected
A multilingual corpus of linguistic descriptions of the world's natural languages.
Corpus English
DReaM
A multilingual corpus of linguistic descriptions of the world's natural languages.
Corpus English
Dramawebben (demo)
Texts from Dramawebben, a digital archive of free Swedish drama.
Corpus Swedish
DN 1987
Dagens Nyheter 1987
Corpus Swedish
Diverse tidningar
Fourteen annual volumes of eight different periodicals (1810–1933) digitized by Project Runeberg
Corpus Swedish
Diachronic pivot
Diachronic pivot resource where historical lexical information is linked to SALDO
Lexicon Swedish
DiabetologNytt (1996–1999)
The paper DiabetologNytt (Diabetologynews) 1996-1999
Corpus Swedish
Detektiva avdelningen
Corpus Swedish
Detective Department
Data from the Detective Department at the Gothenburg police, from late 1800s to early 1900s.
Corpus Swedish
Dependency parsing model: Stanza
Pretrained models for dependency parsing.
Model Swedish
Databank of Eleven Spanish Novels 1951–1971
Corpus consisting of 11 Spanish novels. Part of SOL - Spanish Online.
Corpus Spanish
Databank of 1977 Spanish Press
Text from two Spanish newspapers from 1977. Part of SOL - Spanish Online.
Corpus Spanish
Dalpilen 1860's
Part of the collection Kubhist2
Corpus Swedish
Dalin: Then Swänska Argus 1732-1734
Manual transcription of Then Swänska Argus by Olof von Dalin, Stockholm, 1732–1734. For OCR analysis.
Corpus Swedish
Dalin's morphology
A morphology from Dalin's Dictionary of 19th century Swedish that is derived from Dalin's base material.
Lexicon Swedish
Dalin Dictionary - Base Material
Dalin's Dictionary of 19th century Swedish - base material
Lexicon Swedish
Dalin Dictionary
Dalin's Dictionary of 19th century Swedish
Lexicon Swedish
DaLAJ-GED-SuperLim 2.0
Dataset for Linguistic Acceptability Judgments (and more), v.2.0
Corpus Swedish
Dagens Arena
News texts from dagensarena.se
Corpus Swedish
Corpus word statistics
Accumulated word statistics from many of our modern Swedish corpora
Corpus
Constructicon
Swedish Constructicon
Lexicon Swedish
CoDeRooMor, v.01
Morphological dataset (word-building morphology), Swedish L2 profiles project
Lexicon Swedish
COCTAILL
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated.
Corpus Swedish
Caafimaad 1983
Corpus Somali
Bring
A digital version of Bring's thesaurus (1930)
Lexicon Swedish
Bonniers novels II (1980–81)
A corpus of 60 Bonnier novels from 1980–81
Corpus Swedish
Bonnier novels I (1976–77)
A corpus of 69 Bonnier novels from 1976–77
Corpus Swedish
Bloggmix 2017
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Bloggmix 2016
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Bloggmix 2015
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix unknown date
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2014
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2013
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2012
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2011
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2010
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2009
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2008
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2007
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2006
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2005
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2004
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2003
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2002
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2001
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 2000
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 1999
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Blog mix 1998
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Collection
Blog mix
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Bliss
Blissymbolics is a constructed symbol language which is mainly used by people with severe communicative and phsyical disabilities. It consists of around 5000 graphical symbols.
Lexicon Blissymbols
Blingbring
Blingbring, an enhanced and modernized version of Bring's thesaurus (1930)
Lexicon Swedish
Bicameral riksdag: The constitution of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Reports, memorandums and opinions
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Regulations
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Register
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Protocols
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Propositions and letters
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Narratives and accounts
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Motions
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Letters of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Government official investigations
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Collection
Bicameral Riksdag
Collection of textual documents from the Swedish bicameral parliament data
Corpus Swedish
Biblioteksbladet
The earliest volumes of "Biblioteksbladet: Organ för Sveriges allmänna biblioteksförening" from 1916–1940, digitized by Project Runeberg
Corpus Swedish
Betänkande ang. läroböcker (1882)
A report from 1882, digitized by the Gothenburg University Library
Corpus Swedish
Bellman
Collected works of C.M. Bellman
Corpus Swedish
Aventinus
Drug related terminology
Lexicon Swedish
August Strindberg's novels
Part of the collected works of August Strindberg
Corpus Swedish
August Strindberg's letters
Part of the collected works of August Strindberg
Corpus Swedish
ASU
Structural development of the second language
Corpus Swedish
ASPAC: Swedish-Upper Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Upper Sorbian
ASPAC: Swedish-Ukrainian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Ukrainian
ASPAC: Swedish-Turkmen
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Turkmen
ASPAC: Swedish-Spanish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Spanish
ASPAC: Swedish-Slovene
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Slovenian
ASPAC: Swedish-Slovak
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Slovak
ASPAC: Swedish-Serbian (latin)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Serbian
ASPAC: Swedish-Serbian (cyrillic)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Kele (Papua New Guinea), Swedish
ASPAC: Swedish-Russian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Russian
ASPAC: Swedish-Romanian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Romanian
ASPAC: Swedish-Portuguese
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Portuguese