Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Type Language Access
Collection
ASPAC
The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Belarusian, Bulgarian, Czech, German, Lower Sorbian, Modern Greek (1453-), English, Spanish, French, Croatian, Upper Sorbian, Latin, Macedonian, Dutch, Polish, Portuguese, Romanian, Russian, Kele (Papua New Guinea), Slovak, Slovenian, Serbian, Slavomolisano, Turkmen, Ukrainian
Collection
Bicameral Riksdag
Collection of textual documents from the Swedish bicameral parliament
Corpus Swedish
Collection
Blog mix
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Collection
Europarl
European Parliament Proceedings Parallel Corpus
Corpus Swedish, Danish, German, Modern Greek (1453-), English, Spanish, Finnish, French, Italian, Dutch, Portuguese
Collection
Familjeliv
Material from the Familjeliv internet forum
Corpus Swedish
Collection
Finland Swedish
Part of the Finland Swedish language bank, covering Swedish in Finland past and present
Corpus Swedish
Collection
Flashback
Material from the Flashback internet forum
Corpus Swedish
Collection
Fornsvenska textbankens material
A collection of Old Swedish texts from Fornsvenska textbanken
Corpus Swedish
Collection
Göteborgsposten
A corpus with texts from the newspaper Göteborgs-Posten
Corpus Swedish
Collection
Kubhist
Diachronic collection of Swedish historical newspaper texts from the period of 1749–1926
Corpus Swedish
Collection
Kubhist 2
Diachronic collection of Swedish historical newspaper texts from the period of 1645–1926. Kubhist 2 is an updated version of Kubhist with improved OCR and more material.
Corpus Swedish
Collection
Kubord 1
Word frequencies from modern newspaper texts from the National Library of Sweden
Corpus Swedish
Collection
Kubord 2
Word relations from modern newspaper texts from the National Library of Sweden
Corpus Swedish
Collection
Kvinnotidningar
Material from historical women's periodicals
Corpus Swedish
Collection
Läkartidningen medical journal
Corpus of specialized health care language
Corpus Swedish
Collection
Medieval letters
Swedish medieval charters from Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus Latin, German, Norwegian, Swedish
Collection
NPEGL
The Noun Phrases in Early Germanic Languages database.
Lexicon Old English (ca. 450-1100), Old High German (ca. 750-1050), Old Norse, Old Saxon
Collection
Old Finland Swedish
Part of the Finland Swedish language bank, covering Swedish in Finland past and present
Corpus Swedish
Collection
Press
Swedish press
Corpus Swedish
Collection
Riksdagens öppna data
Data from the Swedish parliament collected from data.riksdagen.se
Corpus Swedish
Collection
Somali corpora
A collection of Somali corpora
Corpus Somali
Collection
SuperLim 2
A standardized suite for evaluation and analysis of Swedish natural language understanding systems.
Corpus Swedish
Collection
SVT news
News texts from svt.se
Corpus Swedish
Collection
SweLL-pilot
Essays written by adult learners of Swedish, manually anonymised and correction-annotated. The corpus contains both the original learner text and a corrected version of each essay. Collection period 2006–2015.
Corpus Swedish
Collection
Web News
News from Swedish newspapers' websites
Corpus Swedish