Skip to main content

Resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Explore Type Language Download
SemEval 2020 Task 1 (repackaged for SuperLim)
Swedish Test Data for SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection
Corpus Swedish
SemEval2020 Task 1
Swedish Test Data for SemEval 2020 Task 1: Unsupervised Lexical Semantic Change Detection (extracts from Kubhist v2)
Corpus Swedish
SenSALDO
SenSALDO, SALDO entries and text word forms with sentiment information (prior polarity)
Lexicon Swedish
Sentiment Lexicon
Lexicon Swedish
Sheekooyin Carruureed
Korp
Corpus Somali
Sheekooyin Carruureed (Turjuman)
Korp
Corpus Somali
Sheekooyin Gaagaaban
Korp
Corpus Somali
Sibirian-German
Korp
Corpus Swedish
Sibirientyska kvinnor
Korp
Corpus Swedish
SIC2 - Stockholm Internet Corpus
The Stockholm Internet Corpus (SIC2) contains Swedish blog posts, annotated with part of speech, morphological features, and named entities.
Korp
Corpus Swedish
Simple lexicon
The Swedish SIMPLE Lexicon - A language technology resource with access to semantic information in Swedish
Lexicon Swedish
Simple+
The Swedish SIMPLE Lexicon - A language technology resource with access to semantic information in Swedish, connected to SALDO senses
Karp
Lexicon Swedish
SKBL
The Biographical Dictionary of Swedish Women
Karp
Lexicon Swedish, English
Skriftliga frågor
Part of the Riksdag's open data
Korp
Corpus Swedish
Smittskydd
The newspaper Smittskydd by Smittskyddsinstitutet (Swedish Institute for Communicable Disease Control)
Korp
Corpus Swedish
SNP 1978-79
Swedish parliament proceedings
Korp
Corpus Swedish
Söderwall
Dictionary of Old Swedish
Karp
Lexicon Swedish
Söderwall Supplement
Dictionary of Old Swedish
Karp
Lexicon Swedish
Somali Wikipedia
Corpus of Somali Wikipedia
Korp
Corpus Somali
Spanska Flugan 1839–1841
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Korp
Corpus Swedish
Sports anglicisms
English loan-words in the Swedish sports press
Lexicon Swedish
Språkprov SO 2009
Korp
Corpus Swedish
Statens offentliga utredningar
Korp
Corpus Swedish
Statens offentliga utredningar
Part of the Riksdag's open data
Korp
Corpus Swedish
Stockholms Dagblad 1820's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1830's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1840's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1850's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1860's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1870's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1880's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms Dagblad 1890's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholms stads tänkeböcker
Korp
Corpus Swedish
Stockholmsposten 1770's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholmsposten 1770's
A corpus with texts from Stockholmsposten in the 1770's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1780's
A corpus with texts from Stockholmsposten in the 1780's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1780's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholmsposten 1790's
A corpus with texts from Stockholmsposten in the 1790's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1790's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholmsposten 1800's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholmsposten 1800's
A corpus with texts from Stockholmsposten in the 1800's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1810's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholmsposten 1810's
A corpus with texts from Stockholmsposten in the 1810's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1820's
A corpus with texts from Stockholmsposten in the 1820's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1820's
Part of the collection Kubhist2
Korp
Corpus Swedish
Stockholmsposten 1830's
A corpus with texts from Stockholmsposten in the 1830's; part of the Kubhist corpus
Korp
Corpus Swedish
Stockholmsposten 1830's
Part of the collection Kubhist2
Korp
Corpus Swedish
Studentbladet 2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Korp
Corpus Swedish
SUC 2.0
Stockholm-Umeå corpus 2.0
Corpus Swedish
SUC 3.0
Stockholm-Umeå corpus 3.0
Korp
Corpus Swedish
SUC Novels (StorSUC)
Stockholm-Umeå corpus
Korp
Corpus Swedish
SUCX 2.0
Stockholm-Umeå corpus 2.0 scrambled
Korp
Corpus Swedish
SUCX 3.0
Stockholm-Umeå corpus 3.0 scrambled
Korp
Corpus Swedish
Sundsvalls Tidning 1880's
Part of the collection Kubhist2
Korp
Corpus Swedish
Sundsvalls Tidning 1890's
Part of the collection Kubhist2
Korp
Corpus Swedish
Sundsvalls Tidning Norrländska Korrespondenten 1870's
Part of the collection Kubhist2
Korp
Corpus Swedish
SuperSim (repackaged for Superlim)
A test set for word similarity and relatedness in Swedish
Corpus Swedish
Suugaan
Korp
Corpus Somali
Suugaan (Turjuman)
Korp
Corpus Somali
Suugaan 2
Korp
Corpus Somali
sv-COVID-19
Miscellaneous articles related to corona.
Korp
Corpus Swedish
Svensk Tidskrift
Korp
Corpus Swedish
Svenskbygden 2010-2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Korp
Corpus Finland Swedish
SVT news 2004
News texts from svt.se
Korp
Corpus Swedish
SVT news 2005
News texts from svt.se
Korp
Corpus Swedish
SVT news 2006
News texts from svt.se
Korp
Corpus Swedish
SVT news 2007
News texts from svt.se
Korp
Corpus Swedish
SVT news 2008
News texts from svt.se
Korp
Corpus Swedish
SVT news 2009
News texts from svt.se
Korp
Corpus Swedish
SVT news 2010
News texts from svt.se
Korp
Corpus Swedish
SVT news 2011
News texts from svt.se
Korp
Corpus Swedish
SVT news 2012
News texts from svt.se
Korp
Corpus Swedish
SVT news 2013
News texts from svt.se
Korp
Corpus Swedish
SVT news 2014
News texts from svt.se
Korp
Corpus Swedish
SVT news 2015
News texts from svt.se
Korp
Corpus Swedish
SVT news 2016
News texts from svt.se
Korp
Corpus Swedish
SVT news 2017
News texts from svt.se
Korp
Corpus Swedish
SVT news 2018
News texts from svt.se
Korp
Corpus Swedish
SVT news 2019
News texts from svt.se
Korp
Corpus Swedish
SVT news 2020
News texts from svt.se
Korp
Corpus Swedish
SVT news 2021
News texts from svt.se
Korp
Corpus Swedish
SVT news 2022
News texts from svt.se
Korp
Corpus Swedish
SVT news okänt datum
News texts from svt.se
Korp
Corpus Swedish
SW1203-essays
Essays written by L2 Swedish language learners, university courses
Korp
Corpus Swedish
Swedberg's Swensk Ordabok
Swedberg's Swensk Ordabok
Karp
Lexicon Swedish, Latin
Swedberg's Swensk Ordabok (morphology, rudimentary)
Swedberg's Swensk Ordabok (morphology, rudimentary)
Karp
Lexicon Swedish
SweDiagnostics
Swedish version (Super)GLUE Diagnostic
Corpus Swedish, English
Swedish ABSAbank
An annotated Swedish corpus for aspect-based sentiment analysis
Corpus Swedish
Swedish ABSAbank-Imm 1.0
An annotated Swedish corpus for aspect-based sentiment analysis (a version of Absabank)
Corpus Swedish
Swedish analogy test set v1.0
Swedish semantic and syntactic similarity: test set
Corpus Swedish
Swedish Bible 1873
Korp
Corpus Swedish
Swedish Bible 1917
Korp
Corpus Swedish
Swedish Diachronic Word Embeddings
Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data
Model
Swedish FAQ (mismatched) 1.0
Frequently asked questions from Swedish authorities' websites with shuffled answers
Corpus Swedish
Swedish fraktur 1626-1816
A selection of fraktur texts printed between 1626 and 1816 from the collections of the University Library of University of Gothenburg (UB). For OCR analysis.
Corpus Swedish
Swedish FrameNet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This part of the resource contains the frames and the manually annotated semantic content.
Karp
Lexicon Swedish
Swedish framenet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This part of the resource contains the corpus examples, automatically enriched with linguistic information.
Korp
Corpus Swedish
Swedish FrameNet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. It is the complete resource containing both the semantic and the syntactic content.
Lexicon Swedish
Swedish newspapers 1818-1870
A selection of Swedish newspapers printed between 1818 and 1870 from the collections of Kungliga biblioteket (KB). For OCR analysis.
Corpus Swedish
Swedish newspapers 1871-1906
A selection of Swedish newspapers printed between 1871 and 1906 from the collections of Kungliga biblioteket (KB). For OCR analysis.
Corpus Swedish