Skip to main content

Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Type Language Access
Söderwall
Dictionary of Old Swedish
Lexicon Swedish
Corpus word statistics
Accumulated word statistics from many of our modern Swedish corpora
Corpus
Multilingual Constructicon
A multilingual Constructicon
Lexicon Swedish, Russian
Swedish analogy 2.0
Swedish semantic and syntactic similarity
Corpus Swedish
Collection
Kvinnotidningar
Material from historical women's periodicals
Corpus Swedish
Pretrained embeddings
A list of pretrained embeddings for Swedish
Model Swedish
Parole+
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information, partially linked to SALDO senses
Lexicon Swedish
Collection
Fornsvenska textbankens material
A collection of Old Swedish texts from Fornsvenska textbanken
Corpus Swedish
Collection
Flashback
Material from the Flashback internet forum
Corpus Swedish
Collection
SweLL-pilot
Essays written by adult learners of Swedish, manually anonymised and correction annotated. The corpus contains both the original learner text and a corrected version of each essay. Collection period 2006-2015.
Corpus Swedish
Bring
A digital version of Bring's thesaurus (1930)
Lexicon Swedish
Söderwall Supplement
Dictionary of Old Swedish
Lexicon Swedish
DaLAJ-GED-SuperLim 2.0
Dataset for Linguistic Acceptability Judgments (and more), v.2.0
Corpus Swedish
Russian Constructicon
A Russian Constructicon
Lexicon Russian, English
SweDN 1.0
A Swedish text summarization corpus
Corpus Swedish
Swedish Diachronic Word Embeddings
Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data
Model Swedish
SALDO
SALDO is an extensive lexicon resource for modern Swedish written language.
Lexicon Swedish
Argumentation sentences 1.0
A translated corpus for classifying sentence stance in relation to a topic.
Corpus Swedish
Collection
Göteborgsposten
A corpus with texts from the newspaper Göteborgs-Posten
Corpus Swedish
SweParaphrase 2.0
Semantic Textual Similarity reference data (STS Benchmark).
Corpus Swedish
CoDeRooMor, v.01
Morphological dataset (word-building morphology), Swedish L2 profiles project
Lexicon Swedish
Sports anglicisms
English loan-words in the Swedish sports press
Lexicon Swedish
ScandiSent
Sentiment Corpus for Swedish, Norwegian, Danish, Finnish and English crawled from trustpilot.
Corpus Swedish, Norwegian Bokmål, Danish, English, Finnish
LingFN
A Framenet for the linguistic domain
Lexicon Swedish
SweFAQ 2.0
Frequently asked questions from Swedish authorities' websites with shuffled answers
Corpus Swedish
Swedish EAT: question classification
A translated version of the QAQC dataset for expected-answer-type classification.
Corpus Swedish
Word Embeddings trained on English Wikipedia
Word Embeddings trained on English Wikipedia
Model English