Skip to main content

Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Type Language Access
Dependency parsing model: Stanza
Pretrained models for dependency parsing.
Model Swedish
Collection
Kubord-fasttext
A collection of fasttext models trained on modern newspaper texts from the National Library of Sweden
Model Swedish
Kubord-fasttext - Aftonbladet 2010–2022 - lemma
Fasttext model trained on Aftonbladet 2010–2022
Model Swedish
Kubord-fasttext - Aftonbladet 2010–2022 - token
Fasttext model trained on Aftonbladet 2010–2022
Model Swedish
Kubord-fasttext - Dagens Nyheter 2010–2022 - lemma
Fasttext model trained on Dagens Nyheter 2010–2022
Model Swedish
Kubord-fasttext - Dagens Nyheter 2010–2022 - token
Fasttext model trained on Dagens Nyheter 2010–2022
Model Swedish
Kubord-fasttext - Göteborgsposten 2013–2022 - lemma
Fasttext model trained on Göteborgsposten 2013–2022
Model Swedish
Kubord-fasttext - Göteborgsposten 2013–2022 - token
Fasttext model trained on Göteborgsposten 2013–2022
Model Swedish
Lemmatization model: Stanza
Pretrained model for lemmatization.
Model Swedish
POS-tagging model: Flair
Pretrained models for POS-tagging.
Model Swedish
POS-tagging model: Marmot
Pretrained models for POS-tagging.
Model Swedish
POS-tagging model: Stanza
Pretrained models for POS-tagging.
Model Swedish
Pretrained embeddings
A list of pretrained embeddings for Swedish
Model Swedish
Swedish Diachronic Word Embeddings
Swedish Diachronic Word Embedding Models Trained on Historical Newspaper Data
Model Swedish
Word Embeddings trained on English Wikipedia
Word Embeddings trained on English Wikipedia
Model English