Skip to main content

Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Explore Type Language Download
8 Sidor
News articles from 8 SIDOR.
Korp
Corpus Swedish
Academic texts: Humanities
A corpus with academic texts
Korp
Corpus Swedish
Academic texts: Social science
A corpus with academic texts
Korp
Corpus Swedish
Academic wordlist
Academic wordlist
Karp
Lexicon Swedish
Agriculture
Agricultural manuals: "Engelska Åker-Mannen" and "En Grundelig Kundskap Om Swenska Åkerbruket"
Korp
Corpus Swedish
Applications
Anonymised job applications. The corpus is protected, contact Lena Rogström (lena.rogstroem@svenska.gu.se) for more information and access.
Korp
Corpus Swedish
Argumentation sentences 1.0
A translated corpus for classifying sentence stance in relation to a topic.
Corpus Swedish
Collection
ASPAC
The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Belarusian, Bulgarian, Czech, German, Lower Sorbian, Modern Greek (1453-), English, Spanish, French, Croatian, Upper Sorbian, Latin, Macedonian, Dutch, Polish, Portuguese, Romanian, Russian, Kele (Papua New Guinea), Slovak, Slovenian, Serbian, Slavomolisano, Turkmen, Ukrainian
ASPAC: Swedish
The Swedish part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish
ASPAC: Swedish-Belarussian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Belarusian
ASPAC: Swedish-Bulgarian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Bulgarian
ASPAC: Swedish-Croatian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Croatian
ASPAC: Swedish-Czech
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Czech
ASPAC: Swedish-Dutch
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Dutch
ASPAC: Swedish-English
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, English
ASPAC: Swedish-French
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, French
ASPAC: Swedish-German
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, German
ASPAC: Swedish-Greek
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Modern Greek (1453-), Swedish
ASPAC: Swedish-Italian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Italian
ASPAC: Swedish-Latin
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Latin
ASPAC: Swedish-Lower Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Lower Sorbian
ASPAC: Swedish-Macedonian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Macedonian
ASPAC: Swedish-Molise Slavik
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Slavomolisano, Swedish
ASPAC: Swedish-Polish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Polish
ASPAC: Swedish-Portuguese
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Portuguese
ASPAC: Swedish-Romanian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Romanian
ASPAC: Swedish-Russian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Russian
ASPAC: Swedish-Serbian (cyrillic)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Kele (Papua New Guinea), Swedish
ASPAC: Swedish-Serbian (latin)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Serbian
ASPAC: Swedish-Slovak
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Slovak
ASPAC: Swedish-Slovene
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Slovenian
ASPAC: Swedish-Spanish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Spanish
ASPAC: Swedish-Turkmen
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Turkmen
ASPAC: Swedish-Ukrainian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Ukrainian
ASPAC: Swedish-Upper Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Korp
Corpus Swedish, Upper Sorbian
ASU
Structural development of the second language
Korp
Corpus Swedish
August Strindberg's letters
Part of the collected works of August Strindberg
Korp
Corpus Swedish
August Strindberg's novels
Part of the collected works of August Strindberg
Korp
Corpus Swedish
Aventinus
Drug related terminology
Lexicon Swedish
Bellman
Collected works of C.M. Bellman
Korp
Corpus Swedish
Betänkande ang. läroböcker (1882)
A report from 1882, digitized by the Gothenburg University Library
Korp
Corpus Swedish
Biblioteksbladet
The earliest volumes of "Biblioteksbladet: Organ för Sveriges allmänna biblioteksförening" from 1916–1940, digitized by Project Runeberg
Korp
Corpus Swedish
Blingbring
Blingbring, an enhanced and modernized version of Bring's thesaurus (1930)
Karp
Lexicon Swedish
Bliss
Blissymbolics is a constructed symbol language which is mainly used by people with severe communicative and phsyical disabilities. It consists of around 5000 graphical symbols.
Karp
Lexicon Blissymbols
Collection
Blog mix
Material from a selection of Swedish blogs. Regularly updated.
Korp
Corpus Swedish
Blog mix 1998
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 1999
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2000
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2001
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2002
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2003
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2004
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2005
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2006
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2007
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2008
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2009
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2010
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2011
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2012
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2013
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix 2014
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Blog mix unknown date
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Bloggmix 2015
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Bloggmix 2016
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Bloggmix 2017
Material from a selection of Swedish blogs. Is updated regularly.
Korp
Corpus Swedish
Bonnier novels I (1976–77)
A corpus of 69 Bonnier novels from 1976–77
Korp
Corpus Swedish
Bonniers novels II (1980–81)
A corpus of 60 Bonnier novels from 1980–81
Korp
Corpus Swedish
Bring
A digital version of Bring's thesaurus (1930)
Lexicon Swedish
COCTAILL
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated.
Korp
Corpus Swedish
CoDeRooMor, v.01
Morphological dataset (word-building morphology), Swedish L2 profiles project
Lärka
Lexicon Swedish
Constructicon
A Swedish Constructicon
Karp
Lexicon Swedish
Dagens Arena 2007–2022
News texts from dagensarena.se
Korp
Corpus Swedish
DaLAJ-GED-SuperLim 2.0
Dataset for Linguistic Acceptability Judgments (and more), v.2.0
Corpus Swedish
Dalin Dictionary
Dalin's Dictionary of 19th century Swedish
Karp
Lexicon Swedish
Dalin Dictionary - Base Material
Dalin's Dictionary of 19th century Swedish - base material
Karp
Lexicon Swedish
Dalin's morphology
A morphology from Dalin's Dictionary of 19th century Swedish that is derived from Dalin's base material.
Karp
Lexicon Swedish
Dalin: Then Swänska Argus 1732-1734
Manual transcription of Then Swänska Argus by Olof von Dalin, Stockholm, 1732–1734. For OCR analysis.
Corpus Swedish
Databank of 1977 Spanish Press
Text from two Spanish newspapers from 1977. Part of SOL - Spanish Online.
Korp
Corpus Spanish
Databank of Eleven Spanish Novels 1951–1971
Corpus consisting of 11 Spanish novels. Part of SOL - Spanish Online.
Korp
Corpus Spanish
Dependency parsing model: Stanza
Pretrained models for dependency parsing.
Model Swedish
Detective Department
Data from the Detective Department at the Gothenburg police, from late 1800s to early 1900s.
Corpus Swedish
DiabetologNytt (1996–1999)
The paper DiabetologNytt (Diabetologynews) 1996-1999
Korp
Corpus Swedish
Diachronic pivot
Diachronic pivot resource where historical lexical information is linked to SALDO
Karp
Lexicon Swedish
Diverse tidningar
Fourteen annual volumes of eight different periodicals (1810–1933) digitized by Project Runeberg
Korp
Corpus Swedish
DN 1987
Dagens Nyheter 1987
Korp
Corpus Swedish
Dramawebben (demo)
Texts from Dramawebben, a digital archive of free Swedish drama.
Korp
Corpus Swedish
DReaM
A multilingual corpus of linguistic descriptions of the world's natural languages.
Korp
Corpus English
DReaM-Copyright-Protected
A multilingual corpus of linguistic descriptions of the world's natural languages.
Corpus English
Ekeblad's letters
Ekeblad's letter corpus is based on the digital edition Breven till Claes 1639–1655 by Sture Allén.
Korp
Corpus Swedish
Ethnological question lists
Question lists of the Nordic Museum
Korp
Corpus Swedish
Eukalyptus Treebank of Written Swedish
A treebank with written Swedish data, with parts-of-speech, TIGER-style syntax, multiword expressions and sense annotation
Corpus Swedish
Collection
Europarl
European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, Danish, German, Modern Greek (1453-), English, Spanish, Finnish, French, Italian, Dutch, Portuguese
Europarl: Swedish
The Swedish part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish
Europarl: Swedish-Danish
Part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, Danish
Europarl: Swedish-Dutch
Part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, Dutch
Europarl: Swedish-English
Part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, English
Europarl: Swedish-Finnish
Part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, Finnish
Europarl: Swedish-French
Part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, French
Europarl: Swedish-German
Part of European Parliament Proceedings Parallel Corpus
Korp
Corpus Swedish, German