Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Type Language Access
8 Sidor
News articles from 8 SIDOR.
Corpus Swedish
Academic texts: Humanities
A corpus with academic texts
Corpus Swedish
Academic texts: Social science
A corpus with academic texts
Corpus Swedish
Academic wordlist
Academic wordlist
Lexicon Swedish
Af Soomaali 1993-94
Corpus Somali
Af-Soomaali 2016 Somaliland
Corpus Somali
Aftonbladet 1830's
Part of the collection Kubhist2
Corpus Swedish
Agriculture
Agricultural manuals: "Engelska Åker-Mannen" and "En Grundelig Kundskap Om Swenska Åkerbruket"
Corpus Swedish
Applications
Anonymised job applications. The corpus is protected, contact Lena Rogström (lena.rogstroem@svenska.gu.se) for more information and access.
Corpus Swedish
Argumentation sentences 1.0
A translated corpus for classifying sentence stance in relation to a topic.
Corpus Swedish
Collection
ASPAC
The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Belarusian, Bulgarian, Czech, German, Lower Sorbian, Modern Greek (1453-), English, Spanish, French, Croatian, Upper Sorbian, Latin, Macedonian, Dutch, Polish, Portuguese, Romanian, Russian, Slovak, Slovenian, Serbian, Slavomolisano, Turkmen, Ukrainian
ASPAC: Swedish
The Swedish part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish
ASPAC: Swedish-Belarussian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Belarusian
ASPAC: Swedish-Bulgarian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Bulgarian
ASPAC: Swedish-Croatian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Croatian
ASPAC: Swedish-Czech
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Czech
ASPAC: Swedish-Dutch
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Dutch
ASPAC: Swedish-English
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, English
ASPAC: Swedish-French
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, French
ASPAC: Swedish-German
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, German
ASPAC: Swedish-Greek
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Modern Greek (1453-), Swedish
ASPAC: Swedish-Italian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Italian
ASPAC: Swedish-Latin
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Latin
ASPAC: Swedish-Lower Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Lower Sorbian
ASPAC: Swedish-Macedonian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Macedonian
ASPAC: Swedish-Molise Slavik
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Slavomolisano, Swedish
ASPAC: Swedish-Polish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Polish
ASPAC: Swedish-Portuguese
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Portuguese
ASPAC: Swedish-Romanian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Romanian
ASPAC: Swedish-Russian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Russian
ASPAC: Swedish-Serbian (cyrillic)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Serbian
ASPAC: Swedish-Serbian (latin)
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Serbian
ASPAC: Swedish-Slovak
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Slovak
ASPAC: Swedish-Slovene
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Slovenian
ASPAC: Swedish-Spanish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Spanish
ASPAC: Swedish-Turkmen
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Turkmen
ASPAC: Swedish-Ukrainian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Ukrainian
ASPAC: Swedish-Upper Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Upper Sorbian
ASU
Structural development of the second language
Corpus Swedish
August Strindberg's letters
Part of the collected works of August Strindberg
Corpus Swedish
August Strindberg's novels
Part of the collected works of August Strindberg
Corpus Swedish
Aventinus
Drug related terminology
Lexicon Swedish
Bellman
Collected works of C.M. Bellman
Corpus Swedish
Betänkande ang. läroböcker (1882)
A report from 1882, digitized by the Gothenburg University Library
Corpus Swedish
Biblioteksbladet
The earliest volumes of "Biblioteksbladet: Organ för Sveriges allmänna biblioteksförening" from 1916–1940, digitized by Project Runeberg
Corpus Swedish
Collection
Bicameral Riksdag
A collection of text documents from the Swedish bicameral parliament
Corpus Swedish
Bicameral riksdag: Government official investigations
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Letters of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Motions
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Narratives and accounts
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Propositions and letters
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Protocols
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Register
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Regulations
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: Reports, memorandums and opinions
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Bicameral riksdag: The constitution of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Blingbring
Blingbring, an enhanced and modernized version of Bring's thesaurus (1930)
Lexicon Swedish
Bliss
Blissymbolics is a constructed symbol language used mainly by people with severe communicative and physical disabilities. It consists of around 5000 graphical symbols.
Lexicon Blissymbols
Collection
Blog mix
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 1998
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 1999
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2000
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2001
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2002
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2003
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2004
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2005
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2006
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2007
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2008
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2009
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2010
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2011
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2012
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2013
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix 2014
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Blog mix unknown date
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Bloggmix 2015
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Bloggmix 2016
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Bloggmix 2017
Material from a selection of Swedish blogs. Regularly updated.
Corpus Swedish
Bonnier novels I (1976–77)
A corpus of 69 Bonnier novels from 1976–77
Corpus Swedish
Bonniers novels II (1980–81)
A corpus of 60 Bonnier novels from 1980–81
Corpus Swedish
Bring
A digital version of Bring's thesaurus (1930)
Lexicon Swedish
Caafimaad 1983
Corpus Somali
COCTAILL
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated.
Corpus Swedish
CoDeRooMor, v.01
Morphological dataset (word-building morphology), Swedish L2 profiles project
Lexicon Swedish
Constructicon
A Swedish Constructicon
Lexicon Swedish
Corpus word statistics
Accumulated word statistics from many of our modern Swedish corpora
Corpus
Dagens Arena
News texts from dagensarena.se
Corpus Swedish
DaLAJ-GED-SuperLim 2.0
Dataset for Linguistic Acceptability Judgments (and more), v.2.0
Corpus Swedish
Dalin Dictionary
Dalin's Dictionary of 19th century Swedish
Lexicon Swedish
Dalin Dictionary - Base Material
Dalin's Dictionary of 19th century Swedish - base material
Lexicon Swedish
Dalin's morphology
A morphology from Dalin's Dictionary of 19th century Swedish that is derived from Dalin's base material.
Lexicon Swedish
Dalin: Then Swänska Argus 1732-1734
Manual transcription of Then Swänska Argus by Olof von Dalin, Stockholm, 1732–1734. For OCR analysis.
Corpus Swedish
Dalpilen 1860's
Part of the collection Kubhist2
Corpus Swedish
Databank of 1977 Spanish Press
Text from two Spanish newspapers from 1977. Part of SOL - Spanish Online.
Corpus Spanish
Databank of Eleven Spanish Novels 1951–1971
Corpus consisting of 11 Spanish novels. Part of SOL - Spanish Online.
Corpus Spanish
Dependency parsing model: Stanza
Pretrained models for dependency parsing.
Model Swedish
Detective Department
Data from the Detective Department at the Gothenburg police, from late 1800s to early 1900s.
Corpus Swedish
Detektiva avdelningen
Corpus Swedish