Skip to main content

Language resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.
Resource Type Language Access
Fornsvenska textbankens material: Verser
Corpus Swedish
Somali: Sheekooyin Gaagaaban
Corpus Somali
Finland Swedish: Bullen 2010–2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Medieval letters: German
Charters written in German, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus German
Old Finland Swedish: Finsk Tidskrift 1900–1912
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Diaries 1700-tal
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Psalm book (1937)
The Swedish psalm book from 1937
Corpus Swedish
Finland Swedish: Svenskbygden 2010–2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
Fornsvenska textbankens material: Yngre tänkeböcker
Corpus Swedish
Finland Swedish: Finsk tidskrift 2011–2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Finland Swedish: Astra 1960–1979
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Somali: Suugaan
Corpus Somali
Finland Swedish: Studentbladet 2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Kubhist 2: Carlscronas Tidningar 1760's
Part of the collection Kubhist2
Corpus Swedish
Swe-NERC
A resource for training and evaluation of Named Entity Recognition for Swedish.
Corpus Swedish
Swedish framenet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This part of the resource contains the corpus examples, automatically enriched with linguistic information.
Corpus Swedish
ASPAC: Swedish-Latin
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Latin
Kubhist 2: Folkets Röst 1840's
Part of the collection Kubhist 2
Corpus Swedish
Somali: Af-Soomaali 2001 Soomaaliya
Corpus Somali
Old Finland Swedish: Argus 1907–1911
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Ateneum 1898–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Kubhist 2: Norrköpings Weckotidningar 1750's
Part of the collection Kubhist 2
Corpus Swedish
Finland Swedish: Hanken 2008–2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
Old Finland Swedish: Diaries 1800–1849
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Memory books of the city of Stockholm
Minutes and memorial notes from Stockholm City Hall, 1626.
Corpus Swedish
Fornsvenska textbankens material: Yngre lagar
Corpus Swedish
Finland Swedish: Fanbäraren 2011–2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Euterpe 1900–1905
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Syntag treebank
A Swedish treebank with syntactic analysis of 158 articles from Press-65.
Corpus Swedish
Eukalyptus Treebank of Written Swedish
A treebank with written Swedish data, with parts-of-speech, TIGER-style syntax, multiword expressions and sense annotation
Corpus Swedish
Laws of 1734
Materialet utgörs av balkarna i själva lagtexten, förordet samt domarreglerna. Materialet är inskrivet för hand och korrekturläst, men en del fel finns fortfarande kvar.
Corpus Swedish
KVAH
Kungl. Vetenskapsakademiens Handlingar
Corpus Swedish
Old Finland Swedish: Letters 1850–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Kubhist 2: Carlscronas Wekoblad 1750's
Part of the collection Kubhist 2
Corpus Swedish
TalbankenSBX
Talbanken is a Swedish treebank. This is the Språkbanken Text version of Talbanken.
Corpus Swedish
TalbankenSTB
Talbanken is a Swedish treebank.
Corpus Swedish
Old Finland Swedish: Husmodern 1903–1912
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
ASPAC: Swedish-Romanian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Romanian
ASPAC: Swedish-Italian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Italian
Agriculture
Agricultural manuals: "Engelska Åker-Mannen" and "En Grundelig Kundskap Om Swenska Åkerbruket"
Corpus Swedish
Finland Swedish: Vasabladet 2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
Riksdagens öppna data: Sammanträden
Corpus Swedish
Finland Swedish: Pargas Kungörelser 2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
ASPAC: Swedish-Upper Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Upper Sorbian
Old Finland Swedish: Non-fiction 1700–1749
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Bicameral riksdag: The constitution of the Riksdag
Part of the data set "Bicameral Riksdag"
Corpus Swedish
Old Finland Swedish: Filosofia.fi 1900–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Letters 1900–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Somali: Cilmiga Bulshada 1971–1980
Corpus Somali
Kubhist 2: Posttidningar 1640's
Part of the collection Kubhist 2
Corpus Swedish
Old Finland Swedish: Letters 1700-tal
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Filosofia.fi 1850–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Ateneum 1900–1903
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Nils Matsson Kiöping's journeys
Reseskildringar från 1674 och 1743
Corpus Swedish
Old Finland Swedish: Literary magazine published in Helsingfors
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Wasabladet 1866–1896
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Helsingfors Dagblad 1866–1886
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Letters 1800–1849
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Old Finland Swedish: Typograftidning 1889–1890
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
ASPAC: Swedish-Spanish
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Spanish
Old Finland Swedish: Åbo Tidning 1883–1903
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
TISUS texts
Essays written by L2 Swedish learners as part of a TISUS exam
Corpus Swedish
Finland Swedish: Magma columns 2009–2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
Old Finland Swedish: Hufvudstadsbladet 1893–1903
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Somali: Cilmiga Bulshada 2016 Somaliland
Corpus Somali
Ekeblad's letters
Ekeblad's letter corpus is based on the digital edition Breven till Claes 1639–1655 by Sture Allén.
Corpus Swedish
Old Finland Swedish: Landtmannen 1877–1879
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
SW1203-essays
Essays written by L2 Swedish language learners, university courses
Corpus Swedish
Förvaltningsmyndigheters texter
Corpus Swedish
Af-Soomaali 2016 Somaliland
Corpus Somali
InterFra Swedish
To promote research in the field of French L2 second language acquisition in a developmental, interactional and variationist perspective. The HLP (High Level Proficiency in Second language use) project also investigates learners of other L2s such as Swedish, Spanish, English and Italian.
Corpus Swedish
Somali: Af Soomaali 1971-79
Corpus Somali
Somali: Xisaab 2001 Soomaaliya
Corpus Somali
Interrogations
The corpus is protected, contact Ylva Burman (ylva.byrman@svenska.gu.se) for more information and access.
Corpus Swedish
Somali: Cilmiga Bulshada 2001-03 Soomaaliya
Corpus Somali
Swedish fraktur 1626-1816
A selection of fraktur texts printed between 1626 and 1816 from the collections of the University Library of University of Gothenburg (UB). For OCR analysis.
Corpus Swedish
Old Finland Swedish: Frågebrevsvar 1900–1949
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
SpIn v1
256 essays collected from Language Introduction course (mid-term exams) for newly arrived refugees. Some of the students are recurrent.
Corpus Swedish
Sæmundaredda
Ancient Icelandic poetry collection also known as The King's Book
Corpus Old Norse
Fornsvenska textbankens material: Nysvenska bibelböcker
Corpus Swedish
Old Finland Swedish: Spanska Flugan 1839–1841
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
Somali: Xisaab 2016 Somaliland
Corpus Somali
Betänkande ang. läroböcker (1882)
A report from 1882, digitized by the Gothenburg University Library
Corpus Swedish
Finland Swedish: Syd-Österbotten 2013
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
Medieval letters: Other languages
Charters written in other languages, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus Swedish
Old Finland Swedish: Åland 1891–1911
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
SVT news unknown date
News texts from svt.se
Corpus Swedish
ASPAC: Swedish-Lower Sorbian
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Lower Sorbian
Somali: Taariikh iyo Dhaqan (Turjuman)
Corpus Somali
ASPAC: Swedish-Molise Slavik
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Slavomolisano, Swedish
Somali: Af-Soomaali 2001 Somaliland
Corpus Somali
Sibirian-German
Siberian German is transcribed German spoken of about 36 000 people in the region of Krasnoyarsk in Siberia (Russia).
Corpus Swedish
Old Finland Swedish: Björneborgs Tidning 1897–1907
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Swedish
MAÞiR Trees
An Old Swedish treebank, with lemma, parts-of-speech, and PROIEL-style dependency syntax.
Corpus Swedish
Somali: Saynis 1980–89
Corpus Somali
Finland Swedish: Österbottens tidning 2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus Finland Swedish
ASPAC: Swedish-Turkmen
Part of The Amsterdam Slavic Parallel Aligned Corpus
Corpus Swedish, Turkmen
Blog mix 1998
Material from a selection of Swedish blogs. Is updated regularly.
Corpus Swedish
Medieval letters: Norwegian
Charters written in Norwegian, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus Norwegian
Applications
Anonymised job applications. The corpus is protected, contact Lena Rogström (lena.rogstroem@svenska.gu.se) for more information and access.
Corpus Swedish