Resources

On this page you can browse and search our corpora and lexicons. Click on a resource name to see what files are available for download. You can go directly to the search interface by clicking on the Korp or Karp logo.

Resource Explore Type Language Download
Collection
SuperLim

A standardized suite for evaluation and analysis of Swedish natural language understanding systems.

Corpus swe
8 Sidor

News articles from 8 SIDOR.

Korp
Corpus swe
Åbo Tidning 1883–1903

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Åbo Underrättelser 2012

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Åbo Underrättelser 2013

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Academic texts – Humanities

A corpus with academic texts.

Korp
Corpus swe
Academic texts – Social science

A corpus with academic texts.

Korp
Corpus swe
Academic wordlist

Academic wordlist

Karp
Lexicon swe
Adult bloggers

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Af Soomaali 1971-79
Korp
Corpus som
Af Soomaali 1993-94
Korp
Corpus som
Af-Soomaali 2001 Somaliland
Korp
Corpus som
Af-Soomaali 2001 Soomaaliya
Korp
Corpus som
Af-Soomaali 2016 Somaliland
Korp
Corpus som
Afka Hooyo 2010–19 Iswiidhan
Korp
Corpus som
Aftonbladet 1830's

A corpus with texts from Aftonbladet in the 1830's; part of the Kubhist corpus

Korp
Corpus swe
Aftonbladet 1830's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1840's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1840's

A corpus with texts from Aftonbladet in the 1840's; part of the Kubhist corpus

Korp
Corpus swe
Aftonbladet 1850's

A corpus with texts from Aftonbladet in the 1850's; part of the Kubhist corpus

Korp
Corpus swe
Aftonbladet 1850's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1860's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1860's

A corpus with texts from Aftonbladet in the 1860's; part of the Kubhist corpus

Korp
Corpus swe
Aftonbladet 1870's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1880's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1890's

Part of the collection Kubhist2

Korp
Corpus swe
Aftonbladet 1900's

Part of the collection Kubhist2

Korp
Corpus swe
Agriculture

Agricultural manuals: "Engelska Åker-Mannen" and "En Grundelig Kundskap Om Swenska Åkerbruket"

Korp
Corpus swe
Åland 1891–1911

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Ålandstidningen 2012

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Äldre lagar – Fornsvenska textbankens material
Korp
Corpus swe
Äldre religiös prosa – Fornsvenska textbankens material
Korp
Corpus swe
Alfwar och Skämt 1840's

Part of the collection Kubhist2

Korp
Corpus swe
Anföranden

Part of the Riksdag's open data

Korp
Corpus swe
Applications

Anonymised job applications

Korp
Corpus swe
Argus 1907–1911

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
ASPAC – Swedish

The Swedish part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe
ASPAC – Swedish-Belarussian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, bel
ASPAC – Swedish-Bulgarian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, bul
ASPAC – Swedish-Croatian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, hrv
ASPAC – Swedish-Czech

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, ces
ASPAC – Swedish-Dutch

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, nld
ASPAC – Swedish-English

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, eng
ASPAC – Swedish-French

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, fra
ASPAC – Swedish-German

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, deu
ASPAC – Swedish-Greek

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, ell
ASPAC – Swedish-Italian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, ita
ASPAC – Swedish-Latin

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, lat
ASPAC – Swedish-Lower Sorbian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, dsb
ASPAC – Swedish-Macedonian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, mkd
ASPAC – Swedish-Molise Slavik

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, svm
ASPAC – Swedish-Polish

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, pol
ASPAC – Swedish-Portuguese

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, por
ASPAC – Swedish-Romanian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, ron
ASPAC – Swedish-Russian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, rus
ASPAC – Swedish-Serbian (cyrillic)

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, sbc
ASPAC – Swedish-Serbian (latin)

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, srp
ASPAC – Swedish-Slovak

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, slk
ASPAC – Swedish-Slovene

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, slv
ASPAC – Swedish-Spanish

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, spa
ASPAC – Swedish-Turkmen

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, tuk
ASPAC – Swedish-Ukrainian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, ukr
ASPAC – Swedish-Upper Sorbian

Part of The Amsterdam Slavic Parallel Aligned Corpus

Korp
Corpus swe, hsb
Astra 1920–1959

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Astra 1960–1979

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Astra Nova 2008-2010

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
ASU

Structural development of the second language

Korp
Corpus swe
Ateneum 1898–1899

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Ateneum 1900–1903

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
August Strindberg's letters

Part of the collected works of August Strindberg

Korp
Corpus swe
August Strindberg's novels

Part of the collected works of August Strindberg

Korp
Corpus swe
Aventinus

Drug-related terminology

Lexicon swe
Banco de Datos de Once Novelas Españolas 1951—1971 (SOL)

Part of SOL - Spanish Online

Korp
Corpus spa
Banco de Datos de Prensa Española 1977 (SOL)

Part of SOL - Spanish Online

Korp
Corpus spa
Barometern 1840's

Part of the collection Kubhist2

Korp
Corpus swe
Barometern 1850's

Part of the collection Kubhist2

Korp
Corpus swe
Barometern 1860's

Part of the collection Kubhist2

Korp
Corpus swe
Barometern 1870's

Part of the collection Kubhist2

Korp
Corpus swe
Barometern 1880's

Part of the collection Kubhist2

Korp
Corpus swe
Barometern 1890's

Part of the collection Kubhist2

Korp
Corpus swe
Bellman

Collected works of C.M. Bellman

Korp
Corpus swe
Betänkande

Part of the Riksdag's open data

Korp
Corpus swe
Betänkande angående likformig uppställning av grammatiska läroböcker
Korp
Corpus swe
Biblioteksbladet
Korp
Corpus swe
Björneborgs Tidning 1897–1907

Part of the Finland Swedish language bank over Swedish in Finland today and yesterday

Korp
Corpus swe
Blekingsposten 1850's

Part of the collection Kubhist2

Korp
Corpus swe
Blekingsposten 1850's

A corpus with texts from Blekingsposten in the 1850's; part of the Kubhist corpus

Korp
Corpus swe
Blekingsposten 1860's

Part of the collection Kubhist2

Korp
Corpus swe
Blekingsposten 1860's

A corpus with texts from Blekingsposten in the 1860's; part of the Kubhist corpus

Korp
Corpus swe
Blekingsposten 1870's

Part of the collection Kubhist2

Korp
Corpus swe
Blekingsposten 1870's

A corpus with texts from Blekingsposten in the 1870's; part of the Kubhist corpus

Korp
Corpus swe
Blekingsposten 1880's

A corpus with texts from Blekingsposten in the 1880's; part of the Kubhist corpus

Korp
Corpus swe
Blekingsposten 1880's

Part of the collection Kubhist2

Korp
Corpus swe
Blingbring

Blingbring, an enhanced and modernized version of Bring's thesaurus (1930)

Karp
Lexicon swe
Bliss

Blissymbolics is a constructed symbol language used mainly by people with severe communicative and physical disabilities. It consists of around 5000 graphical symbols.

Karp
Lexicon language-independent
Blog mix 1998

Material from a selection of Swedish blogs. Updated regularly.

Korp
Corpus swe
Blog mix 1999

Material from a selection of Swedish blogs. Updated regularly.

Korp
Corpus swe
Blog mix 2000

Material from a selection of Swedish blogs. Updated regularly.

Korp
Corpus swe
Blog mix 2001

Material from a selection of Swedish blogs. Updated regularly.

Korp
Corpus swe
Blog mix 2002

Material from a selection of Swedish blogs. Updated regularly.

Korp
Corpus swe