Menu
News and events
Open submenu
Research
Open submenu
Data
Analyses
Platforms
Open submenu
About us
Open submenu
Contact us
Open submenu
FAQ
Close submenu
News and events
News archive
Conferences and workshops
Open submenu
Blog
Calendar
Open submenu
Close submenu
Conferences and workshops
CLT retreat 2020
AI Trust workshop
Autumn Workshop
Open submenu
CDLC workshop
CLT workshop Spring 2023
EACL 2014
Korp Workshop
Open submenu
NoDaLiDa 2017
RESOURCEFUL
SLTC 2020
Open submenu
Sustainable language representations
Open submenu
Workshop on Profiling second language vocabulary and grammar - 2023
Close submenu
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
Close submenu
Korp Workshop
Korp Workshop 2014
Korpworkshop 2018
Close submenu
SLTC 2020
Programme
Instructions
People
Support
Call for papers
Close submenu
Sustainable language representations
Position statements
Close submenu
Calendar
Previous events
Close submenu
Research
Publications
Doktorandutbildning
Open submenu
Close submenu
Doktorandutbildning
For PhD students and supervisors
Close submenu
Platforms
Korp
Open submenu
Karp
Open submenu
Sparv
Open submenu
Mink
Open submenu
Lärka
Other tools
Open submenu
Close submenu
Korp
User manual
Web API
Distribution and development
Corpus statistics
Sentence sets
Close submenu
Karp
Web API
Close submenu
Sparv
Sparv Pipeline
Sparv's user manual
Annotations by Sparv
Web service (API)
Web Sparv
Close submenu
Mink
User manual
Tutorial
Video: Overview (sv)
Web API
Privacy and data policy
Close submenu
Other tools
Catta
IT-baserad grammatikinlärning
Close submenu
About us
Staff
Organisation
Språkbanken Text i världen
Språkbanken 50 years
Open submenu
A brief history
PhD program
Teaching
How to cite
Alumni
Meetings and workshops
Open submenu
Cookies
Internal
Close submenu
Språkbanken 50 years
Celebration
Close submenu
Meetings and workshops
Kick-off meetings
Open submenu
Workshops
Open submenu
Forskningsmöten
SBX Retreat
Open submenu
Working group meetings
Close submenu
Kick-off meetings
Kick-off H2021
Kick-off V2021
Kick-off H2020
Kick-off V2020
Kick-off H2019
Kick-off V2019
Kick-off H2018
Kick-off V2018
Kick-off H2017
Kick-off V2017
Kick-off H2016
Kick-off V2016
Kick-off H2015
Close submenu
Workshops
End of the year workshop 2024
End of the year workshop 2023
Semester workshop 2022
Semester workshop H2021
Semester workshop V2021
Semester workshop H2020
Semester workshop V2020
Close submenu
SBX Retreat
SBX Retreat 2024
SBX Retreat 2023
SBX Retreat 2022
Close submenu
Contact us
Help desk
Skip to main content
Svenska
English
Språkbanken Text is a part of
Språkbanken
.
News and events
Research
Data
Analyses
Platforms
About us
Contact us
FAQ
Menu
Breadcrumb
Home
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1329)
Collections (31)
Corpora (1200)
Lexicons (66)
Training and evaluation data (15)
Models (48)
Name or description
Language
- Any -
Swedish
Albanian
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Typ
Språk
Åtkomst
LWT
Loan Word Typology list
Lexicon
Swedish, English
Dataset:
lwt.xml
2017-09-19 – 665.94 KB – CC BY 4.0
Explore in:
LWT-PWN
LWT-PWN is the IDS/LWT concept list linked to Princeton WordNet 3.0 word sense identifiers.
Lexicon
Swedish
Dataset:
lwt-pwn.txt
2015-03-31 – 204.14 KB – CC BY 4.0
MAÞiR Trees
An Old Swedish treebank, with lemma, parts-of-speech, and PROIEL-style dependency syntax.
Corpus
Swedish
Dataset:
mathir_trees_v0.1.tgz
2024-04-17 – 5.49 MB – CC BY-NC 4.0
MAÞiR Words
Old Swedish lexical resource based upon Söderwall's dictionary, suitable for creating lemmatizers, amonst other things.
Lexicon
Swedish
Dataset:
mathir_words_v1.0.tgz
2024-01-25 – 306.42 KB – CC BY 4.0
Collection
Medieval letters
Swedish medieval charters from Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
Latin, German, Norwegian, Swedish
See 5 collected resources
Explore in:
Medieval letters: German
Charters written in German, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
German
Dataset:
sdhk-tyska.xml.bz2
2015-05-20 – 335.84 KB – CC BY 4.0
Word statistics:
stats_SDHK-TYSKA.txt
2013-04-28 – 396.26 KB – CC BY 4.0
Explore in:
Medieval letters: Latin
Charters written in Latin, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
Latin
Dataset:
sdhk-latin.xml.bz2
2015-05-20 – 4.71 MB – CC BY 4.0
Word statistics:
stats_SDHK-LATIN.txt
2013-04-28 – 2.58 MB – CC BY 4.0
Explore in:
Medieval letters: Norwegian
Charters written in Norwegian, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
Norwegian
Dataset:
sdhk-norska.xml.bz2
2015-05-20 – 58.95 KB – CC BY 4.0
Word statistics:
stats_SDHK-NORSKA.txt
2013-04-28 – 152.5 KB – CC BY 4.0
Explore in:
Medieval letters: Other languages
Charters written in other languages, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
Swedish
Dataset:
sdhk-ovrigt.xml.bz2
2015-05-20 – 91.05 KB – CC BY 4.0
Word statistics:
stats_SDHK-OVRIGT.txt
2013-04-28 – 243.42 KB – CC BY 4.0
Explore in:
Medieval letters: Swedish
Charters written in Swedish, from the Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
Swedish
Dataset:
sdhk-svenska.xml.bz2
2014-12-09 – 1.77 MB – CC BY 4.0
Word statistics:
stats_SDHK-SVENSKA.txt
2013-04-28 – 1.82 MB – CC BY 4.0
Explore in:
Memory books of the city of Stockholm
Minutes and memorial notes from Stockholm City Hall, 1626.
Corpus
Swedish
Dataset:
tankebok.xml.bz2
2014-12-08 – 661.68 KB – CC BY 4.0
Word statistics:
stats_TANKEBOK.txt
2015-06-25 – 595.1 KB – CC BY 4.0
Explore in:
MEPAC bloggar
Corpus
Swedish
Word statistics:
stats_MEPAC.txt
2017-03-19 – 6.1 MB – CC BY 4.0
Explore in:
MEPAC intervjuer
Corpus
Swedish
Word statistics:
stats_MEPAC-I.txt
2017-03-19 – 878.01 KB – CC BY 4.0
Explore in:
MuClaGED
MuClaGED is a dataset for multi-class Grammatical Error Detection for Swedish. The dataset is based on the SweLL-gold corpus.
Corpus
Swedish
Explore in:
MultiGEC
MultiGEC is a dataset for Grammatical Error Correction containing parallel data for 12 languages and 17 subcorpora. Each subcorpus contains two or more parallel versions of the same texts (typically, full learner essays), where one version (orig) is the one that the author originally wrote, and the others (ref1, ref2, ...) are corrected versions of the same text. Languages included: Czech, English, Estonian, German, Greek, Icelandic, Italian, Latvian, Russian, Slovene, Swedish and Ukrainian (English and Russian are available on request). Texts come from different original corpora, but are reformatted to a unified format.
Corpus
Czech, German, Modern Greek (1453-), English, Estonian, Icelandic, Italian, Latvian, Russian, Slovenian, Swedish, Ukrainian
Explore in:
MultiGED
MultiGEC is a dataset for Grammatical Error Detection (a task within NLP) containing data for 5 languages (Czech, English, German, Italian and Swedish).
Corpus
Czech, German, English, Italian, Swedish
Dataset:
multiged-2023.tar.bz2
2025-01-22 – 3.82 MB – varies
Explore in:
Multilingual Constructicon
A multilingual Constructicon
Lexicon
Swedish, Russian
Dataset:
konstruktikon-multi.xml
2021-11-09 – 1.47 MB – CC BY 4.0
Nils Matsson Kiöping's journeys
Reseskildringar från 1674 och 1743
Corpus
Swedish
Dataset:
kioping.xml.bz2
2015-05-20 – 761.93 KB – CC BY 4.0
Word statistics:
stats_KIOPING.txt
2014-04-29 – 895.08 KB – CC BY 4.0
Explore in:
NordiCon
NordiCon is a database that collects medieval North Germanic personal names in sources outside Scandinavia.
Lexicon
English
Explore in:
Norstedts novels (1999)
A collection of 23 novels issued by Norstedts publishing house in 1999
Corpus
Swedish
Dataset:
rom99.xml.bz2
2017-03-17 – 40.83 MB – CC BY 4.0
Word statistics:
stats_ROM99.txt
2017-03-19 – 7.38 MB – CC BY 4.0
Explore in:
Collection
NPEGL
The Noun Phrases in Early Germanic Languages database.
Lexicon
Old English (ca. 450-1100), Old High German (ca. 750-1050), Old Norse, Old Saxon
See 5 collected resources
Explore in:
NPEGL: Old English
Corpus
English
Word statistics:
stats_NPEGL-ENG.txt
2021-07-04 – 2.07 MB – CC BY 4.0
Explore in:
NPEGL: Old High German
Corpus
German
Word statistics:
stats_NPEGL-GER.txt
2021-06-20 – 69.41 KB – CC BY 4.0
Explore in:
NPEGL: Old Icelandic
Corpus
Icelandic
Word statistics:
stats_NPEGL-ICE.txt
2021-06-20 – 517.04 KB – CC BY 4.0
Explore in:
NPEGL: Old Saxon
Corpus
Word statistics:
stats_NPEGL-SAX.txt
2021-06-20 – 158.12 KB – CC BY 4.0
Explore in:
NPEGL: Old Swedish
Corpus
Swedish
Word statistics:
stats_NPEGL-SWE.txt
2021-07-04 – 116.36 KB – CC BY 4.0
Explore in:
NyLLex v2
A lexical resource derived from books published by Sweden´s largest publisher of easy language texts. The entries are annotated with frequency counts distributed over six reading proficiency levels.
Lexicon
Swedish
Dataset:
nyllex_v2.csv
2023-06-09 – 1.46 MB – CC BY 4.0
Collection
Old Finland Swedish
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
See 42 collected resources
Explore in:
Old Finland Swedish: Åbo Tidning 1883–1903
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ABOTIDNING.txt
2017-06-18 – 868.49 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Åland 1891–1911
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ALAND.txt
2017-06-18 – 590.93 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Argus 1907–1911
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ARGUS.txt
2017-06-18 – 1.52 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Astra 1920–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ASTRA1920-1959.txt
2017-06-18 – 1.95 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Ateneum 1898–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ATENEUM-1800TAL.txt
2017-06-18 – 1.34 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Ateneum 1900–1903
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ATENEUM-1900TAL.txt
2017-06-18 – 1014.48 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Björneborgs Tidning 1897–1907
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_BJORNEBORGSTIDNING.txt
2017-06-18 – 567.04 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Borgåbladet 1885
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_BORGABLADET2.txt
2017-06-18 – 76.2 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Diaries 1700-tal
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_DAGBOCKER1700TAL.txt
2017-06-18 – 1.17 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Diaries 1800–1849
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_DAGBOCKER1800-1849.txt
2017-06-18 – 1.03 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Euterpe 1900–1905
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_EUTERPE.txt
2017-06-18 – 1.17 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Fiction 1800–1849
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBSKONLIT1800-1849.txt
2017-06-18 – 1.64 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Fiction 1850–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBSKONLIT1850-1899.txt
2017-06-18 – 3.78 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Fiction 1900–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBSKONLIT1900-1959.txt
2017-06-25 – 4.22 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Filosofia.fi 1850–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FILOSOFIA1850-1899.txt
2017-06-18 – 779.32 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Filosofia.fi 1900–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FILOSOFIA1900-1959.txt
2017-06-18 – 865.96 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Finsk Tidskrift 1878–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FINSKTIDSKRIFT1800TAL.txt
2017-06-18 – 1.96 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Finsk Tidskrift 1900–1912
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FINSKTIDSKRIFT1900TAL.txt
2017-06-18 – 1.71 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Frågebrevsvar 1900–1949
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_DAGBOCKER1900-1949.txt
2017-06-18 – 616.15 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Fredrikshamns Tidning 1888–1908
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FREDRIKSHAMNSTIDNING.txt
2017-06-18 – 392.92 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Helsingfors Dagblad 1866–1886
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_DAGBLADET1866-1886.txt
2017-06-18 – 908.34 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Hufvudstadsbladet 1893–1903
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_HBL1800.txt
2017-06-18 – 807.94 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Husmodern 1903–1912
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_HUSMODERN.txt
2017-06-18 – 923.19 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Landtmannen 1877–1879
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_LANDTMANNEN.txt
2017-06-18 – 600.33 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Letters 1700-tal
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBBREV1700TAL.txt
2017-06-18 – 696.95 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Letters 1800–1849
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBBREV1800-1849.txt
2017-06-18 – 632.33 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Letters 1850–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBBREV1850-1899.txt
2017-06-18 – 780.02 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Letters 1900–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBBREV1900TAL.txt
2017-06-18 – 736.89 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Literary magazine published in Helsingfors
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_LITTERARTIDSKRIFT-HELSINGFORS.txt
2017-06-18 – 870.8 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Non-fiction 1700–1749
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_SAKPROSA1700-1749.txt
2017-06-18 – 771.81 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Non-fiction 1750–1799
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_SAKPROSA1750-1799.txt
2017-06-18 – 1.62 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Non-fiction 1800–1849
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_SAKPROSA1800-1849.txt
2017-06-18 – 3.6 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Non-fiction 1850–1899
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_SAKPROSA1850-1899.txt
2017-06-18 – 2.94 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Non-fiction 1900–1959
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_SAKPROSA1900-1959.txt
2017-06-18 – 2.43 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Protokoll vid lantdagen i Borgå år 1809
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_FSBMYNDIGHET1800TAL.txt
2017-06-18 – 1.77 MB – CC BY 4.0
Explore in:
Old Finland Swedish: Spanska Flugan 1839–1841
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_SPANSKAFLUGAN.txt
2017-06-25 – 527.61 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Tidningar Utgifne af et Sällskap i Åbo 1771–1783
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_TIDNINGARSALLSKAPETIABO.txt
2017-06-18 – 123.03 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Typografiskt minnesblad 1891
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_TYPOGRAFISKTMINNESBLAD.txt
2017-06-18 – 196.54 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Typograftidning 1889–1890
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_TYPOGRAFTIDNING.txt
2017-06-18 – 753.61 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Uleåborgs Tidning 1877–1887
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_ULEABORGSTIDNING.txt
2017-06-18 – 273.23 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Wasabladet 1866–1896
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_WASABLADET.txt
2017-06-18 – 894 KB – CC BY 4.0
Explore in:
Old Finland Swedish: Wiborgs Tidning 1867–1877
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
Word statistics:
stats_WIBORGSTIDNING.txt
2017-06-18 – 343.96 KB – CC BY 4.0
Explore in:
Old Swedish morphology
Old Swedish morphology from Söderwall and Schlyter
Lexicon
Swedish
Dataset:
fsvm.xml
2017-09-13 – 31.61 MB – CC BY 4.0
Explore in:
Older Swedish novels
A collection of more than 50 older Swedish novels by 14 different authors
Corpus
Swedish
Dataset:
romg.xml.bz2
2017-03-17 – 60.15 MB – CC BY 4.0
Word statistics:
stats_ROMG.txt
2017-03-19 – 8.91 MB – CC BY 4.0
Explore in:
OpenEDGeS
The public license subset of the EDGeS Diachronic Bible Corpus, a diachronically and synchronically parallel corpus of Bible translations in Dutch,English, German and Swedish, with texts from the 14th century until today.
Corpus
Swedish, English, German, Dutch
Dataset:
OpenEDGeS_v1.01.zip
2024-01-25 – 121.17 MB – CC BY-NC-SA 4.0
Dataset:
OpenEDGeS_v1.0.0.zip
2024-01-25 – 72.89 MB – For license details of the previous versions, see the 'Read me.txt' file in the download.
Oral Copus for Reference of Contemporary Spanish
Corpus with transcriptions from recorded audio tapes from 1991 to 1992. Part of SOL - Spanish Online
Corpus
Spanish
Dataset:
cor92.xml.bz2
2017-11-10 – 2.33 MB – CC BY 4.0
Explore in:
ORDAT
Yearbook of Svenska Dagbladet 1923–1958
Corpus
Swedish
Dataset:
ordat.xml.bz2
2017-05-16 – 28.07 MB – CC BY 4.0
Word statistics:
stats_ORDAT.txt
2017-05-21 – 7.13 MB – CC BY 4.0
Explore in:
OSA (SAOB)
The Swedish Academy Dictionary online
Lexicon
Swedish
Östgötalagen
Corpus
Swedish
Word statistics:
stats_OGL.txt
2015-11-01 – 90.5 KB – CC BY 4.0
Explore in:
Parole
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information
Lexicon
Swedish
Dataset:
PAROLE_usyn_descr.txt
2012-03-27 – 913.17 KB – CC BY 4.0
PAROLE
A corpus annotated with morphological and syntactic information
Corpus
Swedish
Dataset:
parole.xml.bz2
2017-05-17 – 425.19 MB – CC BY 4.0
Dataset:
parole.zip
2024-01-25 – 67.62 MB – CC BY 4.0
Word statistics:
stats_PAROLE.txt
2017-05-21 – 39.18 MB – CC BY 4.0
Explore in:
Parole+
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information, partially linked to SALDO senses
Lexicon
Swedish
Dataset:
parolelexplus.xml
2017-09-19 – 13.93 MB – CC BY 4.0
Explore in:
Podiet
Articles from the consert magazine Podiet
Corpus
Swedish
Dataset:
podiet.xml.bz2
2024-01-02 – 14.4 MB – CC BY 4.0
Word statistics:
stats_podiet.csv
2024-01-03 – 11.99 MB – CC BY 4.0
Explore in:
Poeter.se
Poetry from Poeter.se
Corpus
Swedish
Dataset:
poeter.xml.bz2
2017-04-20 – 1.65 GB – CC BY 4.0
Word statistics:
stats_POETER.txt
2017-04-23 – 71.35 MB – CC BY 4.0
Explore in:
POS-tagging model: Flair
Pretrained models for POS-tagging.
Model
Swedish
Dataset:
flair_eval.zip
2020-06-18 – 1.37 GB – CC BY 4.0
Dataset:
flair_full.zip
2020-06-18 – 1.37 GB – CC BY 4.0
POS-tagging model: Marmot
Pretrained models for POS-tagging.
Model
Swedish
Dataset:
marmot_eval.marmot
2020-06-29 – 108.59 MB – CC BY 4.0
Dataset:
marmot_full.marmot
2020-06-29 – 113.41 MB – CC BY 4.0
Dataset:
saldo_marmot.txt
2020-06-29 – 46.33 MB – CC BY 4.0
POS-tagging model: Stanza
Pretrained models for POS-tagging.
Model
Swedish
Dataset:
morph_stanza_eval.zip
2020-12-09 – 19.94 MB – CC BY 4.0
Dataset:
morph_stanza_full2.zip
2020-12-09 – 20.19 MB – CC BY 4.0
Dataset:
stanza_pretrain.zip
2025-02-20 – 91.7 MB – CC BY 4.0
Preparatory work 1734
Material från lagkommissionen till 1734 års lag
Corpus
Swedish
Dataset:
forarbeten1734.xml.bz2
2014-12-08 – 9.11 MB – CC BY 4.0
Word statistics:
stats_FORARBETEN1734.txt
2015-06-25 – 4.11 MB – CC BY 4.0
Explore in:
Collection
Press
Swedish press
Corpus
Swedish
See 6 collected resources
Explore in:
Press 65
Swedish press 1965
Corpus
Swedish
Dataset:
press65.xml.bz2
2017-03-14 – 20.88 MB – CC BY 4.0
Word statistics:
stats_PRESS65.txt
2017-03-19 – 6.71 MB – CC BY 4.0
Explore in:
Press 76
Swedish press 1976
Corpus
Swedish
Dataset:
press76.xml.bz2
2017-03-17 – 24.45 MB – CC BY 4.0
Word statistics:
stats_PRESS76.txt
2017-03-19 – 7.37 MB – CC BY 4.0
Explore in:
Press 95
Swedish press 1995
Corpus
Swedish
Dataset:
press95.xml.bz2
2017-03-15 – 139.65 MB – CC BY 4.0
Word statistics:
stats_PRESS95.txt
2017-03-19 – 19.6 MB – CC BY 4.0
Explore in:
Press 96
Swedish press 1996
Corpus
Swedish
Dataset:
press96.xml.bz2
2017-03-15 – 117.54 MB – CC BY 4.0
Word statistics:
stats_PRESS96.txt
2017-03-19 – 18.49 MB – CC BY 4.0
Explore in:
Press 97
Swedish press 1997
Corpus
Swedish
Dataset:
press97.xml.bz2
2017-03-17 – 241.09 MB – CC BY 4.0
Word statistics:
stats_PRESS97.txt
2017-03-19 – 28.62 MB – CC BY 4.0
Explore in:
Press 98
Swedish press 1998
Corpus
Swedish
Dataset:
press98.xml.bz2
2017-03-17 – 187.08 MB – CC BY 4.0
Word statistics:
stats_PRESS98.txt
2017-03-19 – 24.51 MB – CC BY 4.0
Explore in:
Pretrained embeddings
A list of pretrained embeddings for Swedish
Model
Swedish
Psalm book (1937)
The Swedish psalm book from 1937
Corpus
Swedish
Dataset:
psalmboken.xml.bz2
2017-05-18 – 1.72 MB – CC BY 4.0
Word statistics:
stats_PSALMBOKEN.txt
2016-03-20 – 670.48 KB – CC BY 4.0
Explore in:
Questions and answers about the Swedish language
Counselling mails of the Language Council of Sweden
Corpus
Swedish
Explore in:
Collection
Riksdag of the Estates
Collection of textual documents from the Swedish Riksdag of the Estates
Corpus
Swedish
See 7 collected resources
Explore in:
Riksdag of the Estates: Adelsståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-adelsstandet.xml.bz2
2024-06-17 – 852.82 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-adelsstandet.csv
2024-08-05 – 75.66 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Bihang m.m.
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-bihang.xml.bz2
2024-06-18 – 841.09 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-bihang.csv
2024-08-05 – 88.43 MB – CC BY 4.0
Explore in:
Riksdag of the Estates: Bondeståndet
Part of the data set "Riksdag of the Estates"
Corpus
Swedish
Dataset:
standsriksdagen-bondestandet.xml.bz2
2024-06-18 – 411.81 MB – CC BY 4.0
Word statistics:
stats_standsriksdagen-bondestandet.csv
2024-08-05 – 54.4 MB – CC BY 4.0
Explore in:
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
Page
10
Page
11
Page
12
Page
13
Next page
Next ›
Last page
Last »
Close menu