Språkbanken Text is a part of Språkbanken.
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1283)
Collections (28)
Corpora (1162)
Lexicons (60)
Training and evaluation data (15)
Models (46)
Resource
Number of entries
Language
Access
Academic wordlist
Academic wordlist
655
Swedish
Dataset:
ao.xml
2017-09-13 – 265.72 KB – CC BY 4.0
Aventinus
Drug related terminology
13,803
Swedish
Dataset:
aventinus.zip
2023-06-12 – 339.93 KB – CC BY 4.0
Blingbring
Blingbring, an enhanced and modernized version of Bring's thesaurus (1930)
126,910
Swedish
Dataset:
blingbring.txt
2017-09-20 – 7.52 MB – CC BY 4.0
Dataset:
blingbring.xml
2021-11-09 – 42.68 MB – CC BY 4.0
Bliss
Blissymbolics is a constructed symbol language which is mainly used by people with severe communicative and physical disabilities. It consists of around 5000 graphical symbols.
5,596
Blissymbols
Dataset:
bliss.xml
2017-09-13 – 2.73 MB – CC BY 4.0
Bring
A digital version of Bring's thesaurus (1930)
148,815
Swedish
Dataset:
bring.txt
2017-09-11 – 6.69 MB – CC BY 4.0
CoDeRooMor, v.01
Morphological dataset (word-building morphology), Swedish L2 profiles project
Swedish
Dataset:
CodeRoomor_v01_lemgramView.csv
2021-04-13 – 1.96 MB – CC BY 4.0
Dataset:
CodeRoomor_v01_morphemeView.csv
2021-04-13 – 856.29 KB – CC BY 4.0
Dataset:
CodeRoomor_v01_lemgramView.xlsx
2021-04-13 – 1.72 MB – CC BY 4.0
Dataset:
CodeRoomor_v01_morphemeView.xlsx
2021-04-13 – 699.46 KB – CC BY 4.0
Constructicon
Swedish Constructicon
441
Swedish
Dataset:
konstruktikon.xml
2021-11-09 – 2.03 MB – CC BY 4.0
Dalin Dictionary
Dalin's Dictionary of 19th century Swedish
62,975
Swedish
Dataset:
dalin.xml
2017-09-13 – 32.26 MB – CC BY 4.0
Dalin Dictionary - Base Material
Dalin's Dictionary of 19th century Swedish - base material
62,327
Swedish
Dataset:
dalin-base.xml
2017-09-13 – 25.76 MB – CC BY 4.0
Dalin's morphology
A morphology from Dalin's Dictionary of 19th century Swedish that is derived from Dalin's base material.
62,327
Swedish
Dataset:
dalinm.xml
2017-09-13 – 133.24 MB – CC BY 4.0
Diachronic pivot
Diachronic pivot resource where historical lexical information is linked to SALDO
29,432
Swedish
Dataset:
diapivot.xml
2017-09-13 – 21.57 MB – CC BY 4.0
Hellquist's Swedish etymology
Hellquist's Swedish etymology
12,368
Swedish
Dataset:
hellqvist.xml
2021-11-09 – 20.88 MB – CC BY 4.0
Idioms from the NEO lexicon DB
Idioms with explanations extracted from the database for the dictionary Nationalencyklopediens ordbok
Swedish
Dataset:
idiom_ur_neodatabasen.xlsx
2015-03-24 – 429.97 KB – CC BY 4.0
Dataset:
neo_idiom_m_alternativformer.xml
2015-03-24 – 2.55 MB – CC BY 4.0
Kelly
Keywords for Language Learning for Young and adults alike
8,425
Swedish
Dataset:
kelly.xml
2017-09-15 – 5.56 MB – CC BY 4.0
Dataset:
Swedish-Kelly_M3_CEFR.xls
2012-02-15 – 1.28 MB – CC BY 4.0
LingFN
A Framenet for the linguistic domain
173
Swedish
LingFN-thesis
A Framenet for the linguistic domain
16
Swedish
LingFN-V2
A Framenet for the linguistic domain
169
Swedish
lsilex
A lexicon developed within the LSI project
41
Swedish
LWT
Loan Word Typology list
1,460
Swedish, English
Dataset:
lwt.xml
2017-09-19 – 665.94 KB – CC BY 4.0
LWT-PWN
LWT-PWN is the IDS/LWT concept list linked to Princeton WordNet 3.0 word sense identifiers.
1,375
Swedish
Dataset:
lwt-pwn.txt
2015-03-31 – 204.14 KB – CC BY 4.0
MAÞiR Words
Old Swedish lexical resource based upon Söderwall's dictionary, suitable for creating lemmatizers, among other things.
28,357
Swedish
Dataset:
mathir_words_v1.0.tgz
2024-01-25 – 306.42 KB – CC BY 4.0
Multilingual Constructicon
A multilingual Constructicon
740
Swedish, Russian
Dataset:
konstruktikon-multi.xml
2021-11-09 – 1.47 MB – CC BY 4.0
NordiCon
NordiCon is a database that collects medieval North Germanic personal names in sources outside Scandinavia.
English
Collection
NPEGL
The Noun Phrases in Early Germanic Languages database.
Old English (ca. 450-1100), Old High German (ca. 750-1050), Old Norse, Old Saxon
See 5 collected resources
NyLLex v2
A lexical resource derived from books published by Sweden's largest publisher of easy language texts. The entries are annotated with frequency counts distributed over six reading proficiency levels.
Swedish
Dataset:
nyllex_v2.csv
2023-06-09 – 1.46 MB – CC BY 4.0
Old Swedish morphology
Old Swedish morphology from Söderwall and Schlyter
41,958
Swedish
Dataset:
fsvm.xml
2017-09-13 – 31.61 MB – CC BY 4.0
OSA (SAOB)
The Swedish Academy Dictionary online
Swedish
Parole
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information
29,298
Swedish
Dataset:
PAROLE_usyn_descr.txt
2012-03-27 – 913.17 KB – CC BY 4.0
Parole+
The Swedish PAROLE Lexicon - A language technology resource with access to syntactic information, partially linked to SALDO senses
29,621
Swedish
Dataset:
parolelexplus.xml
2017-09-19 – 13.93 MB – CC BY 4.0
Russian Constructicon
A Russian Constructicon
715
Russian, English
Dataset:
konstruktikon-rus.xml
2021-11-09 – 2.72 MB – CC BY 4.0
SALDO
SALDO is an extensive lexicon resource for modern Swedish written language.
131,020
Swedish
Dataset:
saldo.xml
2017-09-19 – 70.98 MB – CC BY 4.0
SALDO's morphology
Semantic and morphological lexicon for language technology
128,036
Swedish
Dataset:
saldom.xml
2017-09-19 – 242.34 MB – CC BY 4.0
SALDO: examples
Example sentences for senses in SALDO
3,334
Swedish
Dataset:
saldoe.xml
2017-09-19 – 1.09 MB – CC BY 4.0
Schlyter
Dictionary of Old Swedish
10,067
Swedish
Dataset:
schlyter.xml
2017-09-19 – 6.87 MB – CC BY 4.0
SenSALDO
SenSALDO, SALDO entries and text word forms with sentiment information (prior polarity)
12,287
Swedish
Dataset:
sensaldo-v02.zip
2019-04-04 – 301.78 KB – CC BY 4.0
Dataset:
sensaldo.zip
2018-03-05 – 235.62 KB – CC BY 4.0
Sentiment Lexicon
Sentiment lexicon for Swedish based on SALDO
2,067
Swedish
Dataset:
sentimentlex.xml
2017-09-19 – 1.78 MB – CC BY 4.0
Dataset:
sentimentlex.csv
2016-07-11 – 486.9 KB – CC BY 4.0
Simple lexicon
The Swedish SIMPLE Lexicon - A language technology resource with access to semantic information in Swedish
11,624
Swedish
Dataset:
simple_n.html
2010-06-10 – 1.61 MB – CC BY 4.0
Dataset:
simple_v.html
2010-06-10 – 559.34 KB – CC BY 4.0
Dataset:
simple_adj.html
2010-06-10 – 104.41 KB – CC BY 4.0
Simple+
The Swedish SIMPLE Lexicon - A language technology resource with access to semantic information in Swedish, connected to SALDO senses
8,630
Swedish
Dataset:
simpleplus.xml
2017-09-19 – 6.9 MB – CC BY 4.0
Dataset:
simple_n.html
2010-06-10 – 1.61 MB – CC BY 4.0
Dataset:
simple_v.html
2010-06-10 – 559.34 KB – CC BY 4.0
Dataset:
simple_adj.html
2010-06-10 – 104.41 KB – CC BY 4.0
SKBL
The Biographical Dictionary of Swedish Women
1,411
Swedish, English
Dataset:
skbl.json
2025-01-18 – 47.72 MB – CC BY 4.0
Söderwall
Dictionary of Old Swedish
22,572
Swedish
Dataset:
soederwall.xml
2017-09-19 – 23.42 MB – CC BY 4.0
Söderwall Supplement
Dictionary of Old Swedish
19,172
Swedish
Dataset:
soederwall-supp.xml
2017-09-19 – 15.45 MB – CC BY 4.0
Sports anglicisms
English loan-words in the Swedish sports press
Swedish
Swedberg's Swensk Ordabok
Swedberg's Swensk Ordabok
17,565
Swedish, Latin
Dataset:
swedberg.xml
2017-09-19 – 8.89 MB – CC BY 4.0
Swedberg's Swensk Ordabok (morphology, rudimentary)
Swedberg's Swensk Ordabok (morphology, rudimentary)
17,565
Swedish
Dataset:
swedbergm.xml
2017-09-19 – 5.76 MB – CC BY 4.0
Swedish FrameNet (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This part of the resource contains the frames and the manually annotated semantic content.
1,195
Swedish
Dataset:
swefn.xml
2021-11-09 – 7 MB – CC BY 4.0
Dataset:
swefn-full.zip
2021-12-21 – 7.53 MB – CC BY 4.0
Swedish FrameNet 2.0 (SweFN)
A lexical semantic resource based on the same principles as the English Berkeley FrameNet. This version is updated to correspond to BFN 1.7.
1,329
Swedish
Dataset:
swefn-2-0.json.zip
2024-10-16 – 1006.51 KB – CC BY 4.0
Dataset:
swefn-2-0.tsv.zip
2024-10-16 – 969.61 KB – CC BY 4.0
Swedish words, LEXIN
Lexicon for immigrants. Second edition
29,111
Swedish, Albanian, Bosnian, English, Finnish, Modern Greek (1453-), Croatian, Kurdish, Iranian Persian, Russian, Serbian, Somali, Spanish, Turkish
Dataset:
LEXIN.zip
2024-01-25 – 1.05 MB – CC BY 4.0
Swedish-Finnish word lists
Swedish-Finnish word lists within various domains
9,607
Swedish
SweSAT Swedish Scholastic Aptitude Test Synonyms 1.1
Swedish Scholastic Aptitude Test Synonyms
782
Swedish
Dataset:
swesat-synonyms.zip
2023-03-30 – 37.73 KB – CC BY 4.0
Swesaurus
A Swedish WordNet
15,010
Swedish
Dataset:
swesaurus.xml
2017-09-19 – 12.16 MB – CC BY 4.0
The Swedish PoliGraph
An extensible knowledge graph with information on members of the Swedish parliament
505,989
Swedish
Dataset:
poligraph.tar.bz2
2020-01-14 – 2.29 MB – GNU GPLv3 or later
UNSC-Graph
An extensible knowledge graph for the UNSC corpus, detailing participants and debates from the UN Security Council 1995-2020
1,017,406
English
Dataset:
unsc-graph-1.0.tar.gz
2023-08-31 – 4.8 MB – GNU GPLv3 or later
Vocation list
A list of vocations in Swedish
13,833
Swedish
Dataset:
vocationTerms150120.utf.txt.gz
2024-01-25 – 67.12 KB – CC BY 4.0
WordNet-SALDO
A linking between SALDO senses and Core WordNet
6,989
Swedish, English
Dataset:
wordnet-saldo.xml
2017-09-19 – 5.71 MB – CC BY 4.0