Menu
News and events
Open submenu
Research
Open submenu
Tools
Open submenu
Data
FAQ
About us
Open submenu
Contact us
Open submenu
Close submenu
News and events
News archive
Conferences and workshops
Open submenu
Blog
Calendar
Open submenu
Close submenu
Conferences and workshops
CLT retreat 2020
AI Trust workshop
Autumn Workshop
Open submenu
CDLC workshop
CLT workshop Spring 2023
EACL 2014
Korp Workshop
Open submenu
NoDaLiDa 2017
RESOURCEFUL
SLTC 2020
Open submenu
Sustainable language representations
Open submenu
Workshop on Profiling second language vocabulary and grammar - 2023
Close submenu
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
Close submenu
Korp Workshop
Korp Workshop 2014
Korpworkshop 2018
Close submenu
SLTC 2020
Programme
Instructions
People
Support
Call for papers
Close submenu
Sustainable language representations
Position statements
Close submenu
Calendar
Previous events
Close submenu
Research
Publications
Doktorandutbildning
Open submenu
Close submenu
Doktorandutbildning
For PhD students and supervisors
Close submenu
Tools
Korp
Open submenu
Karp
Open submenu
Sparv
Open submenu
Mink
Open submenu
Lärka
Other tools
Open submenu
Close submenu
Korp
User manual
Web API
Distribution and development
Corpus statistics
Sentence sets
Close submenu
Karp
Web API
Close submenu
Sparv
Sparv Pipeline
Sparv's user manual
Annotations by Sparv
Web service (API)
Web Sparv
Close submenu
Mink
User manual
Tutorial
Web API
Privacy and data policy
Close submenu
Other tools
Catta
IT-baserad grammatikinlärning
Close submenu
About us
Staff
Organisation
Språkbanken Text i världen
Språkbanken 50 years
Open submenu
A brief history
PhD program
Teaching
How to cite
Alumni
Meetings and workshops
Open submenu
Cookies
Internal
Close submenu
Språkbanken 50 years
Celebration
Close submenu
Meetings and workshops
Kick-off meetings
Open submenu
Workshops
Open submenu
Forskningsmöten
SBX Retreat
Open submenu
Working group meetings
Close submenu
Kick-off meetings
Kick-off H2021
Kick-off V2021
Kick-off H2020
Kick-off V2020
Kick-off H2019
Kick-off V2019
Kick-off H2018
Kick-off V2018
Kick-off H2017
Kick-off V2017
Kick-off H2016
Kick-off V2016
Kick-off H2015
Close submenu
Workshops
End of the year workshop 2024
End of the year workshop 2023
Semester workshop 2022
Semester workshop H2021
Semester workshop V2021
Semester workshop H2020
Semester workshop V2020
Close submenu
SBX Retreat
SBX Retreat 2024
SBX Retreat 2023
SBX Retreat 2022
Close submenu
Contact us
Help desk
Skip to main content
Svenska
English
Språkbanken Text is a part of
Språkbanken
.
News and events
Research
Tools
Data
FAQ
About us
Contact us
Menu
Breadcrumb
Home
Language resources
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1323)
Collections (30)
Corpora (1198)
Lexicons (62)
Training and evaluation data (15)
Models (48)
Name or description
Language
- Any -
Swedish
Albanian
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Typ
Språk
Åtkomst
Collection
ASPAC
The Amsterdam Slavic Parallel Aligned Corpus
Corpus
Swedish, Belarusian, Bulgarian, Czech, German, Lower Sorbian, Modern Greek (1453-), English, Spanish, French, Croatian, Upper Sorbian, Latin, Macedonian, Dutch, Polish, Portuguese, Romanian, Russian, Kele (Papua New Guinea), Slovak, Slovenian, Serbian, Slavomolisano, Turkmen, Ukrainian
See 27 collected resources
Explore in:
Collection
Bicameral Riksdag
Collection of textual documents from the Swedish bicameral parliament data
Corpus
Swedish
See 10 collected resources
Explore in:
Collection
Blog mix
Material from a selection of Swedish blogs. Regularly updated.
Corpus
Swedish
See 21 collected resources
Explore in:
Collection
Europarl
European Parliament Proceedings Parallel Corpus
Corpus
Swedish, Danish, German, Modern Greek (1453-), English, Spanish, Finnish, French, Italian, Dutch, Portuguese
See 11 collected resources
Explore in:
Collection
Familjeliv
Material from the Familjeliv internet forum
Corpus
Swedish
See 23 collected resources
Explore in:
Collection
Finland Swedish
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
See 56 collected resources
Explore in:
Collection
Flashback
Material from the Flashback internet forum
Corpus
Swedish
See 16 collected resources
Explore in:
Collection
Fornsvenska textbankens material
A collection of Old Swedish texts from Fornsvenska textbanken
Corpus
Swedish
See 12 collected resources
Explore in:
Collection
Göteborgsposten
A corpus with texts from the newspaper Göteborgs-Posten
Corpus
Swedish
See 14 collected resources
Explore in:
Collection
Kubhist
Diachronic collection of Swedish historical newspaper texts from the period of 1749–1926
Corpus
Swedish
See 78 collected resources
Collection
Kubhist 2
Diachronic collection of Swedish historical newspaper texts from the period of 1645–1926. Kubhist 2 is an updated version av Kubhist with improved OCR and more material.
Corpus
Swedish
See 328 collected resources
Explore in:
Collection
Kubord 1
Word frequencies from modern newspaper texts from the National Library of Sweden
Corpus
Swedish
See 84 collected resources
Explore in:
Collection
Kubord 2
Word relations from modern newspaper texts from the National Library of Sweden
Corpus
Swedish
See 82 collected resources
Explore in:
Collection
Kubord-fasttext
A collection of fasttext models trained on modern newspaper texts from the National Library of Sweden
Model
Swedish
See 6 collected resources
Collection
Kvinnotidningar
Material from historical women's periodicals
Corpus
Swedish
See 7 collected resources
Explore in:
Collection
Läkartidningen medical journal
Corpus for health care technical language
Corpus
Swedish
See 11 collected resources
Explore in:
Collection
Learner Language
Learner Language is a collection of corpor and lexicons that describe learner language. Corpora include both texts/audio produced by language learners, as well as texts/language they are exposed to (reading or listening to, e.g. course book texts). Even some derivative resources based on these corpora are included in this collection.
Corpus
Swedish, Multiple languages
See 14 collected resources
Collection
Medieval letters
Swedish medieval charters from Diplomatarium Suecanum (Svenskt Diplomatariums huvudkartotek, SDHK)
Corpus
Latin, German, Norwegian, Swedish
See 5 collected resources
Explore in:
Collection
NPEGL
The Noun Phrases in Early Germanic Languages database.
Lexicon
Old English (ca. 450-1100), Old High German (ca. 750-1050), Old Norse, Old Saxon
See 5 collected resources
Explore in:
Collection
Old Finland Swedish
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
Corpus
Swedish
See 42 collected resources
Explore in:
Collection
Press
Swedish press
Corpus
Swedish
See 6 collected resources
Explore in:
Collection
Riksdag of the Estates
Collection of textual documents from the Swedish Riksdag of the Estates
Corpus
Swedish
See 7 collected resources
Explore in:
Collection
Riksdagens öppna data
Data from the Swedish parliament collected from data.riksdagen.se
Corpus
Swedish
See 21 collected resources
Explore in:
Collection
Somali corpora
A collection of Samli corpora
Corpus
Somali
See 26 collected resources
Explore in:
Collection
SuperLim 2
A standardized suite for evaluation and analysis of Swedish natural language understanding systems.
Corpus
Swedish
Dataset:
SuperLim-2-2.0.4.zip
2024-01-25 – 156.63 MB – CC BY 4.0
Dataset:
SuperLim_maintenance.odt
2024-01-25 – 16.96 KB
Collection
SVT news
News texts from svt.se
Corpus
Swedish
See 21 collected resources
Explore in:
Collection
SweLL
SweLL -- Swedish Learner Language -- is a collection of SweLL corpora and derivative resources coming from these corpora. SweLL corpora consisf of learner texts written by learners with other mother tongues than Swedish. All texts have been collected in test situations (none of them coming from home-written tasks).
Corpus
Swedish, Multiple languages
See 9 collected resources
Collection
SweLL-pilot
Essays written by adult learners of Swedish, manually labeled with the CEFR levels (a European scale of language proficiency levels within language learning). Collection period 2006-2015.
Corpus
Swedish
See 3 collected resources
Explore in:
Collection
Web News
News from Swedish newspapers' websites
Corpus
Swedish
See 13 collected resources
Explore in:
Close menu