Skip to main content
Svenska
English
News and events
Research
Data
Analyses
Platforms
FAQ
About us
Contact us
Menu
Breadcrumb
Home
Language resources
Language resources
Language resources
On this page you can browse and search our datasets. Click on a row name to see what files are available for download. You can go directly to the search interface by clicking on the tool logo.
All (1367)
Collections (31)
Corpora (1223)
Lexicons (69)
Training and evaluation data (27)
Models (48)
Title
Free search
Language
- Any -
Swedish
Albanian
Arabic
Belarusian
Blissymbols
Bosnian
Bulgarian
Croatian
Czech
Danish
Dutch
English
Estonian
Faroese
Finland Swedish
Finnish
French
German
Icelandic
Iranian Persian
Italian
Kele (Papua New Guinea)
Kurdish
Latin
Latvian
Lower Sorbian
Macedonian
Modern Greek (1453-)
Multiple languages
Norwegian
Norwegian Bokmål
Old English (ca. 450-1100)
Old High German (ca. 750-1050)
Old Norse
Old Saxon
Polish
Portuguese
Romanian
Russian
Serbian
Slavomolisano
Slovak
Slovenian
Somali
Spanish
Turkish
Turkmen
Ukrainian
Upper Sorbian
Xhosa
Resurs
Antal tokens
Språk
Åtkomst
Finland Swedish: Vasabladet 2014
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
1,351,730
Swedish
Word statistics:
stats_VASABLADET2014.txt.zip
2025-04-22 – 1.34 MB – CC BY 4.0
Explore in:
Finland Swedish: Västra Nyland 2012–2013
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
1,837,323
Swedish
Word statistics:
stats_VASTRANYLAND.txt.zip
2025-04-22 – 1.5 MB – CC BY 4.0
Explore in:
Finland Swedish: Youth Literature 1992–2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
483,702
Swedish
Word statistics:
stats_UNGDOMSLITTERATUR.txt.zip
2025-04-22 – 399.97 KB – CC BY 4.0
Explore in:
Finland Swedish: Åbo Underrättelser 2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
1,176,307
Swedish
Word statistics:
stats_ABOUNDERRATTELSER2012.txt.zip
2025-04-22 – 1.14 MB – CC BY 4.0
Explore in:
Finland Swedish: Åbo Underrättelser 2013
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
956,462
Swedish
Word statistics:
stats_ABOUNDERRATTELSER2013.txt.zip
2025-04-22 – 1.03 MB – CC BY 4.0
Explore in:
Finland Swedish: Ålandstidningen 2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
1,180,214
Swedish
Word statistics:
stats_AT2012.txt.zip
2025-04-22 – 1.05 MB – CC BY 4.0
Explore in:
Finland Swedish: Österbottens tidning 2011
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
32,950
Finland Swedish
Word statistics:
stats_OSTERBOTTENSTIDNING2011.txt.zip
2025-04-22 – 91.81 KB – CC BY 4.0
Explore in:
Finland Swedish: Österbottens tidning 2012
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
217,453
Finland Swedish
Word statistics:
stats_OSTERBOTTENSTIDNING2012.txt.zip
2025-04-22 – 386.68 KB – CC BY 4.0
Explore in:
Finland Swedish: Österbottens tidning 2013
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
380,571
Finland Swedish
Word statistics:
stats_OSTERBOTTENSTIDNING2013.txt.zip
2025-04-22 – 541.09 KB – CC BY 4.0
Explore in:
Finland Swedish: Österbottens Tidning 2014
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
939,380
Swedish
Word statistics:
stats_OSTERBOTTENSTIDNING2014.txt.zip
2025-04-22 – 1018.09 KB – CC BY 4.0
Explore in:
Finland Swedish: Östra Nyland 2012–2013
Part of the Finland Swedish language bank over Swedish in Finland today and yesterday
690,133
Swedish
Word statistics:
stats_OSTRANYLAND.txt.zip
2025-04-22 – 783.74 KB – CC BY 4.0
Explore in:
Collection
Flashback
Material from the Flashback internet forum
Swedish
See 16 collected resources
Explore in:
Flashback: About Flashback
Material from the Flashback internet forum.
35,051,270
Swedish
Dataset:
flashback-flashback.xml.bz2
2025-04-15 – 916.03 MB – CC BY 4.0
Word statistics:
stats_flashback-flashback.csv.zip
2025-04-15 – 6.28 MB – CC BY 4.0
Explore in:
Flashback: Computers & IT
Material from the Flashback internet forum.
319,364,042
Swedish
Dataset:
flashback-dator.xml.bz2
2025-04-22 – 7.69 GB – CC BY 4.0
Word statistics:
stats_flashback-dator.csv.zip
2025-04-22 – 39.05 MB – CC BY 4.0
Explore in:
Flashback: Culture & Media
Material from the Flashback internet forum.
490,912,727
Swedish
Dataset:
flashback-kultur.xml.bz2
2025-04-25 – 12.52 GB – CC BY 4.0
Word statistics:
stats_flashback-kultur.csv.zip
2025-04-25 – 58.3 MB – CC BY 4.0
Explore in:
Flashback: Drugs
Material from the Flashback internet forum.
251,658,977
Swedish
Dataset:
flashback-droger.xml.bz2
2025-04-23 – 6.42 GB – CC BY 4.0
Word statistics:
stats_flashback-droger.csv.zip
2025-04-23 – 21.32 MB – CC BY 4.0
Explore in:
Flashback: Economy
Material from the Flashback internet forum.
145,991,925
Swedish
Dataset:
flashback-ekonomi.xml.bz2
2025-04-26 – 3.88 GB – CC BY 4.0
Word statistics:
stats_flashback-ekonomi.csv.zip
2025-04-26 – 16.93 MB – CC BY 4.0
Explore in:
Flashback: Food, Beverages & Tobacco
Material from the Flashback internet forum.
80,643,621
Swedish
Dataset:
flashback-mat.xml.bz2
2025-04-28 – 2.16 GB – CC BY 4.0
Word statistics:
stats_flashback-mat.csv.zip
2025-04-28 – 13.7 MB – CC BY 4.0
Explore in:
Flashback: Home, Housing & Family
Material from the Flashback internet forum.
384,217,183
Swedish
Dataset:
flashback-hem.xml.bz2
2025-04-24 – 9.91 GB – CC BY 4.0
Word statistics:
stats_flashback-hem.csv.zip
2025-04-24 – 27.01 MB – CC BY 4.0
Explore in:
Flashback: Lifestyle
Material from the Flashback internet forum.
113,600,855
Swedish
Dataset:
flashback-livsstil.xml.bz2
2025-04-28 – 2.9 GB – CC BY 4.0
Word statistics:
stats_flashback-livsstil.csv.zip
2025-04-28 – 19.46 MB – CC BY 4.0
Explore in:
Flashback: Other
Material from the Flashback internet forum.
166,922,862
Swedish
Dataset:
flashback-ovrigt.xml.bz2
2025-04-28 – 4.34 GB – CC BY 4.0
Word statistics:
stats_flashback-ovrigt.csv.zip
2025-04-28 – 25.58 MB – CC BY 4.0
Explore in:
Flashback: Politics
Material from the Flashback internet forum.
865,635,058
Swedish
Dataset:
flashback-politik.xml.bz2
2025-05-09 – 23.27 GB – CC BY 4.0
Word statistics:
stats_flashback-politik.csv.zip
2025-05-09 – 65.57 MB – CC BY 4.0
Explore in:
Flashback: Science & Humanities
Material from the Flashback internet forum.
502,126,049
Swedish
Dataset:
flashback-vetenskap.xml.bz2
2025-05-05 – 12.99 GB – CC BY 4.0
Word statistics:
stats_flashback-vetenskap.csv.zip
2025-05-05 – 45.43 MB – CC BY 4.0
Explore in:
Flashback: Sex
Material from the Flashback internet forum.
95,472,664
Swedish
Dataset:
flashback-sex.xml.bz2
2025-04-23 – 2.47 GB – CC BY 4.0
Word statistics:
stats_flashback-sex.csv.zip
2025-04-23 – 13.2 MB – CC BY 4.0
Explore in:
Flashback: Society
Material from the Flashback internet forum.
855,882,897
Swedish
Dataset:
flashback-samhalle.xml.bz2
2025-05-16 – 22.62 GB – CC BY 4.0
Word statistics:
stats_flashback-samhalle.csv.zip
2025-05-16 – 68.49 MB – CC BY 4.0
Explore in:
Flashback: Sports & Fitness
Material from the Flashback internet forum.
279,549,842
Swedish
Dataset:
flashback-sport.xml.bz2
2025-04-29 – 7.14 GB – CC BY 4.0
Word statistics:
stats_flashback-sport.csv.zip
2025-04-29 – 26.83 MB – CC BY 4.0
Explore in:
Flashback: Travel
Material from the Flashback internet forum.
39,293,276
Swedish
Dataset:
flashback-resor.xml.bz2
2025-04-14 – 1.03 GB – CC BY 4.0
Word statistics:
stats_flashback-resor.csv.zip
2025-04-14 – 7.72 MB – CC BY 4.0
Explore in:
Flashback: Vehicles & Traffic
Material from the Flashback internet forum.
88,143,880
Swedish
Dataset:
flashback-fordon.xml.bz2
2025-04-24 – 2.37 GB – CC BY 4.0
Word statistics:
stats_flashback-fordon.csv.zip
2025-04-24 – 13.1 MB – CC BY 4.0
Explore in:
Folkbiblioteksbladet
Annual volumes 1904–1908 of the Folkbiblioteksbladet journal
398,330
Swedish
Dataset:
runeberg-folkbbl.xml.bz2
2014-12-08 – 4.78 MB – CC BY 4.0
Word statistics:
stats_RUNEBERG-FOLKBBL.txt.zip
2025-04-22 – 659.85 KB – CC BY 4.0
Explore in:
Folke
Historical folklore records from Isof archives
3,642,351
Swedish
Dataset:
sbs-folke.xml.bz2
2024-12-03 – 62.79 MB – CC BY 4.0
Word statistics:
stats_sbs-folke.csv.zip
2025-04-22 – 1.57 MB – CC BY 4.0
Explore in:
Folkets Röst 1850's
Part of the collection Kubhist2
24,047,961
Swedish
Dataset:
kubhist2-folketsrost-1850.xml.bz2
2024-01-10 – 770.24 MB – CC BY 4.0
Word statistics:
stats_kubhist2-folketsrost-1850.csv.zip
2025-04-22 – 11.65 MB – CC BY 4.0
Explore in:
Collection
Fornsvenska textbankens material
A collection of Old Swedish texts from Fornsvenska textbanken
Swedish
See 12 collected resources
Explore in:
Fornsvenska textbankens material: Dalin's Then Swänska Argus 1732-1734
Thän Swänska Argus (1732–1734) by Olof Dalins digitized by Fornsvenska textbanken
260,093
Swedish
Dataset:
fsv-nysvenskdalin.xml.bz2
2013-03-26 – 1009.45 KB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Nysvenska bibelböcker
44,990
Swedish
Dataset:
fsv-nysvenskbibel.xml.bz2
2013-03-26 – 120.62 KB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Nysvenska krönikor
333,670
Swedish
Dataset:
fsv-nysvenskkronikor.xml.bz2
2013-03-26 – 825.09 KB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Nysvenska lagar
22,701
Swedish
Dataset:
fsv-nysvensklagar.xml.bz2
2013-03-26 – 70.27 KB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Nysvenska, övrigt
344,590
Swedish
Dataset:
fsv-nysvenskovrigt.xml.bz2
2013-03-26 – 1.08 MB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Profan prosa
239,805
Swedish
Dataset:
fsv-profanprosa.xml.bz2
2013-03-26 – 2.02 MB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Verser
184,059
Swedish
Dataset:
fsv-verser.xml.bz2
2013-03-07 – 1.62 MB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Yngre lagar
117,773
Swedish
Dataset:
fsv-yngrelagar.xml.bz2
2013-03-07 – 936.18 KB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Yngre religiös prosa
1,365,544
Swedish
Dataset:
fsv-yngrereligiosprosa.xml.bz2
2013-03-26 – 12.08 MB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Yngre tänkeböcker
160,831
Swedish
Dataset:
fsv-yngretankebocker.xml.bz2
2013-03-26 – 1.38 MB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Äldre lagar
531,192
Swedish
Dataset:
fsv-aldrelagar.xml.bz2
2013-10-08 – 4.18 MB – CC BY 4.0
Explore in:
Fornsvenska textbankens material: Äldre religiös prosa
491,605
Swedish
Dataset:
fsv-aldrereligiosprosa.xml.bz2
2013-03-26 – 4.15 MB – CC BY 4.0
Explore in:
Forskning & Framsteg
A corpus of popular science articles from Forskning & Framsteg
743,831
Swedish
Dataset:
fof.xml.bz2
2017-03-22 – 12.5 MB – CC BY 4.0
Word statistics:
stats_FOF.txt.zip
2025-04-22 – 824.5 KB – CC BY 4.0
Explore in:
FTS - Faroese text collection
Faroese text collection, in cooperation with University of Faroe Islands, Fróðskaparsetur Føroya, http://www.setur.fo/
1,142,289
Faroese
Dataset:
fts.xml.bz2
2015-05-27 – 7.22 MB – CC BY 4.0
Explore in:
Förvaltningsmyndigheters texter
51,366
Swedish
Dataset:
klarsprak.xml.bz2
2017-03-30 – 791.21 KB – CC BY 4.0
Word statistics:
stats_KLARSPRAK.txt.zip
2025-04-22 – 76.68 KB – CC BY 4.0
Explore in:
Gothenburg Dialogue Corpus (GDC)
GDC is a collection of 360 individual dialogues transcribed from recordings.
1,473,608
Swedish
Word statistics:
stats_GDC.txt.zip
2025-04-22 – 800.3 KB – CC BY 4.0
Explore in:
GP – Två dagar
Weekend appendix of Göteborgs-Posten
1,033,747
Swedish
Dataset:
gp2d.xml.bz2
2017-02-07 – 16.18 MB – CC BY 4.0
Word statistics:
stats_GP2D.txt.zip
2025-04-22 – 1.18 MB – CC BY 4.0
Explore in:
GP 1994
A corpus with texts from Göteborgsposten
21,331,715
Swedish
Dataset:
gp1994.xml.bz2
2017-02-08 – 369.45 MB – CC BY 4.0
Word statistics:
stats_GP1994.txt.zip
2025-04-22 – 7.21 MB – CC BY 4.0
Explore in:
GP 2001
A corpus with texts from Göteborgsposten
17,411,332
Swedish
Dataset:
gp2001.xml.bz2
2017-02-06 – 286.6 MB – CC BY 4.0
Word statistics:
stats_GP2001.txt.zip
2025-04-22 – 6.36 MB – CC BY 4.0
Explore in:
GP 2002
A corpus with texts from Göteborgsposten
21,064,282
Swedish
Dataset:
gp2002.xml.bz2
2017-02-06 – 345.22 MB – CC BY 4.0
Word statistics:
stats_GP2002.txt.zip
2025-04-22 – 6.97 MB – CC BY 4.0
Explore in:
GP 2003
A corpus with texts from Göteborgsposten
19,064,063
Swedish
Dataset:
gp2003.xml.bz2
2017-02-07 – 314.46 MB – CC BY 4.0
Word statistics:
stats_GP2003.txt.zip
2025-04-22 – 6.64 MB – CC BY 4.0
Explore in:
GP 2004
A corpus with texts from Göteborgsposten
22,350,225
Swedish
Dataset:
gp2004.xml.bz2
2017-02-07 – 367.72 MB – CC BY 4.0
Word statistics:
stats_GP2004.txt.zip
2025-04-22 – 7.67 MB – CC BY 4.0
Explore in:
GP 2005
A corpus with texts from Göteborgsposten
23,153,863
Swedish
Dataset:
gp2005.xml.bz2
2017-02-07 – 378.57 MB – CC BY 4.0
Word statistics:
stats_GP2005.txt.zip
2025-04-22 – 7.72 MB – CC BY 4.0
Explore in:
GP 2006
A corpus with texts from Göteborgsposten
22,469,453
Swedish
Dataset:
gp2006.xml.bz2
2017-02-07 – 372.32 MB – CC BY 4.0
Word statistics:
stats_GP2006.txt.zip
2025-04-22 – 7.52 MB – CC BY 4.0
Explore in:
GP 2007
A corpus with texts from Göteborgsposten
18,604,742
Swedish
Dataset:
gp2007.xml.bz2
2017-02-07 – 319.44 MB – CC BY 4.0
Word statistics:
stats_GP2007.txt.zip
2025-04-22 – 6.41 MB – CC BY 4.0
Explore in:
GP 2008
A corpus with texts from Göteborgsposten
16,982,289
Swedish
Dataset:
gp2008.xml.bz2
2017-10-12 – 454.31 MB – CC BY 4.0
Word statistics:
stats_GP2008.txt.zip
2025-04-22 – 6.67 MB – CC BY 4.0
Explore in:
GP 2009
A corpus with texts from Göteborgsposten
17,355,048
Swedish
Dataset:
gp2009.xml.bz2
2017-02-01 – 315.47 MB – CC BY 4.0
Word statistics:
stats_GP2009.txt.zip
2025-04-22 – 6.02 MB – CC BY 4.0
Explore in:
GP 2010
A corpus with texts from Göteborgsposten
17,211,065
Swedish
Dataset:
gp2010.xml.bz2
2017-02-01 – 311.44 MB – CC BY 4.0
Word statistics:
stats_GP2010.txt.zip
2025-04-22 – 5.86 MB – CC BY 4.0
Explore in:
GP 2011
A corpus with texts from Göteborgsposten
19,938,391
Swedish
Dataset:
gp2011.xml.bz2
2017-02-01 – 348.44 MB – CC BY 4.0
Word statistics:
stats_GP2011.txt.zip
2025-04-22 – 5.85 MB – CC BY 4.0
Explore in:
GP 2012
A corpus with texts from Göteborgsposten
17,162,039
Swedish
Dataset:
gp2012.xml.bz2
2017-02-01 – 300.38 MB – CC BY 4.0
Word statistics:
stats_GP2012.txt.zip
2025-04-22 – 5.48 MB – CC BY 4.0
Explore in:
GP 2013
A corpus with texts from Göteborgsposten
16,872,043
Swedish
Dataset:
gp2013.xml.bz2
2017-02-01 – 317.84 MB – CC BY 4.0
Word statistics:
stats_GP2013.txt.zip
2025-04-22 – 5.15 MB – CC BY 4.0
Explore in:
GU-Journal
Texts from GU Journalen
4,697,018
Swedish
Dataset:
gujournalen.xml.bz2
2025-08-14 – 93.28 MB – CC BY 4.0
Word statistics:
stats_gujournalen.csv.zip
2025-08-14 – 2.4 MB – CC BY 4.0
Explore in:
Gustav Vasa's bible - New testament
The new testament in Gustav Vasa's Bible published by Natan Lidqvist in 1941
469,706
Swedish
Dataset:
vasabibel-nt.xml.bz2
2015-05-26 – 3.04 MB – CC BY 4.0
Word statistics:
stats_VASABIBEL-NT.txt.zip
2025-04-22 – 307.14 KB – CC BY 4.0
Explore in:
Gustav Vasa's letter production
King Gustav I's registry
253,642
Swedish
Dataset:
vasabrev.xml.bz2
2015-05-26 – 2.02 MB – CC BY 4.0
Word statistics:
stats_VASABREV.txt.zip
2025-04-22 – 401.14 KB – CC BY 4.0
Explore in:
Collection
Göteborgsposten
A corpus with texts from the newspaper Göteborgs-Posten
Swedish
See 14 collected resources
Explore in:
Göteborgsposten 1860's
Part of the collection Kubhist2
56,042,023
Swedish
Dataset:
kubhist2-goteborgsposten-1860.xml.bz2
2024-01-16 – 1.75 GB – CC BY 4.0
Word statistics:
stats_kubhist2-goteborgsposten-1860.csv.zip
2025-04-22 – 13.74 MB – CC BY 4.0
Explore in:
InterFra
Corpus of French spoken by Swedish students
1,132,307
French
Dataset:
interfra.xml.bz2
2016-06-16 – 2.48 MB – CC BY 4.0
Word statistics:
stats_INTERFRA.txt.zip
2025-04-22 – 158 KB – CC BY 4.0
Explore in:
InterFra Swedish
To promote research in the field of French L2 second language acquisition in a developmental, interactional and variationist perspective. The HLP (High Level Proficiency in Second language use) project also investigates learners of other L2s such as Swedish, Spanish, English and Italian.
50,993
Swedish
Dataset:
interfra-sv.xml.bz2
2016-06-16 – 406.96 KB – CC BY 4.0
Word statistics:
stats_INTERFRA-SV.txt.zip
2025-04-22 – 54.17 KB – CC BY 4.0
Explore in:
InterFra tagged
To promote research in the field of French L2 second language acquisition in a developmental, interactional and variationist perspective. The HLP (High Level Proficiency in Second language use) project also investigates learners of other L2s such as Swedish, Spanish, English and Italian.
425,771
French
Dataset:
interfra-tagged.xml.bz2
2016-06-15 – 979.79 KB – CC BY 4.0
Word statistics:
stats_INTERFRA-TAGGED.txt.zip
2025-04-22 – 77.36 KB – CC BY 4.0
Explore in:
Interrogations
The corpus is protected, contact Ylva Burman (ylva.byrman@svenska.gu.se) for more information and access.
49,299
Swedish
Explore in:
IVIP
Interaction and Variation in Pluricentric Languages – Communicative Patterns in Sweden Swedish and Finland Swedish
374,387
Swedish
Word statistics:
stats_ivip.csv.zip
2025-04-22 – 205.31 KB – CC BY 4.0
Explore in:
IVIP demo
Interaction and Variation in Pluricentric Languages – Communicative Patterns in Sweden Swedish and Finland Swedish.
572
Swedish
Dataset:
ivip-demo.xml.bz2
2019-08-12 – 13.1 KB – CC BY 4.0
Word statistics:
stats_ivip-demo.csv.zip
2025-04-22 – 3.06 KB – CC BY 4.0
Explore in:
Jubilee Archive (pilot)
Parts of the Gothenburg University Library's digitized collection of newspaper clippings about the Gothenburg Tercentennial Jubilee Exposition in 2023
1,290,763
Swedish
Dataset:
jubileumsarkivet-pilot.xml.bz2
2023-03-29 – 32.45 MB – CC BY 4.0
Word statistics:
stats_jubileumsarkivet-pilot.csv.zip
2025-04-22 – 1.54 MB – CC BY 4.0
Explore in:
Judgements
32,206,334
Swedish
Dataset:
moderntdv.xml.bz2
2015-05-20 – 295.47 MB – CC BY 4.0
Word statistics:
stats_MODERNTDV.txt.zip
2025-04-22 – 4.25 MB – CC BY 4.0
Explore in:
KBs digitaliserade SOU:er (1922–1996)
Statens offentliga utredningar (SOU) i digitaliserat format från <a href="https://sou.kb.se" target="_blank">Kungliga biblioteket</a>. Samlingen är inte komplett.
428,188,012
Swedish
Dataset:
sou.xml.bz2
2017-07-02 – 6.31 GB – CC BY 4.0
Word statistics:
stats_SOU.txt.zip
2025-04-22 – 73.34 MB – CC BY 4.0
Explore in:
Collection
Kubhist
Diachronic collection of Swedish historical newspaper texts from the period of 1749–1926
Swedish
See 78 collected resources
Collection
Kubhist 2
Diachronic collection of Swedish historical newspaper texts from the period of 1645–1926. Kubhist 2 is an updated version av Kubhist with improved OCR and more material.
Swedish
See 328 collected resources
Explore in:
Kubhist 2: Aftonbladet 1840's
Part of the collection Kubhist 2
62,967,761
Swedish
Dataset:
kubhist2-aftonbladet-1840.xml.bz2
2021-07-08 – 2.13 GB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1840.csv.zip
2025-04-22 – 36.97 MB – CC BY 4.0
Explore in:
Kubhist 2: Aftonbladet 1850's
Part of the collection Kubhist 2
82,212,220
Swedish
Dataset:
kubhist2-aftonbladet-1850.xml.bz2
2021-08-18 – 2.78 GB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1850.csv.zip
2025-04-22 – 45.11 MB – CC BY 4.0
Explore in:
Kubhist 2: Aftonbladet 1860's
Part of the collection Kubhist 2
94,574,658
Swedish
Dataset:
kubhist2-aftonbladet-1860.xml.bz2
2021-08-18 – 3.22 GB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1860.csv.zip
2025-04-22 – 48.6 MB – CC BY 4.0
Explore in:
Kubhist 2: Aftonbladet 1870's
Part of the collection Kubhist 2
96,758,392
Swedish
Dataset:
kubhist2-aftonbladet-1870.xml.bz2
2021-08-18 – 3.3 GB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1870.csv.zip
2025-04-22 – 49.19 MB – CC BY 4.0
Explore in:
Kubhist 2: Aftonbladet 1880's
Part of the collection Kubhist 2
100,382,714
Swedish
Dataset:
kubhist2-aftonbladet-1880.xml.bz2
2021-08-19 – 3.44 GB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1880.csv.zip
2025-04-22 – 45.88 MB – CC BY 4.0
Explore in:
Kubhist 2: Aftonbladet 1890's
Part of the collection Kubhist 2
101,943,656
Swedish
Dataset:
kubhist2-aftonbladet-1890.xml.bz2
2021-08-20 – 3.5 GB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1890.csv.zip
2025-04-22 – 44.1 MB – CC BY 4.0
Explore in:
Kubhist 2: Aftonbladet 1900's
Part of the collection Kubhist 2
11,031,971
Swedish
Dataset:
kubhist2-aftonbladet-1900.xml.bz2
2021-08-20 – 383.12 MB – CC BY 4.0
Word statistics:
stats_kubhist2-aftonbladet-1900.csv.zip
2025-04-22 – 9.26 MB – CC BY 4.0
Explore in:
Kubhist 2: Alfwar och Skämt 1840's
Part of the collection Kubhist2
806,471
Swedish
Dataset:
kubhist2-alfwarochskamt-1840.xml.bz2
2024-01-04 – 24.47 MB – CC BY 4.0
Word statistics:
stats_kubhist2-alfwarochskamt-1840.csv.zip
2025-04-22 – 1.07 MB – CC BY 4.0
Explore in:
Kubhist 2: Barometern 1840's
Part of the collection Kubhist 2
6,950,745
Swedish
Dataset:
kubhist2-barometern-1840.xml.bz2
2021-08-20 – 212.15 MB – CC BY 4.0
Word statistics:
stats_kubhist2-barometern-1840.csv.zip
2025-04-22 – 5.38 MB – CC BY 4.0
Explore in:
Kubhist 2: Barometern 1850's
Part of the collection Kubhist 2
11,384,016
Swedish
Dataset:
kubhist2-barometern-1850.xml.bz2
2021-08-20 – 344.89 MB – CC BY 4.0
Word statistics:
stats_kubhist2-barometern-1850.csv.zip
2025-04-22 – 8.22 MB – CC BY 4.0
Explore in:
Kubhist 2: Barometern 1860's
Part of the collection Kubhist 2
15,930,572
Swedish
Dataset:
kubhist2-barometern-1860.xml.bz2
2021-08-20 – 484.81 MB – CC BY 4.0
Word statistics:
stats_kubhist2-barometern-1860.csv.zip
2025-04-22 – 10.34 MB – CC BY 4.0
Explore in:
Kubhist 2: Barometern 1870's
Part of the collection Kubhist2
25,488,681
Swedish
Dataset:
kubhist2-barometern-1870.xml.bz2
2024-01-06 – 789.65 MB – CC BY 4.0
Word statistics:
stats_kubhist2-barometern-1870.csv.zip
2025-04-22 – 9.74 MB – CC BY 4.0
Explore in:
Kubhist 2: Barometern 1880's
Part of the collection Kubhist 2
39,808,549
Swedish
Dataset:
kubhist2-barometern-1880.xml.bz2
2021-08-20 – 1.22 GB – CC BY 4.0
Word statistics:
stats_kubhist2-barometern-1880.csv.zip
2025-04-22 – 16.12 MB – CC BY 4.0
Explore in:
Kubhist 2: Barometern 1890's
Part of the collection Kubhist 2
28,408,936
Swedish
Dataset:
kubhist2-barometern-1890.xml.bz2
2021-08-20 – 894.76 MB – CC BY 4.0
Word statistics:
stats_kubhist2-barometern-1890.csv.zip
2025-04-22 – 12.39 MB – CC BY 4.0
Explore in:
Kubhist 2: Blekingsposten 1850's
Part of the collection Kubhist 2
5,951,507
Swedish
Dataset:
kubhist2-blekingsposten-1850.xml.bz2
2021-08-20 – 178.6 MB – CC BY 4.0
Word statistics:
stats_kubhist2-blekingsposten-1850.csv.zip
2025-04-22 – 5.04 MB – CC BY 4.0
Explore in:
Kubhist 2: Blekingsposten 1860's
Part of the collection Kubhist2
11,165,744
Swedish
Dataset:
kubhist2-blekingsposten-1860.xml.bz2
2024-01-06 – 345.68 MB – CC BY 4.0
Word statistics:
stats_kubhist2-blekingsposten-1860.csv.zip
2025-04-22 – 6.2 MB – CC BY 4.0
Explore in:
Kubhist 2: Blekingsposten 1870's
Part of the collection Kubhist 2
13,863,380
Swedish
Dataset:
kubhist2-blekingsposten-1870.xml.bz2
2021-08-20 – 453.83 MB – CC BY 4.0
Word statistics:
stats_kubhist2-blekingsposten-1870.csv.zip
2025-04-22 – 8.14 MB – CC BY 4.0
Explore in:
Kubhist 2: Blekingsposten 1880's
Part of the collection Kubhist 2
7,493,604
Swedish
Dataset:
kubhist2-blekingsposten-1880.xml.bz2
2021-08-21 – 250.2 MB – CC BY 4.0
Word statistics:
stats_kubhist2-blekingsposten-1880.csv.zip
2025-04-22 – 4.22 MB – CC BY 4.0
Explore in:
Kubhist 2: Bollnäs Tidning 1870's
Part of the collection Kubhist 2
2,808,820
Swedish
Dataset:
kubhist2-bollnastidning-1870.xml.bz2
2021-08-21 – 92.9 MB – CC BY 4.0
Word statistics:
stats_kubhist2-bollnastidning-1870.csv.zip
2025-04-22 – 2.45 MB – CC BY 4.0
Explore in:
Kubhist 2: Bollnäs Tidning 1880's
Part of the collection Kubhist2
285,091
Swedish
Dataset:
kubhist2-bollnastidning-1880.xml.bz2
2024-01-06 – 9.44 MB – CC BY 4.0
Word statistics:
stats_kubhist2-bollnastidning-1880.csv.zip
2025-04-22 – 481.5 KB – CC BY 4.0
Explore in:
Kubhist 2: Borås Tidning 1830's
Part of the collection Kubhist 2
198,689
Swedish
Dataset:
kubhist2-borastidning-1830.xml.bz2
2021-08-21 – 6 MB – CC BY 4.0
Word statistics:
stats_kubhist2-borastidning-1830.csv.zip
2025-04-22 – 458.03 KB – CC BY 4.0
Explore in:
Pagination
First page
« First
Previous page
‹ Previous
Page
1
Page
2
Page
3
Page
4
Page
5
Page
6
Page
7
Page
8
Page
9
Page
10
Page
11
Page
12
Page
13
Next page
Next ›
Last page
Last »
News and events
News archive
Blog
Calendar
Conferences and workshops
CLT retreat 2020
AI Trust workshop
Autumn Workshop
Höstworkshop 2025
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
CDLC workshop
CLT workshop Spring 2023
EACL 2014
Korp Workshop
Korp Workshop 2014
Korpworkshop 2018
NoDaLiDa 2017
RESOURCEFUL
SLTC 2020
Programme
Instructions
People
Support
Call for papers
Sustainable language representations
Position statements
Workshop on Profiling second language vocabulary and grammar - 2023
Research
Publications
Doktorandutbildning
For PhD students and supervisors
Data
Analyses
Platforms
Korp
User manual
Web API
Distribution and development
Corpus statistics
Sentence sets
Karp
Web API
Sparv
Web Sparv - User Manual
Web service (API)
Web Sparv - Technical Documentation
Mink
User manual
Tutorial
Video: Overview (sv)
Web API
Privacy and data policy
Strix
Lärka
Other tools
Catta
IT-baserad grammatikinlärning
FAQ
About us
Staff
Organisation
Språkbanken Text around the world
Språkbanken 50 years
Celebration
A brief history
Studera språkteknologi
PhD program
Teaching
How to cite
Alumni
Meetings and workshops
Kick-off meetings
Kick-off H2021
Kick-off V2021
Kick-off H2020
Kick-off V2020
Kick-off H2019
Kick-off V2019
Kick-off H2018
Kick-off V2018
Kick-off H2017
Kick-off V2017
Kick-off H2016
Kick-off V2016
Kick-off H2015
Workshops
End of the year workshop & APT 2025
End of the year workshop 2024
End of the year workshop 2023
Semester workshop 2022
Semester workshop H2021
Semester workshop V2021
Semester workshop H2020
Semester workshop V2020
Forskningsmöten
SBX Retreat
SBX Retreat 2024
SBX Retreat 2023
SBX Retreat 2022
Working group meetings
Cookies
Internal
Contact us
Help desk