Språkbanken Text is a department within Språkbanken.

SAOB1950

Citation Information

Språkbanken Text (2023). SAOB1950 (updated: 2023-11-30). [Data set]. Språkbanken Text. https://doi.org/10.23695/zph6-en76
Scanned books from 1950 to 2007 that are used as source material for updating SAOB, with a selection that reflects the Swedish vocabulary during the 20th century.

SAOB1950 was created as complementary research material for updating the Swedish Academy Dictionary (SAOB). It contains books from 1950 to 2007 that were borrowed from Lund University Library and scanned at the Swedish Academy's dictionary editorial office in Lund. The works were selected to give a representative overview of Swedish vocabulary, mainly during the second half of the 20th century. About half of the works were selected, either from the National Biography or from Lund University Library's catalog, with the aim of covering a wide range of subject areas and achieving a relatively even distribution of authors' gender. The other half was selected more randomly: books with a certain placement in the library's stacks were included in the corpus without regard to subject area or the authors' gender.

For all works, the year of publication, author, and title are provided. In many cases, the author's gender (M for male, K for female, from Swedish "kvinna") and subject area are also indicated by the library signum (a translation of these signa can be found here: https://libris.kb.se/subjecttree.jsp).

Files

saob-bocker.xml.bz2
Description: scrambled version of the corpus (XML)
Size: 1006.14 MB
Modified: 2023-11-30
Licence: CC BY 4.0 (attribution)

stats_saob-bocker.csv
Description: word statistics (CSV)
Size: 367.54 MB
Modified: 2023-12-01
Licence: CC BY 4.0 (attribution)
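
Because the corpus file is a large bz2-compressed XML document, it is usually more practical to stream it than to decompress and load it whole. The sketch below is one possible way to take a first look with the Python standard library. The XML schema is not described on this page, so the code makes no assumption about element names and only counts whichever tags it encounters. The function name `count_tags` and its `limit` parameter are hypothetical, introduced here for illustration.

```python
import bz2
import xml.etree.ElementTree as ET
from collections import Counter


def count_tags(path, limit=None):
    """Stream a bz2-compressed XML file and count how often each tag occurs.

    `limit` is a hypothetical convenience parameter: when set, parsing stops
    after that many elements have been closed, so a quick inspection does not
    have to read the whole file.
    """
    counts = Counter()
    seen = 0
    # bz2.open decompresses on the fly; iterparse reads the stream incrementally.
    with bz2.open(path, "rb") as handle:
        for _, elem in ET.iterparse(handle, events=("end",)):
            counts[elem.tag] += 1
            seen += 1
            # Drop the element's text, attributes, and children once counted,
            # so finished subtrees do not accumulate in memory.
            elem.clear()
            if limit is not None and seen >= limit:
                break
    return counts
```

A call such as `count_tags("saob-bocker.xml.bz2", limit=100_000)` would report which element names appear near the start of the file, which can then guide a more specific parser.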

Type

  • Corpus

Language

Swedish

Size

Sentences: 3,773,042
Tokens: 50,285,466

Updated

2023-11-30

Contact

Språkbanken
sb-info@svenska.gu.se