Skip to main content

GP 2002

A corpus with texts from Göteborgsposten
File Storlek Modified Licence
gp2002.xml.bz2
this file contains a scrambled version of the corpus
345.22 MB 2017-02-06 CC BY 4.0
attribution
stats_GP2002.txt
token frequencies (CSV)
32.82 MB 2017-02-12 CC BY 4.0
attribution

Collection

Göteborgsposten

Type

  • Corpus

Language

Swedish

Storlek

Tokens: 21,064,282
Sentences: 1,629,942

Contact

Språkbanken
sb-info@svenska.gu.se