A corpus with texts from Göteborgsposten

Summary
Resource type: Corpus
Language: Swedish
Tokens: 22,350,225
Sentences: 1,361,398

Download
gp2004.xml.bz2: corpus (XML), scrambled, licence: CC BY 4.0 (attribution)
press other, licence: CC BY 4.0 (attribution)
stats_GP2004.txt: token frequencies (CSV), licence: CC BY 4.0 (attribution)