A corpus with texts from the women's newspaper Hertha

Summary
Resource type: Corpus
Language: Swedish
Tokens: 3,842,984
Sentences: 291,135

Download
ub-kvt-hertha.xml.bz2: corpus (XML), licence: CC BY 4.0 (attribution)
stats_UB-KVT-HERTHA.txt: token frequencies (CSV), licence: CC BY 4.0 (attribution)
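
The corpus is distributed as bz2-compressed XML, so it can be read as a stream without unpacking it to disk first. The following is a minimal sketch, not an official loader: the helper name `count_elements` is hypothetical, and because this page does not describe the XML schema, the code makes no assumption about element names and simply tallies every tag it encounters.

```python
import bz2
import xml.etree.ElementTree as ET
from collections import Counter


def count_elements(path):
    """Stream a bz2-compressed XML file and count elements by tag name.

    Hypothetical helper: it assumes only that the file is well-formed XML,
    not any particular corpus schema.
    """
    counts = Counter()
    with bz2.open(path, "rb") as handle:
        # iterparse yields each element once it has been fully read ("end").
        for _event, elem in ET.iterparse(handle, events=("end",)):
            counts[elem.tag] += 1
            elem.clear()  # drop the element's content to keep memory use low
    return counts


if __name__ == "__main__":
    # Assumes the downloaded file sits in the current working directory.
    for tag, n in count_elements("ub-kvt-hertha.xml.bz2").most_common():
        print(tag, n)
```

If the file marks tokens and sentences with dedicated elements, two of the resulting counts should correspond to the token and sentence figures listed in the summary, which gives a quick check that the download is complete.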