General type | corpus |
Timespan | – |
Tokens | 1473608 |
Sentences | 107700 |
Ottesjö Cajsa, Institutionen för filosofi, lingvistik och vetenskapsteori; flov@flov.gu.se
gdc.xml.bz2 scrambled XML (236 bytes)
This file contains a scrambled version of the corpus.
License: CC-BY (attribution)
stats_GDC.txt statistics TXT (3.95MB)
License: CC-BY (attribution)