Part of The Amsterdam Slavic Parallel Aligned Corpus
General type | corpus |
Timespan | – |
Tokens | 667092 |
Sentences | 45008 |
Swedish tokens | 351322 |
Swedish sentences | 22565 |
Bulgarian tokens | 315770 |
Bulgarian sentences | 22443 |
Språkbanken; sb-info@svenska.gu.se
aspacsvbg-sv.xml.bz2 scrambled XML (4.08MB)
this file contains a scrambled version of the Swedish part of the corpus
License:
CC-BY (attribution)
aspacsvbg-bg.xml.bz2 scrambled XML (1.83MB)
this file contains a scrambled version of the Bulgarian part of the corpus
License:
CC-BY (attribution)
stats_ASPACSVBG-SV.txt statistics TXT (1.66MB)
this file contains statistics on the Swedish part of the corpus
License:
CC-BY (attribution)
stats_ASPACSVBG-BG.txt statistics TXT (1.42MB)
this file contains statistics on the Bulgarian part of the corpus
License:
CC-BY (attribution)