Part of the European Parliament Proceedings Parallel Corpus
General type | corpus |
Timespan | – |
Tokens | 68134508 |
Sentences | 2886072 |
Swedish tokens | 33406922 |
Swedish sentences | 1475195 |
German tokens | 34727586 |
German sentences | 1410877 |
Språkbanken; sb-info@svenska.gu.se
europarl-sv.xml.bz2 scrambled XML (354.92MB)
This file contains a scrambled version of the Swedish part of the corpus.
License:
CC-BY (attribution)
europarl-de.xml.bz2 scrambled XML (83.10MB)
This file contains a scrambled version of the German part of the corpus.
License:
CC-BY (attribution)
stats_EUROPARL-SV.txt statistics TXT (18.20MB)
This file contains statistics on the Swedish part of the corpus.
License:
CC-BY (attribution)
stats_EUROPARL-DE.txt statistics TXT (8.14MB)
This file contains statistics on the German part of the corpus.
License:
CC-BY (attribution)
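The XML downloads above are bz2-compressed and large, so it may be more practical to stream them than to decompress them to disk first. The following is a minimal sketch in Python using only the standard library. The element names inside the files are not documented on this page, so the sketch makes no assumption about them and simply counts every element by tag name. The function name count_tags is hypothetical and chosen for illustration.

```python
import bz2
import xml.etree.ElementTree as ET
from collections import Counter


def count_tags(path):
    """Stream a bz2-compressed XML file and count its elements by tag name.

    No particular schema is assumed: every element that is closed during
    parsing is counted under whatever tag name it carries.
    """
    counts = Counter()
    # bz2.open decompresses on the fly, so the full XML is never held in memory.
    with bz2.open(path, "rb") as stream:
        for _event, elem in ET.iterparse(stream, events=("end",)):
            counts[elem.tag] += 1
            # Drop the element's text, attributes, and children once counted.
            elem.clear()
    return counts
```

Calling count_tags("europarl-sv.xml.bz2") and printing the result of most_common() on the returned counter would list the tag names actually present in the file. If the corpus marks sentences and tokens as elements, their counts could then be compared with the figures in the table above, but that depends on the file's actual markup.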