Skip to main content

Europarl: Swedish-Spanish

Standard reference Information

Philipp Koehn. 2005. Europarl: A Parallel Corpus for Statistical Machine Translation. In Proceedings of Machine Translation Summit X: Papers, pages 79–86, Phuket, Thailand [publication]

Data citation Information

Koehn, Philipp (2014). Europarl: Swedish-Spanish (updated: 2014-04-29). [Data set]. Enriched and distributed by Språkbanken. https://doi.org/10.23695/r97m-ty64
BibTeX Additional ways to cite the dataset.
The Swedish and Spanish parts of European Parliament Proceedings Parallel Corpus, release v3

Accessible through

Access Platform Licence
CC-BY-4.0

Download

File Size Modified Licence
europarl-sv.xml.bz2
this file contains the Swedish part of the corpus Information (XML)
354.92 MB 2013-11-18 CC-BY-4.0
europarl-es.xml.bz2
this file contains the Spanish part of the corpus Information (XML)
88.81 MB 2013-11-20 CC-BY-4.0
stats_EUROPARL-SV.txt.zip
this file contains statistics on the Swedish part of the corpus (CSV)
3.66 MB 2025-04-22 CC-BY-4.0
stats_EUROPARL-ES.txt.zip
this file contains statistics on the Spanish part of the corpus (CSV)
1015.94 KB 2025-04-22 CC-BY-4.0

Collection

Type

  • Corpus

Language

Swedish
Spanish

Size

Tokens: 70,962,605
Sentences: 2,846,835

Creators

  • Koehn, Philipp

Created

2007-09-28

Updated

2014-04-29

Contact

sb-info@svenska.gu.se