Europarl: Swedish-Finnish

Standard reference

Philipp Koehn. 2005. Europarl: A Parallel Corpus for Statistical Machine Translation. In Proceedings of Machine Translation Summit X: Papers, pages 79–86, Phuket, Thailand [publication]

Data citation

Koehn, Philipp (2014). Europarl: Swedish-Finnish (updated: 2014-04-29). [Data set]. Enriched and distributed by Språkbanken. https://doi.org/10.23695/fs50-et41
The Swedish and Finnish parts of the European Parliament Proceedings Parallel Corpus (Europarl), release v3.

Accessible through

Access Platform Licence
CC-BY-4.0

Download

File                        Size       Modified    Licence
europarl-sv.xml.bz2         354.92 MB  2013-11-18  CC-BY-4.0
    Swedish part of the corpus (XML)
europarl-fi.xml.bz2         81.98 MB   2013-11-20  CC-BY-4.0
    Finnish part of the corpus (XML)
stats_EUROPARL-SV.txt.zip   3.66 MB    2025-04-22  CC-BY-4.0
    Statistics on the Swedish part of the corpus (CSV)
stats_EUROPARL-FI.txt.zip   4.24 MB    2025-04-22  CC-BY-4.0
    Statistics on the Finnish part of the corpus (CSV)
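
The corpus files listed above are bz2-compressed XML, so they can be inspected without unpacking them to disk first. The following is a minimal sketch in Python that streams such a file and counts how often each element name occurs. It assumes nothing about the XML schema, which this page does not document. The function name count_elements and its limit parameter are hypothetical, introduced only for illustration.

```python
import bz2
import xml.etree.ElementTree as ET
from collections import Counter


def count_elements(path, limit=None):
    """Stream a bz2-compressed XML file and count element names.

    No schema is assumed: every closing element is tallied by its tag.
    `limit` is a hypothetical convenience parameter that stops the scan
    after that many elements, useful for a quick look at a large file.
    """
    counts = Counter()
    seen = 0
    # bz2.open decompresses on the fly, so the file is never fully unpacked.
    with bz2.open(path, "rb") as stream:
        # iterparse yields each element once its closing tag has been read.
        for _, elem in ET.iterparse(stream, events=("end",)):
            counts[elem.tag] += 1
            seen += 1
            # Drop the element's text and children to keep memory use low.
            elem.clear()
            if limit is not None and seen >= limit:
                break
    return counts
```

A call such as count_elements("europarl-fi.xml.bz2", limit=100000) would return a Counter of the element names seen in the first part of the file, which is one way to discover the markup before writing a schema-specific parser.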

Collection

Type

  • Corpus

Language

Swedish
Finnish

Size

Tokens: 59,633,364
Sentences: 2,872,696

Creators

  • Koehn, Philipp

Created

2007-09-28

Updated

2014-04-29

Contact

sb-info@svenska.gu.se