Skip to main content
Språkbanken Text is a department within Språkbanken.

The English-Swedish Parallel Corpus (ESPC)

Citation Information

Språkbanken Text. The English-Swedish Parallel Corpus (ESPC) [Data set]. Språkbanken Text. https://doi.org/10.23695/pmpb-ry83
BibTeX Additional ways to cite the dataset.
ESPC is a combined comparable and parallel corpus suitable for cross-language research for diffferent types.

English-Swedish Parallel Corpus (ESPC) was compiled in 1990's in a joint project at the University of Gothenburg and University of Lund. The first version of this corpus was released for research in early 2000 and it consist of approximately 2.8 million words including both original and translated texts. The corpus have two main parts, fiction and non-fiction, which are further divided into a number of text types or categories. All texts are written texts published between 1980 and 2000, and the great majority are extracts of complete texts. For access please contact Anna-Lena Fredriksson.

Type

  • Corpus

Language

Swedish
English

Size

Sentences: 89,779
Tokens: 1,518,759

Keywords

  • parallel
  • fiction
  • non-fiction

Contact

Institutionen för språk och litteraturer
espc@sprak.gu.se