Skip to main content

PAROLE

A corpus annotated with morphological and syntactic information

The data called Parole was collected within the EU project PAROLE (finished in 1997 and aimed at building a European network of language resources). The texts have been annotated with morphosyntactic information and contain about 19 million tokens.

The download of the resource is restricted. The resource cannot be used commerially and is therefore restricted to researchers at certain Swedish universities. Please contact the director of Språkbanken, Lars Borin, (sb-info at svenska . gu . se) for further information.


Download the PAROLE corpus
Size: About 19 million grafh words
Format: Tabular text file
Documentation: see the web version of the PAROLE corpus.

File Size Modified Licence
parole.xml.bz2
this file contains a scrambled version of the corpus Information (XML)
425.19 MB 2017-05-17 CC BY 4.0
attribution
parole.zip
parole.zip (zip)
67.62 MB 2024-01-25 CC BY 4.0
attribution
stats_PAROLE.txt
Word statistics: Information (CSV)
39.18 MB 2017-05-21 CC BY 4.0
attribution

Type

  • Corpus

Language

Swedish

Size

Sentences: 1,625,870
Tokens: 24,303,096

Contact

Språkbanken
sb-info@svenska.gu.se