The Swedish treebank Eukalyptus has been released in a new version

29 May 2020
The Swedish treebank Eukalyptus is a collection of contemporary Swedish texts from five different genres, totaling close to 100 000 words. A new version of Eukalyptus has now been released.

The texts are annotated with parts of speech, morphology, and senses, as well as syntactic structure. The new version corrects the part-of-speech and morphological annotation. A further version with updated syntactic annotation is planned for the near future.

The Eukalyptus corpus can be downloaded here.

Read more in the Språkbanken blog.