Skip to main content

Syntag treebank

A Swedish treebank with syntactic analysis of 158 articles from Press-65.

SynTag is a so called tree bank, containing syntactically annotated text from 158 articles from the corpus Press-65, with about 100 000 running words. The annotation contains the relations of constituents and words, such as subjects or other arguments of finite verbs, in up to 12 levels of analysis. Additionally, there are simple word tags. The data still contains some errors, which will be corrected in the future.

More information (in Swedish) about SynTag can be found in this manual.

File Size Modified Licence
syntag.txt
syntag.txt (txt)
4.45 MB 2010-02-08 CC BY 4.0
attribution
syntag.html
syntag.html (html)
10.15 MB 2010-05-24 CC BY 4.0
attribution

Type

  • Corpus
  • Training and evaluation data

Language

Swedish

Size

Tokens: 101,329

Contact

Språkbanken
sb-info@svenska.gu.se