Skip to main content
Språkbanken Text is a department within Språkbanken.

Syntag treebank

Citation Information

Språkbanken Text (2010). Syntag treebank (updated: 2010-05-24). [Data set]. Språkbanken Text. https://doi.org/10.23695/0fm6-ah89
BibTeX Additional ways to cite the dataset.
A Swedish treebank with syntactic analysis of 158 articles from Press-65.

SynTag is a so called tree bank, containing syntactically annotated text from 158 articles from the corpus Press-65, with about 100 000 running words. The annotation contains the relations of constituents and words, such as subjects or other arguments of finite verbs, in up to 12 levels of analysis. Additionally, there are simple word tags. The data still contains some errors, which will be corrected in the future.

More information (in Swedish) about SynTag can be found in this manual.

File Size Modified Licence
syntag.txt
syntag.txt (txt)
4.45 MB 2010-02-08 CC BY 4.0
attribution
syntag.html
syntag.html (html)
10.15 MB 2010-05-24 CC BY 4.0
attribution

Type

  • Corpus
  • Training and evaluation data

Language

Swedish

Size

Tokens: 101,329

Updated

2010-05-24

Contact

Språkbanken
sb-info@svenska.gu.se