Skip to main content
Språkbanken Text is a department within Språkbanken.

swe-msd-hunpos-suc3

Citation Information

Språkbanken Text (2018). swe-msd-hunpos-suc3 (updated: 2018-05-28). [Analysis]. Språkbanken Text.
Annotation of morphological features (SUC) by Hunpos for Swedish

Sentence segments are analysed to enrich tokens with part-of-speech tags and morphosyntactic information. No longer used by default by Sparv because Stanza's POS-tagging yields better results.

Example

This analysis is used with Sparv. Check out Sparv's quick start guide to get started!

To use this analysis, add the following line under export.annotations in the Sparv corpus configuration file:

- <token>:hunpos.msd  # Part-of-speeches with morphological descriptions

For more info on how to use Sparv, check out the Sparv documentation.

Example output:

<token msd="PN.NEU.SIN.DEF.SUB+OBJ">Det</token>
<token msd="AB">här</token>
<token msd="VB.PRS.AKT">är</token>
<token msd="DT.UTR.SIN.IND">en</token>
<token msd="NN.UTR.SIN.IND.NOM">korpus</token>
<token msd="MAD">.</token>

Other references

  • Hunpos: https://code.google.com/archive/p/hunpos/

Type

  • Analysis

Task

morphosyntactic tagging

Unit

token

Tool

Hunpos

Tagset

Trained on

Created

2010-12-15

Updated

2018-05-28

Contact

Språkbanken Text
sb-info@svenska.gu.se