Språkbanken Text is a department within Språkbanken.

swe-pos-hunpos-suc3-1800

Citation Information

Språkbanken Text (2015). swe-pos-hunpos-suc3-1800 (updated: 2015-09-11). [Analysis]. Språkbanken Text.
Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800s

Sentence segments are analysed to enrich tokens with part-of-speech tags. In addition to the POS model, inflection lists are provided to Hunpos to make more accurate part-of-speech predictions for Swedish from the 1800s.

Example

This analysis is used with Sparv. Check out Sparv's quick start guide to get started!

To use this analysis, add the following line under export.annotations in the Sparv corpus configuration file:

- <token>:hunpos.pos  # Part-of-speech tags

For more info on how to use Sparv, check out the Sparv documentation.

Example output:

<token pos="NN">Lådan</token>
<token pos="VB">var</token>
<token pos="PC">upphängd</token>
<token pos="PP">under</token>
<token pos="DT">den</token>
<token pos="NN">waggon</token>
<token pos="HA">hvari</token>
<token pos="DT">de</token>
<token pos="JJ">andra</token>
<token pos="NN">djuren</token>
<token pos="VB">befunno</token>
<token pos="PN">sig</token>
<token pos="MAD">.</token>
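
The output above can be read back into (word, tag) pairs with a few lines of Python. This is a minimal sketch, not part of Sparv: the function name read_tagged_tokens and the wrapping <sentence> element are assumptions made for illustration, and a real Sparv export may carry more structure and attributes than the sample shown here.

```python
import xml.etree.ElementTree as ET
from collections import Counter

# Sample copied from the example output above.
SAMPLE = """\
<token pos="NN">Lådan</token>
<token pos="VB">var</token>
<token pos="PC">upphängd</token>
<token pos="PP">under</token>
<token pos="DT">den</token>
<token pos="NN">waggon</token>
<token pos="HA">hvari</token>
<token pos="DT">de</token>
<token pos="JJ">andra</token>
<token pos="NN">djuren</token>
<token pos="VB">befunno</token>
<token pos="PN">sig</token>
<token pos="MAD">.</token>
"""


def read_tagged_tokens(xml_fragment):
    """Return (word, pos) pairs from a run of <token> elements."""
    # The fragment has no single root element, so it is wrapped in a
    # hypothetical <sentence> element before parsing (an assumption).
    root = ET.fromstring(f"<sentence>{xml_fragment}</sentence>")
    return [(tok.text, tok.get("pos")) for tok in root.iter("token")]


pairs = read_tagged_tokens(SAMPLE)

# Count how often each SUC tag occurs in the sample.
tag_counts = Counter(pos for _, pos in pairs)

for word, pos in pairs:
    print(f"{word}\t{pos}")
```

Wrapping the fragment keeps the parser happy without changing the tokens themselves; tag_counts then gives a quick overview of the tag distribution in the sample.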

Other references

  • Hunpos: https://code.google.com/archive/p/hunpos/

Type

  • Analysis

Task

part-of-speech tagging

Unit

token

Tool

Hunpos

Model

Tagset

Trained on

Created

2012-10-23

Updated

2015-09-11

Contact

Språkbanken Text
sb-info@svenska.gu.se