Skip to main content
Språkbanken Text is a part of Språkbanken.

swe-pos-hunpos-suc3-1800

Citation Information

Språkbanken Text (2015). swe-pos-hunpos-suc3-1800 (updated: 2015-09-11). [Analysis]. Språkbanken Text.
BibTeX
Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800's

Sentence segments are analysed to enrich tokens with part-of-speech tags. In addition to the pos model inflection lists are provided to Hunpos to make more accuare part-of-speech predictions for Swedish from the 1800's.

Example

This analysis is used with Sparv. Check out Sparv's quick start guide to get started!

To use this analysis, add the following line under export.annotations in the Sparv corpus configuration file:

- <token>:hunpos.pos  # Part-of-speech tags

In order to use this annotation you need to add the following setting to your Sparv corpus configuration file:

metadata:
  language: swe
  variety: "1800"

For more info on how to use Sparv, check out the Sparv documentation.

Example output:

<token pos="NN">Lådan</token>
<token pos="VB">var</token>
<token pos="PC">upphängd</token>
<token pos="PP">under</token>
<token pos="DT">den</token>
<token pos="NN">waggon</token>
<token pos="HA">hvari</token>
<token pos="DT">de</token>
<token pos="JJ">andra</token>
<token pos="NN">djuren</token>
<token pos="VB">befunno</token>
<token pos="PN">sig</token>
<token pos="MAD">.</token>

Other references

  • Hunpos: https://code.google.com/archive/p/hunpos/

Type

  • Analysis

Task

  • part-of-speech tagging

Unit

  • token

Tool

Hunpos

Model

Tagset

Trained on

Created

2012-10-23

Updated

2015-09-11

Contact

Språkbanken Text
sb-info@svenska.gu.se