Skip to main content

sbx-swe-pos-hunpos-suc3-1800

Analysis citation Information

Språkbanken Text (2015). sbx-swe-pos-hunpos-suc3-1800 (updated: 2015-09-11). [Analysis]. Språkbanken Text. https://doi.org/10.23695/dmq8-qp10
BibTeX Additional ways to cite the dataset.
Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800's

Sentence segments are analysed to enrich tokens with part-of-speech tags. In addition to the pos model inflection lists are provided to Hunpos to make more accurate part-of-speech predictions for Swedish from the 1800's.

Example

This analysis is used with Sparv. Check out Sparv's quick start guide to get started!

To use this analysis, add the following line under export.annotations in the Sparv corpus configuration file:

- <token>:hunpos.pos  # Part-of-speech tags

In order to use this annotation you need to add the following setting to your Sparv corpus configuration file:

metadata:
  language: swe
  variety: "1800"

For more info on how to use Sparv, check out the Sparv documentation.

Example output:

<token pos="NN">Lådan</token>
<token pos="VB">var</token>
<token pos="PC">upphängd</token>
<token pos="PP">under</token>
<token pos="DT">den</token>
<token pos="NN">waggon</token>
<token pos="HA">hvari</token>
<token pos="DT">de</token>
<token pos="JJ">andra</token>
<token pos="NN">djuren</token>
<token pos="VB">befunno</token>
<token pos="PN">sig</token>
<token pos="MAD">.</token>

Type

  • Analysis

Task

  • part-of-speech tagging

Unit

  • token

Dependencies

External tools

Hunpos
BSD-3-Clause

Models

dalinm-swedberg_saldo_suc-tags.morphtable
A word list along with the words' morphosyntactic information generated from the [Dalin morphology](https://spraakbanken.gu.se/resurser/dalinm) and the [Swedberg morphology](https://spraakbanken.gu.se/resurser/swedbergm)

Tagset

Trained on

Created

2012-10-23

Updated

2015-09-11

Contact

sb-info@svenska.gu.se