Skip to main content
Språkbanken Text is a department within Språkbanken.

swe-lemmatization-sparv-saldo

Citation Information

Språkbanken Text (2018). swe-lemmatization-sparv-saldo (updated: 2018-03-28). [Analysis]. Språkbanken Text.
Full-form lookup for SALDO citation forms (lemmas)

The SALDO morphology full-form lexicon is used to find possible citation forms (lemmas) and word senses for text word tokens, preserving ambiguity.

Example

This analysis is used with Sparv. Check out Sparv's quick start guide to get started!

To use this analysis, add the following line under export.annotations in the Sparv corpus configuration file:

- <token>:saldo.baseform  # Baseforms from SALDO

For more info on how to use Sparv, check out the Sparv documentation.

Example output:

<token baseform="|vi|">Vi</token>
<token baseform="|skola|">ska</token>
<token baseform="|köra|">köra</token>
<token baseform="|den|en|den här|">den</token>
<token baseform="|här|den här:4|">här</token>
<token baseform="|">clownbilen</token>
<token baseform="|till|">till</token>
<token baseform="|cirkus|">cirkusen</token>
<token baseform="|">.</token>

Type

  • Analysis

Task

lemmatization

Unit

token

Tool

Sparv

Keyword

  • saldo

Created

2010-12-15

Updated

2018-03-28

Contact

Språkbanken Text
sb-info@svenska.gu.se