Full-form lookup for SALDO citation forms (lemmas)
The SALDO morphology full-form lexicon is used to find possible citation forms (lemmas) and word senses for text word tokens, preserving ambiguity.
This analysis is used with Sparv. Check out Sparv's quick start guide to get started!
To use this analysis, add the following line under export.annotations
in the Sparv corpus configuration file:
- <token>:saldo.baseform # Baseforms from SALDO
For more info on how to use Sparv, check out the Sparv documentation.
Example output:
<token baseform="|vi|">Vi</token>
<token baseform="|skola|">ska</token>
<token baseform="|köra|">köra</token>
<token baseform="|den|en|den här|">den</token>
<token baseform="|här|den här:4|">här</token>
<token baseform="|">clownbilen</token>
<token baseform="|till|">till</token>
<token baseform="|cirkus|">cirkusen</token>
<token baseform="|">.</token>
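Judging from the example output above, each baseform value appears to be a pipe-delimited set of candidate lemmas, with a bare | meaning that no lemma was found. The following is a minimal Python sketch of how such values could be read downstream, under that assumption. The helper names (parse_baseforms, token_baseforms) and the wrapping <sentence> element are hypothetical and not part of Sparv. Suffixes such as ":4" are left untouched, since their meaning is not described here.

```python
import xml.etree.ElementTree as ET


def parse_baseforms(value: str) -> list[str]:
    """Split a pipe-delimited baseform value into a list of candidates.

    Assumption: "|a|b|" encodes the candidates a and b, and "|" encodes
    an empty set. Entries are returned verbatim (suffixes like ":4" are
    kept as-is).
    """
    return [part for part in value.split("|") if part]


# Tokens copied from the example output, wrapped in a hypothetical
# <sentence> root so the fragment is well-formed XML.
SAMPLE = """<sentence>
<token baseform="|den|en|den här|">den</token>
<token baseform="|här|den här:4|">här</token>
<token baseform="|">clownbilen</token>
</sentence>"""


def token_baseforms(xml_text: str) -> list[tuple[str, list[str]]]:
    """Return (word, candidate baseforms) pairs for every <token> element."""
    root = ET.fromstring(xml_text)
    return [
        (tok.text or "", parse_baseforms(tok.get("baseform", "")))
        for tok in root.iter("token")
    ]


if __name__ == "__main__":
    for word, forms in token_baseforms(SAMPLE):
        print(word, forms)
```

Keeping every candidate in a list preserves the ambiguity that the lexicon lookup reports; choosing a single lemma per token would be a separate step.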