Lemmatiseringsmodell: Stanza

Datacitering

Språkbanken (2020). Lemmatiseringsmodell: Stanza (uppdaterad: 2020-11-19). [Data set]. Bearbetad och distribuerad av Språkbanken. https://doi.org/10.23695/681b-be74

Ytterligare sätt att citera datamängden.

Förtränad modell för lemmatisering.

Models

We provide a model that enables lemmatization of Swedish text following the SUC3 standard. Note that SUC3 lemmatization does not exactly match the SALDO standard that is used in our Korp resources.

SUC3 was randomly split into training, validation and test sets (80:10:10). The model was trained for 30 epochs using the default Stanza settings. The accuracy on the test set is 99.18.

Ladda ned

Fil	Storlek	Modifierad	Licens
lem_stanza.zip	3.74 MB	2020-11-19	CC-BY-4.0

Datacitering

Models

Ladda ned

Typ

Språk

Storlek

Uppdaterad

Kontakt

DOI