Språkbanken Text is a department within Språkbanken.

Kubord-fasttext - Dagens Nyheter 2010–2022 - token

Citation

Språkbanken Text. (2024-06-11). Kubord-fasttext - Dagens Nyheter 2010–2022 - token [Data set]. Språkbanken Text. https://doi.org/10.23695/rnmb-ga21

Fasttext model trained on Dagens Nyheter 2010–2022

Kubord-fasttext is a collection of fasttext models, developed in a collaboration between KBLab and Språkbanken Text, that have been trained on the same underlying data as Kubord 2. The models have been trained at the token level and at the lemma level. The training was done with Gensim, using the following parameter settings: min_n 4, max_n 7, 20 epochs, dim 300, and lr 0.05.
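To illustrate what the min_n and max_n settings control, the sketch below lists the character n-grams that a fasttext-style model would consider for a single word. It is a simplified illustration and not the code used to train these models: it assumes the usual fasttext convention of wrapping each word in "<" and ">" before extracting n-grams, and it leaves out the hashing of n-grams into buckets. The function name char_ngrams and the example word are hypothetical, and the Gensim keyword names in REPORTED_SETTINGS (vector_size for dim, alpha for lr) are an assumption based on Gensim 4.x naming.

```python
# Settings reported on this page. The keys follow Gensim 4.x keyword naming,
# which is an assumption; the page itself uses the names "dim" and "lr".
REPORTED_SETTINGS = {
    "min_n": 4,          # shortest character n-gram
    "max_n": 7,          # longest character n-gram
    "epochs": 20,
    "vector_size": 300,  # reported as "dim"
    "alpha": 0.05,       # reported as "lr"
}


def char_ngrams(word, min_n=4, max_n=7):
    """Return the character n-grams of `word` with lengths min_n..max_n.

    The word is wrapped in '<' and '>' first, so that prefixes and
    suffixes get n-grams distinct from word-internal sequences.
    """
    wrapped = f"<{word}>"
    grams = []
    for n in range(min_n, max_n + 1):
        # Slide a window of width n across the wrapped word.
        for start in range(len(wrapped) - n + 1):
            grams.append(wrapped[start:start + n])
    return grams


if __name__ == "__main__":
    # Hypothetical example word ("tidning" is Swedish for "newspaper").
    for gram in char_ngrams(
        "tidning",
        REPORTED_SETTINGS["min_n"],
        REPORTED_SETTINGS["max_n"],
    ):
        print(gram)
```

With min_n 4, a word shorter than two characters yields no n-grams in this sketch, since the wrapped form is then shorter than four characters.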

File	Size	Modified	Licence
kubord-fasttext-dn-2010-2022-token.zip model (zip)	3.1 GB	2024-06-11	CC BY 4.0 attribution

Collection

Kubord-fasttext

Type

Model

Language

Swedish

Size

Created

2024-06-11

Updated

2024-06-11

Contact

Språkbanken

sb-info@svenska.gu.se

DOI

https://doi.org/10.23695/rnmb-ga21

Page manager: Språkbanken Text

Last update: 2024-08-19