Fasttext model trained on Göteborgsposten 2013–2022
Kubord-fasttext is a collection of fasttext models developed in a collaboration between
KBLab and Språkbanken Text. The models are
trained on the same underlying data as Kubord 2.
Models have been trained at both the token level and the lemma level. Training was done with
Gensim, using the following parameter settings:
min_n 4, max_n 7, 20 epochs, dim 300, and lr 0.05.
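To make the subword settings concrete, here is a minimal sketch of how fasttext-style character n-grams follow from min_n and max_n. It is an illustration in plain Python, not the training code used for these models. The mapping of "dim" and "lr" to Gensim's `vector_size` and `alpha` keywords is an assumption, and the example word "tidning" is hypothetical.

```python
# Settings quoted above, written with Gensim 4 keyword names.
# Assumption: "dim" corresponds to vector_size and "lr" to alpha
# (the initial learning rate); the page itself does not spell this out.
PARAMS = {"min_n": 4, "max_n": 7, "epochs": 20, "vector_size": 300, "alpha": 0.05}


def char_ngrams(word, min_n, max_n):
    """Return the character n-grams of `word` with lengths min_n..max_n.

    The word is wrapped in '<' and '>' boundary markers first, as in
    fasttext's subword scheme.
    """
    wrapped = f"<{word}>"
    grams = []
    for n in range(min_n, max_n + 1):
        # Slide a window of width n across the wrapped word.
        for start in range(len(wrapped) - n + 1):
            grams.append(wrapped[start:start + n])
    return grams


# "tidning" is a hypothetical example word, not taken from the dataset.
grams = char_ngrams("tidning", PARAMS["min_n"], PARAMS["max_n"])
print(len(grams))   # 18
print(grams[:3])    # ['<tid', 'tidn', 'idni']
```

With min_n 4, a word shorter than two characters has no subword n-grams at all, since even with its boundary markers it has fewer than four characters. Such words are represented by their whole-word vector only.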
Citation
Språkbanken Text. (2024-06-11). Kubord-fasttext - Göteborgsposten 2013–2022 - lemma [Data set]. Språkbanken Text. https://doi.org/10.23695/mwyr-gk24
File | Size | Modified | Licence |
---|---|---|---|
kubord-fasttext-gp-2013-2022-lemma.zip (model, zip) | 2.69 GB | 2024-08-05 | CC BY 4.0 (attribution) |