Korpus av somaliska Wikipedia Sammanfattning Resurstyp Corpus Språk somaliska Tokens 869 335 Sentences 42 514 Ladda ned wikipedia-so.xml.bz2 corpus (XML) licens: CC BY 4.0 (attribution) stats_WIKIPEDIA-SO.txt token frequencies (CSV) licens: CC BY 4.0 (attribution)