Annotation of Swedish texts with OVIX values which indicate the difficulty of the texts
OVIX (ordvariationsindex) is a readability measure based on how many words occur only once in the text chunk.
OVIX is calculated as log(tokens) / log(2 - (log(types) / log(tokens)))
A high value can be interpreted as frequently introducing new words to the reader. On the other hand, a low value may indicate a monotonous text.