Språkbanken Text is a part of Språkbanken.

Analyses

Search our analyses. You can click on a row to see the details.
| Analysis | Type | Collections | Task | Unit | Language |
| --- | --- | --- | --- | --- | --- |
| eng-tokenization-stanza: Tokenizes text with Stanza's standard model for English | Analysis | | tokenization | token | English |
| paragraph-sparv-blanklines: Segments text into paragraphs at blank lines using the RegexpTokenizer from NLTK | Analysis | | tokenization | paragraph | |
| sentence-sparv-blanklines: Segments text into sentences at blank lines using the RegexpTokenizer from NLTK | Analysis | | tokenization | sentence | |
| swe-tokenization-sparv-betterword: Tokenizes text with a tokenizer custom-made for Swedish | Analysis | mink-analyses, standard-analyses-swe | tokenization | token | Swedish |
| tokenization-sparv-blanklines: Splits text into tokens at blank lines using the RegexpTokenizer from NLTK | Analysis | | tokenization | token | |
| tokenization-sparv-linebreaks: Splits text into tokens at line breaks using the RegexpTokenizer from NLTK | Analysis | | tokenization | token | |
| tokenization-sparv-whitespace: Splits text into tokens at whitespace using the RegexpTokenizer from NLTK | Analysis | | tokenization | token | |
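
Several of the analyses listed above split text at blank lines, line breaks, or whitespace using NLTK's RegexpTokenizer. The sketch below illustrates the general idea of that kind of separator-based splitting using only Python's standard `re` module: the text is split wherever a separator pattern matches, and empty pieces are dropped. The three patterns, the helper name `split_on_gaps`, and the sample text are assumptions made for illustration; the listing does not state the actual expressions or settings the analyses use.

```python
import re

# Hypothetical separator patterns; the expressions actually used by the
# listed analyses are not given in the listing.
BLANK_LINES = re.compile(r"\s*\n\s*\n\s*")  # one or more blank lines
LINE_BREAKS = re.compile(r"\s*\n\s*")       # a single line break
WHITESPACE = re.compile(r"\s+")             # any run of whitespace


def split_on_gaps(text, gap_pattern):
    """Split text at every match of gap_pattern and drop empty pieces."""
    return [piece for piece in gap_pattern.split(text) if piece]


sample = "First line\nsecond line\n\nNew block here"

by_blank_lines = split_on_gaps(sample, BLANK_LINES)
by_line_breaks = split_on_gaps(sample, LINE_BREAKS)
by_whitespace = split_on_gaps(sample, WHITESPACE)

print(by_blank_lines)  # ['First line\nsecond line', 'New block here']
print(by_line_breaks)  # ['First line', 'second line', 'New block here']
print(by_whitespace)   # ['First', 'line', 'second', 'line', 'New', 'block', 'here']
```

The same separator logic yields different units depending on the pattern, which is why the blank-line variant appears in the listing for paragraphs, sentences, and tokens alike.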