Skip to main content

Analyses

Search our analyses. You can click on a row to see the details.
Analysis Sort descending Type Collections Task Unit Language
sbx-eng-tokenization-stanza
Tokenization with Stanza's standard model for English
Analysis tokenization token English
sbx-mul-paragraph-sparv-blanklines
Segments text into paragraphs by blank lines using the RegexpTokenizer from NLTK
Analysis tokenization paragraph
sbx-mul-sentence-sparv-blanklines
Segments text into sentences by blank lines using the RegexpTokenizer from NLTK
Analysis tokenization sentence
sbx-mul-tokenization-sparv-blanklines
Tokenizes text into tokens by blank lines using the RegexpTokenizer from NLTK
Analysis tokenization token
sbx-mul-tokenization-sparv-linebreaks
Tokenizes text into tokens by linebreaks using the RegexpTokenizer from NLTK
Analysis tokenization token
sbx-mul-tokenization-sparv-whitespace
Tokenizes text into tokens by whitespaces using the RegexpTokenizer from NLTK
Analysis tokenization token
sbx-swe-tokenization-sparv-betterword
Tokenizes text, custom-made for Swedish
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses tokenization token Swedish