Språkbanken Text is a part of Språkbanken.

Analyses

Analysis Collections Task Unit Language
sbx-eng-dependency-stanza
Dependency parsing with Stanza's standard model for English
dependency parsing token English
sbx-eng-lemmatization-stanza
Lemmatization with Stanza's standard model for English
lemmatization token English
sbx-eng-msd-stanza-ufeats
Stanza-based morphological analysis for English, using universal features (UD)
morphosyntactic tagging token English
sbx-eng-namedentity-stanza
Named entity recognition with Stanza's standard model for English
named entity recognition English
sbx-eng-pos-stanza
Part-of-speech annotation with Penn Treebank tags, using Stanza's standard model for English
part-of-speech tagging token English
sbx-eng-pos-stanza-upos
Part-of-speech annotation with UD (Universal Dependencies) tags, using Stanza's standard model for English
part-of-speech tagging token English
sbx-eng-sentence-stanza
Sentence segmentation with Stanza's standard model for English
sentence segmentation sentence English
sbx-eng-tokenization-stanza
Tokenization with Stanza's standard model for English
tokenization token English
sbx-mul-paragraph-sparv-blanklines
Segments text into paragraphs by blank lines using the RegexpTokenizer from NLTK
tokenization paragraph
sbx-mul-paragraph-sparv-linebreaks
Segments text into paragraphs by linebreaks using the RegexpTokenizer from NLTK
paragraph segmentation paragraph
sbx-mul-paragraph-sparv-whitespace
Segments text into paragraphs by whitespace using the RegexpTokenizer from NLTK
paragraph segmentation paragraph
sbx-mul-sentence-sparv-blanklines
Segments text into sentences by blank lines using the RegexpTokenizer from NLTK
tokenization sentence
sbx-mul-sentence-sparv-linebreaks
Segments text into sentences by linebreaks using the RegexpTokenizer from NLTK
sentence segmentation sentence
sbx-mul-sentence-sparv-punctuation
Segments text into sentences by punctuation marks using the RegexpTokenizer from NLTK
sentence segmentation sentence
sbx-mul-sentence-sparv-whitespace
Segments text into sentences by whitespace using the RegexpTokenizer from NLTK
sentence segmentation sentence
sbx-mul-tokenization-sparv-blanklines
Tokenizes text into tokens by blank lines using the RegexpTokenizer from NLTK
tokenization token
sbx-mul-tokenization-sparv-linebreaks
Tokenizes text into tokens by linebreaks using the RegexpTokenizer from NLTK
tokenization token
sbx-mul-tokenization-sparv-whitespace
Tokenizes text into tokens by whitespace using the RegexpTokenizer from NLTK
tokenization token
sbx-swe-compound-sparv-saldolemgram
Analysis of SALDO lemgram compounds including a probability ranking
sbx-swe-mink_analyses, sbx-swe-standard_analyses compound analysis token Swedish
sbx-swe-compound-sparv-saldowords
Analysis of SALDO wordform compounds
sbx-swe-mink_analyses, sbx-swe-standard_analyses compound analysis token Swedish
sbx-swe-dependency-malt-treebank
Swedish dependency parsing with MaltParser trained on a Swedish treebank
dependency parsing token Swedish
sbx-swe-dependency-stanza-stanzasynt
Swedish dependency parsing with Stanza trained on a Swedish treebank
sbx-swe-mink_analyses, sbx-swe-standard_analyses dependency parsing token Swedish
sbx-swe-geotagcontext-sparv
Annotate text chunks with location data, based on locations contained within the text
sbx-swe-standard_analyses geotagging text Swedish
sbx-swe-geotagmetadata-sparv
Annotate text chunks with location data, based on metadata containing location names
geotagging text Swedish
sbx-swe-lemgram-sparv-saldo
Lookup for SALDO lemgrams
sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical lookup token Swedish
sbx-swe-lemmatization-sparv-saldo
Full-form lookup for SALDO citation forms (lemmas)
lemmatization token Swedish
sbx-swe-lemmatization-sparv-saldo2
Full-form lookup for SALDO citation forms (lemmas) plus analysis of compounds made up of SALDO entries
sbx-swe-mink_analyses, sbx-swe-standard_analyses lemmatization token Swedish
sbx-swe-lemmatization-stanza-stanzalem
Swedish citation form analysis (base forms, lemmas) by Stanza, trained on SUC3
lemmatization token Swedish
sbx-swe-lexical_classes_text-sparv-blingbring
Lexical classes from Blingbring on text-level
sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes text Swedish
sbx-swe-lexical_classes_text-sparv-swefn
Lexical classes from SweFN on text-level
sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes text Swedish
sbx-swe-lexical_classes_token-sparv-blingbring
Lexical classes from Blingbring on token-level
sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes token Swedish
sbx-swe-lexical_classes_token-sparv-swefn
Lexical classes from SweFN on token-level
sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes token Swedish
Collection
sbx-swe-mink_analyses
Collection of analyses used in Mink
Swedish
sbx-swe-msd-hunpos-suc3
Annotation of morphological features (SUC) by Hunpos for Swedish
morphosyntactic tagging token Swedish
sbx-swe-msd-hunpos-suc3-1800
Annotation of morphological features (SUC) by Hunpos for Swedish from the 1800s
morphosyntactic tagging token Swedish
sbx-swe-msd-stanza-stanzamorph-suc3
Annotation of morphological features (SUC) by Stanza for Swedish
sbx-swe-mink_analyses, sbx-swe-standard_analyses morphosyntactic tagging token Swedish
sbx-swe-msd-stanza-stanzamorph-ufeats
Stanza-based morphological analysis for Swedish, using universal features (UD)
sbx-swe-mink_analyses, sbx-swe-standard_analyses morphosyntactic tagging token Swedish
sbx-swe-namedentity-swener
Named entity recognition (NER) of entities such as locations, persons and time expressions in text
sbx-swe-mink_analyses, sbx-swe-standard_analyses named entity recognition Swedish
sbx-swe-phrasestructure-sparv
Swedish phrase structure parsing based on Mamba-Dep dependency analysis
phrase structure parsing Swedish
sbx-swe-pos-hunpos-suc3
Swedish part-of-speech annotation with SUC tags by Hunpos
part-of-speech tagging token Swedish
sbx-swe-pos-hunpos-suc3-1800
Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800s
part-of-speech tagging token Swedish
sbx-swe-pos-stanza-stanzamorph
Swedish part-of-speech annotation with SUC tags by Stanza
sbx-swe-mink_analyses, sbx-swe-standard_analyses part-of-speech tagging token Swedish
sbx-swe-readability-sparv-lix
Annotation of Swedish texts with LIX values which indicate the difficulty of the texts
sbx-swe-mink_analyses, sbx-swe-standard_analyses readability measures text Swedish
sbx-swe-readability-sparv-nk
Annotation of Swedish texts with nominal ratios which indicate the difficulty of the texts
sbx-swe-mink_analyses, sbx-swe-standard_analyses readability measures text Swedish
sbx-swe-readability-sparv-ovix
Annotation of Swedish texts with OVIX values which indicate the difficulty of the texts
sbx-swe-mink_analyses, sbx-swe-standard_analyses readability measures text Swedish
sbx-swe-sense-sparv
Word sense disambiguation based on SALDO annotation
sbx-swe-mink_analyses, sbx-swe-standard_analyses sense disambiguation token Swedish
sbx-swe-sense-sparv-saldo
Lookup for SALDO identifiers
lexical lookup token Swedish
sbx-swe-sentence-sparv-storsuc
Segments text into sentences, custom-made for Swedish
sbx-swe-mink_analyses, sbx-swe-standard_analyses sentence segmentation sentence Swedish
sbx-swe-sentiment-sparv-sensaldo
Sentiment analysis via lookup in SenSALDO
sbx-swe-mink_analyses, sbx-swe-standard_analyses sentiment analysis token Swedish
Collection
sbx-swe-standard_analyses
Collection of Sparv analyses for modern Swedish
Swedish
sbx-swe-tokenization-sparv-betterword
Tokenizes text, custom-made for Swedish
sbx-swe-mink_analyses, sbx-swe-standard_analyses tokenization token Swedish
swe-sbx-ocr-correction-viklofg-sweocr
OCR correction annotations
ocr-correction
swe-sbx-word-prediction-kb-bert
Word prediction annotations for each word in a text.
word-prediction token
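
The LIX readability value mentioned for sbx-swe-readability-sparv-lix can be illustrated with a small sketch. It uses the standard LIX formula (average sentence length plus the percentage of words longer than six letters). The function name `lix`, the regex-based word and sentence splitting, and the sample text are assumptions made for illustration only; this is not Sparv's implementation, which works on its own token and sentence annotations.

```python
import re


def lix(text: str) -> float:
    """Return a LIX value: words/sentences + 100 * long_words/words.

    A "long word" has more than six letters. Tokenization here is a
    simplifying assumption, not the segmentation used by Sparv.
    """
    # Assumption: a word is any run of Unicode word characters.
    words = re.findall(r"\w+", text)
    if not words:
        raise ValueError("text must contain at least one word")

    # Assumption: sentences end at '.', '!' or '?'; empty pieces are dropped.
    sentences = [s for s in re.split(r"[.!?]+", text) if s.strip()]

    long_words = [w for w in words if len(w) > 6]
    return len(words) / len(sentences) + 100 * len(long_words) / len(words)


# Hypothetical sample: 10 words, 2 sentences, 2 long words.
sample = "Det här är en kort text. Den innehåller två meningar."
print(lix(sample))
```

Higher values indicate harder text; the sample above has short sentences and few long words, so its value is low.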