Skip to main content
Språkbanken Text is a part of Språkbanken.

Analyses

Search our analyses. You can click on a row to see the details.
Analysis Sort descending Type Collections Task Unit Language
sbx-eng-dependency-stanza
Dependency parsing with Stanza's standard model for English
Analysis dependency parsing token English
sbx-eng-lemmatization-stanza
Lemmatization with Stanza's standard model for English
Analysis lemmatization token English
sbx-eng-msd-stanza-ufeats
Stanza-based morphological analysis for English, using universal features (UD)
Analysis morphosyntactic tagging token English
sbx-eng-namedentity-stanza
Named entity recognition with Stanza's standard model for English
Analysis named entity recognition English
sbx-eng-pos-stanza
Part-of-speech annotation with Penn Treebank tags with Stanza's standard model for English
Analysis part-of-speech tagging token English
sbx-eng-pos-stanza-upos
Part-of-speech annotation with UD (universal dependency) tags with Stanza's standard model for English
Analysis part-of-speech tagging token English
sbx-eng-sentence-stanza
Sentence segmentation with Stanza's standard model for English
Analysis sentence segmentation sentence English
sbx-eng-tokenization-stanza
Tokenization with Stanza's standard model for English
Analysis tokenization token English
sbx-mul-paragraph-sparv-blanklines
Segments text into paragraphs by blank lines using the RegexpTokenizer from NLTK
Analysis tokenization paragraph
sbx-mul-paragraph-sparv-linebreaks
Segments text into paragraphs by linebreaks using the RegexpTokenizer from NLTK
Analysis paragraph segmentation paragraph
sbx-mul-paragraph-sparv-whitespace
Segments text into paragraphs by whitespaces using the RegexpTokenizer from NLTK
Analysis paragraph segmentation paragraph
sbx-mul-sentence-sparv-blanklines
Segments text into sentences by blank lines using the RegexpTokenizer from NLTK
Analysis tokenization sentence
sbx-mul-sentence-sparv-linebreaks
Segments text into sentences by linebreaks using the RegexpTokenizer from NLTK
Analysis sentence segmentation sentence
sbx-mul-sentence-sparv-punctuation
Segments text into sentences by punctuation marks using the RegexpTokenizer from NLTK
Analysis sentence segmentation sentence
sbx-mul-sentence-sparv-whitespace
Segments text into sentences by whitespaces using the RegexpTokenizer from NLTK
Analysis sentence segmentation sentence
sbx-mul-tokenization-sparv-blanklines
Tokenizes text into tokens by blank lines using the RegexpTokenizer from NLTK
Analysis tokenization token
sbx-mul-tokenization-sparv-linebreaks
Tokenizes text into tokens by linebreaks using the RegexpTokenizer from NLTK
Analysis tokenization token
sbx-mul-tokenization-sparv-whitespace
Tokenizes text into tokens by whitespaces using the RegexpTokenizer from NLTK
Analysis tokenization token
sbx-swe-compound-sparv-saldolemgram
Analysis of SALDO lemgram compounds including a probability ranking
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses compound analysis token Swedish
sbx-swe-compound-sparv-saldowords
Analysis of SALDO wordform compounds
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses compound analysis token Swedish
sbx-swe-dependency-malt-treebank
Swedish dependency parsing from MaltParser trained on Sweedish treebank
Analysis dependency parsing token Swedish
sbx-swe-dependency-stanza-stanzasynt
Swedish dependency parsing with Stanza trained on Sweedish treebank
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses dependency parsing token Swedish
sbx-swe-export-sparv-conllu
Export of corpus data in Språkbanken Text's CoNLL-U format
Utility export
sbx-swe-geotagcontext-sparv
Annotate text chunks with location data, based on locations contained within the text
Analysis sbx-swe-standard_analyses geotagging text Swedish
sbx-swe-geotagmetadata-sparv
Annotate text chunks with location data, based on metadata containing location names
Analysis geotagging text Swedish
sbx-swe-lemgram-sparv-saldo
Lookup for SALDO lemgrams
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical lookup token Swedish
sbx-swe-lemmatization-sparv-saldo
Full-form lookup for SALDO citation forms (lemmas)
Analysis lemmatization token Swedish
sbx-swe-lemmatization-sparv-saldo2
Full-form lookup for SALDO citation forms (lemmas) plus analysis of compounds made up of SALDO entries
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses lemmatization token Swedish
sbx-swe-lemmatization-stanza-stanzalem
Swedish citation form analysis (base forms, lemmas) by Stanza, trained on SUC3
Analysis lemmatization token Swedish
sbx-swe-lexical_classes_text-sparv-blingbring
Lexical classes from Blingbring on text-level
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes text Swedish
sbx-swe-lexical_classes_text-sparv-swefn
Lexical classes from SweFN on text-level
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes text Swedish
sbx-swe-lexical_classes_token-sparv-blingbring
Lexical classes from Blingbring on token-level
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes token Swedish
sbx-swe-lexical_classes_token-sparv-swefn
Lexical classes from SweFN on token-level
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses lexical classes token Swedish
Collection
sbx-swe-mink_analyses
Collection of analyses used in Mink
Analysis, Collection Swedish
sbx-swe-msd-hunpos-suc3
Annotation of morphological features (SUC) by Hunpos for Swedish
Analysis morphosyntactic tagging token Swedish
sbx-swe-msd-hunpos-suc3-1800
Annotation of morphological features (SUC) by Hunpos for Swedish from the 1800's
Analysis morphosyntactic tagging token Swedish
sbx-swe-msd-stanza-stanzamorph-suc3
Annotation of morphological features (SUC) by Stanza for Swedish
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses morphosyntactic tagging token Swedish
sbx-swe-msd-stanza-stanzamorph-ufeats
Stanza-based morphological analysis for Swedish, using universal features (UD)
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses morphosyntactic tagging token Swedish
sbx-swe-namedentity-swener
Named entity recognition (NER) recognises named entities such as locations, persons and time expressions in text.
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses named entity recognition Swedish
sbx-swe-phrasestructure-sparv
Swedish phrase structure parsing based on Mamba-Dep dependency analysis
Analysis phrase structure parsing Swedish
sbx-swe-pos-hunpos-suc3
Swedish part-of-speech annotation with SUC tags by Hunpos
Analysis part-of-speech tagging token Swedish
sbx-swe-pos-hunpos-suc3-1800
Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800's
Analysis part-of-speech tagging token Swedish
sbx-swe-pos-stanza-stanzamorph
Swedish part-of-speech annotation with SUC tags by Stanza
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses part-of-speech tagging token Swedish
sbx-swe-readability-sparv-lix
Annotation of Swedish texts with LIX values which indicate the difficulty of the texts
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses readability measures text Swedish
sbx-swe-readability-sparv-nk
Annotation of Swedish texts with nominal ratios which indicate the difficulty of the texts
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses readability measures text Swedish
sbx-swe-readability-sparv-ovix
Annotation of Swedish texts with OVIX values which indicate the difficulty of the texts
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses readability measures text Swedish
sbx-swe-sense-sparv
Word sense disambiguation based on SALDO annotation
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses sense disambiguation token Swedish
sbx-swe-sense-sparv-saldo
Lookup for SALDO identifiers
Analysis lexical lookup token Swedish
sbx-swe-sentence-sparv-storsuc
Segments text into sentences, custom-made for Swedish
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses sentence segmentation sentence Swedish
sbx-swe-sentiment-sparv-sensaldo
Sentiment analysis via lookup in SenSALDO
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses sentiment analysis token Swedish
Collection
sbx-swe-standard_analyses
Collection of Sparv analyses for modern Swedish
Analysis, Collection Swedish
sbx-swe-tokenization-sparv-betterword
Tokenizes text, custom-made for Swedish
Analysis sbx-swe-mink_analyses, sbx-swe-standard_analyses tokenization token Swedish
sbx-zxx-export-sparv-xml_preserved
XML corpus export preserving whitespaces from source file
Utility export
sbx-zxx-export-sparv-xml_pretty
XML corpus export where every token is printed on a new line
Utility export
sbx-zxx-export-sparv-xml_scrambled
XML corpus export with scrambled contents
Utility export
swe-sbx-ocr-correction-viklofg-sweocr
OCR correction annotations
Analysis ocr-correction
swe-sbx-word-prediction-kb-bert
Word prediction annotations for each word in a text.
Analysis word-prediction token