Skip to main content
Svenska
English
Språkbanken Text is a part of
Språkbanken
.
News and events
Research
Tools
Data
FAQ
About us
Contact us
Menu
Breadcrumb
Home
Analyses
Analyses
Analyses
Search our analyses. You can click on a row to see the details.
All (57)
Collections (2)
Analyses (51)
Utilities (4)
Name or description
Task
- Any -
compound analysis
dependency parsing
export
geotagging
lemmatization
lexical classes
lexical lookup
morphosyntactic tagging
named entity recognition
ocr-correction
paragraph segmentation
part-of-speech tagging
phrase structure parsing
readability measures
sense disambiguation
sentence segmentation
sentiment analysis
tokenization
word-prediction
Unit
- Any -
paragraph
sentence
text
token
Language
- Any -
Swedish
English
Analysis
Sort descending
Collections
Task
Unit
Language
eng-dependency-stanza
Dependency parsing with Stanza's standard model for English
dependency parsing
token
English
eng-lemmatization-stanza
Lemmatization with Stanza's standard model for English
lemmatization
token
English
eng-msd-stanza-ufeats
Stanza-based morphological analysis for English, using universal features (UD)
morphosyntactic tagging
token
English
eng-namedentity-stanza
Named entity recognition with Stanza's standard model for English
named entity recognition
English
eng-pos-stanza
Part-of-speech annotation with Penn Treebank tags with Stanza's standard model for English
part-of-speech tagging
token
English
eng-pos-stanza-upos
Part-of-speech annotation with UD (universal dependency) tags with Stanza's standard model for English
part-of-speech tagging
token
English
eng-sentence-stanza
Sentence segmentation with Stanza's standard model for English
sentence segmentation
sentence
English
eng-tokenization-stanza
Tokenization with Stanza's standard model for English
tokenization
token
English
Collection
mink-analyses
Collection of analyses used in Mink
Swedish
paragraph-sparv-blanklines
Segments text into paragraphs by blank lines using the RegexpTokenizer from NLTK
tokenization
paragraph
paragraph-sparv-linebreaks
Segments text into paragraphs by linebreaks using the RegexpTokenizer from NLTK
paragraph segmentation
paragraph
paragraph-sparv-whitespace
Segments text into paragraphs by whitespaces using the RegexpTokenizer from NLTK
paragraph segmentation
paragraph
sentence-punkt
Segments text into sentences by punctuation marks using the RegexpTokenizer from NLTK
sentence segmentation
sentence
sentence-sparv-blanklines
Segments text into sentences by blank lines using the RegexpTokenizer from NLTK
tokenization
sentence
sentence-sparv-linebreaks
Segments text into sentences by linebreaks using the RegexpTokenizer from NLTK
sentence segmentation
sentence
sentence-sparv-whitespace
Segments text into sentences by whitespaces using the RegexpTokenizer from NLTK
sentence segmentation
sentence
Collection
standard-analyses-swe
Collection of Sparv analyses for modern Swedish
Swedish
swe-compound-sparv-saldolemgram
Analysis of SALDO lemgram compounds including a probability ranking
mink-analyses
,
standard-analyses-swe
compound analysis
token
Swedish
swe-compound-sparv-saldowords
Analysis of SALDO wordform compounds
mink-analyses
,
standard-analyses-swe
compound analysis
token
Swedish
swe-dependency-malt-treebank
Swedish dependency parsing from MaltParser trained on Sweedish treebank
dependency parsing
token
Swedish
swe-dependency-stanza-stanzasynt
Swedish dependency parsing with Stanza trained on Sweedish treebank
mink-analyses
,
standard-analyses-swe
dependency parsing
token
Swedish
swe-geotagcontext-sparv
Annotate text chunks with location data, based on locations contained within the text
standard-analyses-swe
geotagging
text
Swedish
swe-geotagmetadata-sparv
Annotate text chunks with location data, based on metadata containing location names
geotagging
text
Swedish
swe-lemgram-sparv-saldo
Lookup for SALDO lemgrams
mink-analyses
,
standard-analyses-swe
lexical lookup
token
Swedish
swe-lemmatization-sparv-saldo
Full-form lookup for SALDO citation forms (lemmas)
lemmatization
token
Swedish
swe-lemmatization-sparv-saldo2
Full-form lookup for SALDO citation forms (lemmas) plus analysis of compounds made up of SALDO entries
mink-analyses
,
standard-analyses-swe
lemmatization
token
Swedish
swe-lemmatization-stanza-stanzalem
Swedish citation form analysis (base forms, lemmas) by Stanza, trained on SUC3
lemmatization
token
Swedish
swe-lexical_classes_text-sparv-blingbring
Lexical classes from Blingbring on text-level
mink-analyses
,
standard-analyses-swe
lexical classes
text
Swedish
swe-lexical_classes_text-sparv-swefn
Lexical classes from SweFN on text-level
mink-analyses
,
standard-analyses-swe
lexical classes
text
Swedish
swe-lexical_classes_token-sparv-blingbring
Lexical classes from Blingbring on token-level
mink-analyses
,
standard-analyses-swe
lexical classes
token
Swedish
swe-lexical_classes_token-sparv-swefn
Lexical classes from SweFN on token-level
mink-analyses
,
standard-analyses-swe
lexical classes
token
Swedish
swe-msd-hunpos-suc3
Annotation of morphological features (SUC) by Hunpos for Swedish
morphosyntactic tagging
token
Swedish
swe-msd-hunpos-suc3-1800
Annotation of morphological features (SUC) by Hunpos for Swedish from the 1800's
morphosyntactic tagging
token
Swedish
swe-msd-stanza-stanzamorph-suc3
Annotation of morphological features (SUC) by Stanza for Swedish
mink-analyses
,
standard-analyses-swe
morphosyntactic tagging
token
Swedish
swe-msd-stanza-stanzamorph-ufeats
Stanza-based morphological analysis for Swedish, using universal features (UD)
mink-analyses
,
standard-analyses-swe
morphosyntactic tagging
token
Swedish
swe-namedentity-swener
Named entity recognition (NER) recognises named entities such as locations, persons and time expressions in text.
mink-analyses
,
standard-analyses-swe
named entity recognition
Swedish
swe-phrasestructure-sparv
Swedish phrase structure parsing based on Mamba-Dep dependency analysis
phrase structure parsing
Swedish
swe-pos-hunpos-suc3
Swedish part-of-speech annotation with SUC tags by Hunpos
part-of-speech tagging
token
Swedish
swe-pos-hunpos-suc3-1800
Part-of-speech annotation with SUC tags by Hunpos for Swedish from the 1800's
part-of-speech tagging
token
Swedish
swe-pos-stanza-stanzamorph
Swedish part-of-speech annotation with SUC tags by Stanza
mink-analyses
,
standard-analyses-swe
part-of-speech tagging
token
Swedish
swe-readability-sparv-lix
Annotation of Swedish texts with LIX values which indicate the difficulty of the texts
mink-analyses
,
standard-analyses-swe
readability measures
text
Swedish
swe-readability-sparv-nk
Annotation of Swedish texts with nominal ratios which indicate the difficulty of the texts
mink-analyses
,
standard-analyses-swe
readability measures
text
Swedish
swe-readability-sparv-ovix
Annotation of Swedish texts with OVIX values which indicate the difficulty of the texts
mink-analyses
,
standard-analyses-swe
readability measures
text
Swedish
swe-sbx-ocr-correction-viklofg-sweocr
OCR correction annotations
ocr-correction
swe-sbx-word-prediction-kb-bert
Word prediction annotations for each word in a text.
word-prediction
token
swe-sense-sparv-saldo
Lookup for SALDO identifiers
lexical lookup
token
Swedish
swe-sense-wsd
Word sense disambiguation based on SALDO annotation
mink-analyses
,
standard-analyses-swe
sense disambiguation
token
Swedish
swe-sentence-punkt-storsuc
Segments text into sentences, custom-made for Swedish
mink-analyses
,
standard-analyses-swe
sentence segmentation
sentence
Swedish
swe-sentiment-sparv-sensaldo
Sentiment analysis via lookup in SenSALDO
mink-analyses
,
standard-analyses-swe
sentiment analysis
token
Swedish
swe-tokenization-sparv-betterword
Tokenizes text, custom-made for Swedish
mink-analyses
,
standard-analyses-swe
tokenization
token
Swedish
tokenization-sparv-blanklines
Tokenizes text into tokens by blank lines using the RegexpTokenizer from NLTK
tokenization
token
tokenization-sparv-linebreaks
Tokenizes text into tokens by linebreaks using the RegexpTokenizer from NLTK
tokenization
token
tokenization-sparv-whitespace
Tokenizes text into tokens by whitespaces using the RegexpTokenizer from NLTK
tokenization
token
News and events
News archive
Conferences and workshops
CLT retreat 2020
AI Trust workshop
Autumn Workshop
Höstworkshop 2024
Höstworkshop 2023
Höstworkshop 2022
Höstworkshop 2021
Autumn Workshop 2020
Autumn Workshop 2011 and Korp-release
Autumn Workshop 2012
Autumn Workshop 2013
Autumn Workshop 2014
Autumn Workshop 2015
Autumn Workshop 2016
Autumn Workshop 2017
Autumn Workshop 2018
Autumn Workshop 2019
Språkbanken 40 years
CDLC workshop
CLT workshop Spring 2023
EACL 2014
Korp Workshop
Korp Workshop 2014
Korpworkshop 2018
NoDaLiDa 2017
RESOURCEFUL
SLTC 2020
Programme
Instructions
People
Support
Call for papers
Sustainable language representations
Position statements
Workshop on Profiling second language vocabulary and grammar - 2023
Blog
Calendar
Previous events
Research
Publications
Doktorandutbildning
For PhD students and supervisors
Tools
Korp
User manual
Web API
Distribution and development
Corpus statistics
Sentence sets
Karp
Web API
Sparv
Sparv Pipeline
Sparv's user manual
Annotations by Sparv
Web service (API)
Web Sparv
Mink
User manual
Tutorial
Web API
Privacy and data policy
Lärka
Other tools
Catta
IT-baserad grammatikinlärning
Data
FAQ
About us
Staff
Organisation
Språkbanken Text i världen
Språkbanken 50 years
Celebration
PhD program
Teaching
How to cite
Alumni
Meetings and workshops
Kick-off meetings
Kick-off H2021
Kick-off V2021
Kick-off H2020
Kick-off V2020
Kick-off H2019
Kick-off V2019
Kick-off H2018
Kick-off V2018
Kick-off H2017
Kick-off V2017
Kick-off H2016
Kick-off V2016
Kick-off H2015
Workshops
End of the year workshop 2024
End of the year workshop 2023
Semester workshop 2022
Semester workshop H2021
Semester workshop V2021
Semester workshop H2020
Semester workshop V2020
Research meetings
Gruppmöten
SBX Retreat
SBX Retreat 2024
SBX Retreat 2023
SBX Retreat 2022
Cookies
Internal
Contact us
Help desk