Språkbanken (the Swedish Language Bank)

In recognition of the groundbreaking corpus linguistic work initiated by Sture Allén at the University of Gothenburg in the 1960s (which had resulted in the creation of one of the first large electronic text corpora in another language than English, Press-65, one million words of newstext), Språkbanken (the Swedish Language Bank) was established in 1975 as a national center with a remit to collect, process and store (Swedish) text corpora, and to make linguistic data extracted from the corpora available to researchers and to the public. Since then, Språkbanken has developed into a nationally and internationally acknowledged research unit whose work focuses on the development of linguistic resources and tools, and methodologies for using the resources in research in language technology and a number of other disciplines.
Read more...

Språkbanken tidbits

Reload page for new tidbit.

Corpora

Korp
corpus search interface

antal korpusar 141
token (total)1 286 980 624
antal meningar89 934 873

Annotation lab
The annotation lab of Korp

Finland Swedish texts
token: 50 570 560
sentences: 2 874 205

Swedish Wikipedia Corpus
tokens: 90 120 941
sentences: 5 978 611

more...

Lexical resources

Karp
Search in the lexical resources

lexica count22
entries (total)704 689

SweFN++
The Swedish framenet project

SweCxn
The Swedish constructicon

SALDO
Lexical resource for Swedish language technology

Söderwall och Schlyter
Dictionary of Old Swedish

Dalins ordbok
Dictionary of 19th century Swedish

Svenska ord, LEXIN
Basic Swedish dictionary

SAOB
The Swedish Academy Dictionary

more ...


Development

© University of Gothenburg 2009, Box 100, 405 30 Gothenburg, Sweden
Tel +46 31 786 0000, Contact

About the site

X
Loading