Skip to main content
Researchers: 24 of which 8 PhD students • Research engineers: 12 • Active projects: 10 • Data sets: 1305 • Analyses: 56 • Data points: >34.4G

What we do

Språkbanken Text is a part of Språkbanken, a national e-infrastructure that supports research based on linguistic data.

We develop, refine, and make freely available language data and language technology analyses, with a special focus on the Swedish language throughout history.

We develop freely available digital research platforms, where we aim to support all types of research where language data is central.

We conduct our own research in language technology, including language-based AI, and participate in projects in other disciplines.

More about what we do

Word of the week

An example of how to use the resources and API:s of Språkbanken Text.

siren

siren, bedårande o. lockande kvinnlig varelse, kokett, egentl.: havsjungfru som dårar med sin sång = ty. sirene, fra. siréne osv., från lat. siren, av grek. sei-rén, plur. seirenes, av, trots framställda förklaringsförsök, alltjämt dunkelt ursprung. I betyd. 'ångvissla' möjl. även påverkat från lat. syrinx, rör (se syren).
Svensk etymologisk ordbok, Hellquist, 1922

Fifty years ago, the Logoteket was established at the University of Gothenburg. Today, Språkbanken is a national research infrastructure with extensive national and international collaborations. This makes us one of the world's oldest research and development units in language technology.

 

Språkbanken was envisioned in an op-ed piece written by Sture Allén for the Swedish daily Dagens Nyheter in September 1970. In 1973, the Computational Linguistics Unit submitted a formal proposal to the Ministry of Education, requesting earmarked funding for what was to become Språkbanken. Two years later, this research infrastructure became a reality, when the Logotheque (as it was called initially) was established with national funding in 1975.

 

Read a brief history of Språkbanken.