Skip to main content


Språkbanken's research unit develops state-of-the-art language technology and pursues theoretical and practical aims within different research areas. Our research focuses both on language technology itself (creating comprehensive, high-quality resources that are needed to develop tools and algorithms) and on questions from other disciplines.

Cassandra: Explaining and predicting short-term language change in Contemporary Swedish

Linguists often try to explain language change, but all our explanations are necessarily post hoc and thus difficult to evaluate. What happens if we turn to the future instead of the past and try to predict language change?
  • Aleksandrs (Sasha) Berdicevskis
  • Yvonne Adesam
  • Nina Tahmasebi
  • Evie Coussé
  • linguistics
  • computational linguistics
  • language change
  • language evolution
  • sociolinguistics
cassandra logo

Change is Key!

This program has two main aims, firstly to develop corpus-based methods for detecting semantic change (over time) and variation (across social groups and media). This will create general tools for the study and detection of language change at large-scale and directly benefit historical linguistics and lexicography. Secondly, we will collaborate with researchers from social sciences, gender studies, and literature to answer their research questions. We will develop tools, evaluation data, and research methodology for their specific needs.
  • Nina Tahmasebi
  • Simon Hengchen
  • Haim Dubossarsky
  • Dominik Schlechtweg
  • Shafqat Virk
  • Emma Sköldberg
  • Mats Malm
  • Mia Liinason
  • Sarah Valdez
  • Dirk Geeraerts
  • Stefano de Pascale
  • lexical-semantic-change

Hot och hat mot journalister

Under vilka förutsättningar försvagas yttrandefrihet och demokrati av hat och hot mot journalister online?
  • Peter Ljunglöf
  • Oscar Björkenfeldt
  • Måns Svensson


HUMINFRA  är en ny distribuerad, nationell infrastruktur för forskning inom humaniora, konst och samhällsvetenskap.
  • Gerlof Bouma
  • Markus Forsberg
  • Dimitrios Kokkinakis
  • Elena Volodina

Linguistic networks: Connecting constructions within and between languages

This project uses Construction Grammar to develop a linguistic network that (a) accounts for Swedish grammatical constructions and (b) connects them to constructions in other languages.
  • Benjamin Lyngfelt
  • Peter Ljunglöf
  • Jonatan Uppström
  • Maia Andreasson
  • Linnea Bäckström
  • Steffen Höder
  • linguistic typology

Market Language

The market Language primarily is funded by MAW in which we look at the changing concepts around “the market”. They have transitioned from implying a concrete physical market to increasingly abstract markets like Europe-wide iron markets, as well as marriage and dating markets. They have also increasingly become actors in our lives, “the market reacted badly to the new corona restrictions”. We will complement the conceptual historians in-depth analyses with computational models of change. This project ranges 2022-2025.
  • Henrik Björck
  • Shafqat Virk
  • Claes Ohlsson

Grandma Karl

Accessibility of research data is critical for advances in many research fields, but textual data often cannot be shared due to the presence of personal and sensitive information, e.g names, political opinions. GDPR suggests pseudonymization as a solution, but we need to learn more about it before adopting it for manipulation of research data.
  • Elena Volodina
  • Simon Dobnik
  • Xuan-Son Vu
  • Therese Lindström Tiedemann
  • pseudonymization
  • research data
  • språkteknologi
  • allmän lingvistik
  • svenska som andraspråk
  • pseudonymisering
  • dataintegritet
  • forskningsdata

Rumour mining

The aim of the project is to investigate the role and importance of rumouring for the vaccination skepticism growing on the internet, and how it can be understood as an expression of civic engagement in the present digital times entailing crucial transformations for everyday civic culture.
  • Dimitrios Kokkinakis
  • Lars Borin
  • Mia-Marie Hammarlin
  • Fredrik Miegel
  • digital humanities

Svenska Akademiens samtidsordböcker

Inom ramarna för projektet förvaltas och vidareutvecklas Lexikalisk databas. Vidare bedrivs arbete med Svenska Akademiens båda samtidsordböcker Svenska Akademiens ordlista (SAOL) och Svensk ordbok utgiven av Svenska Akademien (SO). Arbetet sker på uppdrag av och i samarbete med Svenska Akademien.
  • Kristian Blensenius
  • Markus Forsberg
  • Louise Holmer
  • Hans Landqvist
  • Stellan Petersson
  • Emma Sköldberg
  • Jonatan Uppström
  • Ann Lillieström

Xhosa Corpus

Språkbanken Text collaborates with the Department of Philosophy, Linguistics and Theory of Science to create an annotated corpus of Xhosa, an underresourced Bantu language of South Africa (also known as isiXhosa and Xosa).
  • Anne Schumacher
  • Martin Hammarstedt
  • Aleksandrs (Sasha) Berdicevskis
  • Markus Forsberg
  • Eva-Marie Karin Bloom Ström
  • Aron Einar Zahran
  • Onelisa Slater
  • linguistic typology
  • field linguistics
  • African languages
  • Bantu languages
  • glossing
 Proportion of the South African population that speaks isiXhosa as their first language, according to Census 2011 at electoral ward level