The Swedish Parliament (Riksdagen) continuously releases open data on its website, which includes documents approved and used during parliamentary sessions as well as what each member of parliament votes during each roll call (voting session).
In recent years, neural network based approaches (i.e. deep learning) have been the main models for state-of-the-art systems in natural language processing, whether that is in machine translation, natural language inference, language modeling or sentiment analysis.
In our research group, we are exploring ways of analysing language to find early signs of possible cognitive impairment, which may develop to dementia.
At Språkbanken we collect resources, mainly lexica and corpora, most of them in Swedish. So far we have collected Swedish corpora totalling 13 billions of words, in all kinds of genres and from all time periods.
Among the flurry of Språkbanken’s historical resources we find the Kubhist corpus – a diachronic collection of historical newspaper texts – in two versions: Kubhist 1 spanning the time period of 1750–1950, and Kubhist 2 spanning the time period of 1645–1926.
Ordet tsunami var helt okänt för de flesta i Sverige före julhelgen 2004. Då inträffade ju det som så småningom kom att kallas tsunamikatastrofen, en förfärlig naturkatastrof som skördade otaliga dödsoffer i Sydasien och Sydostasien.