Analyzing data from the Swedish Parliament

The Swedish Parliament (Riksdagen) continuously releases open data on its website, which includes documents approved and used during parliamentary sessions as well as what each member of parliament votes during each roll call (voting session). This data can be used to gain insight on what topics members of parliament and parties discuss and vote. In the following post, I will provide some example analyses that were performed with Python, but it could be done similarly with many other programming languages with data …

What are probing tasks in NLP?

In recent years, neural network based approaches (i.e. deep learning) have been the main models for state-of-the-art systems in natural language processing, whether that is in machine translation, natural language inference, language modeling or sentiment analysis. At the same time researchers have asked themselves what kind of linguistic information these neural networks are able to capture. Answering this question is not a trivial undertaking: state-of-the-art model’s are usually multiple layers deep with non-linear transformations learned through billions of mathematical operations. The benchmarks …

Using Språkbanken corpora in NLTK

At Språkbanken we collect resources, mainly lexica and corpora, most of them in Swedish. So far we have collected Swedish corpora totalling 13 billions of words, in all kinds of genres and from all time periods. Most of our corpora are not manually annotated, and the ones that are annotated usually have only one kind of annotation (e.g., part of speech, lemmas, dependency structures, constituent structure, etc). To be able to use the same tools to analyse any corpus, we have devised …

Vad är en tsunami för slags våg egentligen?

Ordet tsunami var helt okänt för de flesta i Sverige före julhelgen 2004. Då inträffade ju det som så småningom kom att kallas tsunamikatastrofen, en förfärlig naturkatastrof som skördade otaliga dödsoffer i Sydasien och Sydostasien. Det är klart att en tsunami är en sorts våg, men vilken sorts våg handlar det om? Hur talade man om detta naturfenomen på svenska före 2004? I fackkretsar – bland seismologer, etc. – har man förstås länge känt till ordet tsunami, men även i allmänspråket dyker …