We are lemmatizing the Swedish material. This is, of course, a recurring task that has to be repeated every time we add a text. In the end it will give us fully lemmatized material, which opens up many interesting possibilities concerning the link between bilingual dictionaries and parallel corpora, among other things.
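As a rough illustration of how lemmatized material could connect a dictionary headword to the corpus, the sketch below indexes aligned sentence pairs by Swedish lemma. The data, the English target side, and the function names (`build_lemma_index`, `lookup`) are all hypothetical and chosen only for illustration; this is not the project's actual format or tooling.

```python
from collections import defaultdict

# Hypothetical toy data: each aligned pair holds the lemmatized Swedish
# tokens and the tokens of the corresponding target-language sentence.
aligned_pairs = [
    (["hund", "springa"], ["the", "dog", "runs"]),
    (["katt", "sova"], ["the", "cat", "sleeps"]),
    (["hund", "sova"], ["the", "dog", "sleeps"]),
]


def build_lemma_index(pairs):
    """Map each Swedish lemma to the ids of the pairs it occurs in."""
    index = defaultdict(list)
    for pair_id, (swedish_lemmas, _target_tokens) in enumerate(pairs):
        # Use a set so a lemma repeated within one sentence is indexed once.
        for lemma in set(swedish_lemmas):
            index[lemma].append(pair_id)
    return index


def lookup(index, pairs, lemma):
    """Return every aligned pair whose Swedish side contains the lemma."""
    return [pairs[pair_id] for pair_id in index.get(lemma, [])]


lemma_index = build_lemma_index(aligned_pairs)
hits = lookup(lemma_index, aligned_pairs, "hund")
```

Because the index is keyed on lemmas rather than surface forms, a single dictionary headword retrieves all inflected occurrences, provided the lemmatization itself is correct.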
We are improving the interface to make the material searchable for students and researchers here at the faculty. This is a response to the increased interest in the material shown by people outside the immediate framework of the project, among them teachers of second language learning.
Some of the texts in PEDANT have been used for automatic alignment down to the word level, with an accuracy of up to 83%. This work was carried out on a very special type of document and will be the subject of a future report.