PhD student in Natural Language Processing
Our lives are deeply intertwined with technology today which allows us to perform our daily tasks simpler, better, faster. The broader aim of my research is to explore how we can exploit the advantages offered by Natural Language Processing (NLP) technologies for the purposes of teaching and learning foreign or second (L2) languages. My specific research interests include the automatic assessment of proficiency levels and linguistic complexity in texts written by and for language learners as well as automatic exercise generation, especially the selection of exercise item candidates from corpora. Since the start of my PhD studies I have been collaborating in the creation of a number of new Swedish lexical resources and corpora with an L2 focus. Moreover, I am actively involved in a number of communities and in the organization of different academic events related both to my narrower research interests and NLP in general. For more information see the "Research" tab.
I devote a considerable amount of effort to making the outcome of my research available to the general public free of use through backend programming for the online learning platform Lärka. Some examples:
Most recent publications
Ildikó Pilán, Elena Volodina and Torsten Zesch. (2016). Predicting proficiency levels in learner writings by transferring a linguistic complexity model from expert-written coursebooks. In Proceedings of the 26th International Conference on Computational Linguistics (COLING), pp. 2101-2110, Osaka, Japan. [pdf]
Ildikó Pilán, Elena Volodina and David Alfter. (2016). Coursebook texts as a helping hand for classifying linguistic complexity in language learners' writings. In Proceedings of the workshop on Computational Linguistics for Linguistic Complexity (CL4LC), COLING 2016, Osaka, Japan. [pdf]
Ildikó Pilán, Elena Volodina. Classification of Language Proficiency Levels in Swedish Learners' Texts. In Proceedings of Swedish Language Technology Conference (SLTC), 2016, Umeå, Sweden. [pdf]
Ildikó Pilán. Detecting Context Dependence in Exercise Item Candidates Selected from Corpora. In Proceedings of the 11th Workshop on Innovative Use of NLP for Building Educational Applications, NAACL 2016, San Diego. [pdf]
David Alfter, Yuri Bizzoni, Anders Agebjörn, Elena Volodina and Ildikó Pilán. From Distributions to Labels: A Lexical Proficiency Analysis using Learner Corpora. In Proceedings of the workshop on NLP4CALL&LA. NEALT Proceedings Series / Linköping Electronic Conference Proceedings, Umeå, Sweden. [pdf]
Elena Volodina, Ildikó Pilán, David Alfter. Classification of Swedish learner essays by CEFR levels. To appear in Proceedings of EuroCALL 2016. Limassol, Cyprus.
Elena Volodina, Ildikó Pilán, Lorena Llozhi, Baptiste Degryse, Thomas François. SweLLex: second language learners' productive vocabulary. In Proceedings of the workshop on NLP4CALL&LA. NEALT Proceedings Series / Linköping Electronic Conference Proceedings, Umeå, Sweden. [pdf]
Elena Volodina, Ildikó Pilán, Ingegerd Enström, Peter Lundkvist, Gunlög Sundberg, Lorena Llozhi, Monica Sandell. 2016. SweLL – en korpus med L2 uppsatser för CEFR studier. Proceedings of Svenskans beskrivning 35, Göteborg 2016. [pdf]
Elena Volodina, Ildikó Pilán, Ingegerd Enström, Lorena Llozhi, Peter Lundkvist, Gunlög Sundberg, Monica Sandell. SweLL on the rise: Swedish Learner Language corpus for European Reference Level studies. To appear in Proceedings of LREC 2016, Slovenia. [pdf]
Thomas François, Elena Volodina, Ildikó Pilán, Anaïs Tack. SVALex: a CEFR-graded lexical resource for Swedish foreign and second language learners. To appear in Proceedings of LREC 2016, Slovenia. [pdf]
Additional publications can be found under the 'Publications' tab.