Skip to main content

Corpus-driven induction of linguistic knowledge

Project description

Recent years have seen a number of technical advances in methods that use unannotated corpora to automatically learn representations of meaning, primarily of single words but increasingly often also of larger linguistic units. On the other hand, very significant efforts have been invested in the development of structured linguistic resources such as grammars and lexicons. The project will explore synergies between these two approaches to the acquisition of linguistic knowledge: we will apply corpus-driven methods as a way to expand and correct existing linguistic resources, and conversely we will use hand-crafted resources as additional sources of supervision when learning meaning representations automatically.

Software

  • SCOUSE, a tool that embeds a sense network into a word space.

Publications BibTeX

2017 BibTeX

2016 BibTeX

2015 BibTeX

2014 BibTeX

Project duration

Project members

  • Richard Johansson
  • Luis Nieto Piña

Funding

  • Swedish Research Council (2013-4944)

Project type

  • Research project