NoDaLiDa 2013 workshop | Språkbanken Text

Lexical semantic resources for NLP

Proceedings volume (online)

<http://www.ep.liu.se/ecp_home/index.en.aspx?issue=088>

Program

Time: Wednesday 22nd May, 2013, 12.00–17.00
Venue: Lecture room 2 (undervisningsrom 2), Georg Sverdrups hus at Blindern, University of Oslo

12.00–13.00	Lunch in Frederikke
13.00–13.05	Welcome
13.05–14.00	Invited presentation: Ontologies versus lexical semantics Graeme Hirst
14.00–14.30	Automatic identification of construction candidates for a Swedish constructicon Linnéa Bäckström, Lars Borin, Markus Forsberg, Benjamin Lyngfelt, Julia Prentice and Emma Sköldberg
14.30–15.00	Coffee break
15.00–15.30	LBK2013: A balanced, annotated national corpus for Norwegian Bokmål Rune Lain Knudsen and Ruth Vatvedt Fjeld
15.30–16.00	Clustering word senses from semantic mirroring data Hampus Lilliehöök and Magnus Merkel
16.00–16.30	Enriching a wordnet from a thesaurus Sanni Nimb, Bolette S. Pedersen, Anna Braasch, Nicolai H. Sørensen and Thomas Troelsgård
16.30–17.00	Concluding discussion

Call for papers

High-quality lexical semantic resources with sufficiently large vocabularies still prove to be a serious bottleneck not only in purely rule-based NLP applications but also in supervised corpus-based approaches. The oldest widely-known lexical semantic resource, Princeton WordNet (PWN), has been around for over two decades. While PWN and the numerous wordnet projects for other languages that it has inspired adhere fairly closely to the traditional dictionary in their conception and organization, there are also lexical-semantic resources where a closer integration of lexical data information and corpus data is attempted. Such resources can be seen either as extremely richly exemplified lexicons or extremely deeply annotated corpora, depending on your outlook. Berkeley FrameNet, VerbNet, PropBank and several others can be mentioned in this connnection. A recent trend in the wake of the increased awareness of the importance of standardization and interoperability of language resources, is the development towards large-scale integration of lexical resources (variously referred to as “lexical cores”, “lexical macroresources”, “lexical resource networks”, and the like) both within and across languages, the ultimate expression of which is at the moment the linked open data in linguistics movement.

For largely extraneous reasons, English-language resources tend to receive most attention in the LT literature, but there is an increasing number of lexical semantic resources under development for many other languages, including Nordic, Baltic and other languages of the NEALT area.

In parallel to this development of new lexical semantic resources, much effort is put into exploring how such resources and formal ontologies can be made to work together in knowledge-based systems. The workshop – a follow-up on the succesful Nodalida 2009 workshop where the focus was on wordnets – intends to bring together researchers involved in building and integrating lexical semantic resources for NLP as well as researchers that are more theoretically interested in investigating the interplay between lexical semantics, lexicography, terminology and formal ontologies.

We invite papers presenting original research relating to lexical semantic resources for NLP on topics such as:

representation of lexical-semantic knowledge for computational use
the interplay between formal ontologies and lexical resources
corpus-based approaches to lexical semantic resources
terminology and lexical semantics: concept-based vs lexical semantic approaches
monolingual vs. multilingual approaches to lexical-semantic resources and ontologies
word-space models for building and expanding ontologies
domain-specific classification: taxonomy and ontology – computational aspects
quality assessment of lexical-semantic resources: criteria, methods
computational use of lexical-semantic resources (information retrieval, semantic tagging of corpora, MT, etc.)
traditional lexicography and NLP lexicons: re-use and differences
cognitive aspects: computational lexical models as opposed to the ‘mental lexicon’

Submission

Papers should conform to the main Nodalida stylesheet .

Submissions must be anonymous, i.e. not reveal author(s) on the title page or through self-references. Papers must be submitted digitally, in PDF, and uploaded through the on-line conference system. Paper submissions that violate either of these requirements will be returned without review.

The page limit for submissions is up to fourteen pages of text, plus unlimited additional pages with bibliographic references. Please note that NoDaLiDa 2013 adapts a single-column, smaller page format, optimized for on-screen reading. In terms of actual word counts, this page limit corresponds to approximately eight pages in a ‘classic’, two-column conference proceedings layout.

All submissions to the workshop must be uploaded electronically, following the above requirements. All submissions will be reviewed by the program committee. All accepted papers will be collected into a proceedings volume to be submitted for publication in the NEALT Proceeding Series (Linköping Electronic Conference Proceedings).

Important dates

24th March (extended deadline): paper submission
11th April: notification of acceptance
25th April: camera-ready papers for publication.
22nd May: Workshop

Invited speaker

Graeme Hirst (University of Toronto)

Workshop organizers

Lars Borin (University of Gothenburg; Organizing chair)
Ruth Vatvedt Fjeld (University of Oslo)
Markus Forsberg (University of Gothenburg)
Sanni Nimb (Association for Danish Language and Literature)
Pierre Nugues (Lund University)
Bolette Sandford Pedersen (University of Copenhagen)

Program committee

Lars Borin (University of Gothenburg)
Ruth Vatvedt Fjeld (University of Oslo)
Markus Forsberg (University of Gothenburg)
Karin Friberg Heppin (University of Gothenburg)
Richard Johansson (University of Gothenburg)
Rune Lain Knudsen (University of Oslo)
Dimitrios Kokkinakis (University of Gothenburg)
André Lynum (University of Oslo)
Sanni Nimb (Association for Danish Language and Literature)
Pierre Nugues (Lund University)
Bolette Sandford Pedersen (University of Copenhagen)
Joel Priestley (University of Oslo)

For all inquiries, please email Lars Borin <lars dot borin at svenska dot gu dot se>.