I am a researcher at Språkbanken, University of Gothenburg. My main research interests are in the areas of language engineering, computational linguistic resources development, semantic parsing, and information extraction. During my PhD, I worked with Grammatical Framework group at University of Gothenburg, and developed resource grammars for six Indo-Iranian languages including Hindi, Urdu, Persian, Punjabi, Nepali, and Sindhi. These grammars are in the form of software libraries and encode different syntactical and lexical aspects of those natural languages.
Recently, I have been mostly involved in semantic parsing and information extraction. As a postdoctorate researcher at Academia Sinica, Taiwan, I worked on a Propbank based semantic role labeling and an information extraction systems for English and Chineese. I was also involved in the French FrameNet project at IRIT, France, where we worked on a FrameNet based semantic role labeling system, and on the extension of the relational structure of English FrameNet.
At Språkbanken, I am involved in the South Asia as a Linguistic Area project, where a primary goal is do a systematic investigation of the claim that South Asia is a classic linguistic area. With the Grierson’s Linguistic Survey of India (LSI; 1903-1927) as a primary data source, we are exploring the use of computational methods for automatic extraction of useful information about various grammatical features of South Asian languages described in the LSI. Largely relying on semantic parsing and open information techniques, we are developing methodologies and/or tools which we expect to be useful for a wider audience.