A multilingual annotated corpus of world’s natural language descriptions

Shafqat Mumtaz Virk, Harald Hammarström, Markus Forsberg, Søren Wichmann The diversity of 7000 languages of the world represents an irreplaceable and abundant resource for understanding the unique communication system of our species (Evans and Levinson, 2009). All comparison and analysis of languages departs from language descriptions — publications that contain facts about particular languages. The typical examples of this genre are grammars and dictionaries (Hammarström and Nordhoff, 2011). Until recently, language descriptions were available in paper form only, with indexes as the …