Taraka Rama
I am enrolled in GSLT. My interests are in creation of language resources, unsupervised methods and computational historical linguistics. I worked on cognate identification and phylogeny in my Masters' thesis.
I am a member of ASJP Consortium.
My first supervisor is Lars Borin. My second supervisor is Søren Wichmann
I am present here
There is an interesting open-source NLP toolkit for South Asian Languages Sanchay (means a Collection in Sanskrit).
My favourite links: phd comics, hal daume's blog, chomsky
Presentations
Taraka Rama, Sudheer Kolachina. Distance-based algorithms in the subgrouping of Dravidian languages. Workshop on comparing approaches to measuring linguistic differences. 2011 [pdf]
Søren Wichmann, Taraka Rama, Eric W. Holman. Phonological diversity, Mean word length and Population Sizes across worlds' languages. CLT Retreat 2011 [pdf]
Sudheer Kolachina, Taraka Rama. Revisiting Unchanged Cognates as criterion in Linguistic Subgrouping. ICHL, Osaka 2011.[pdf]
Taraka Rama, Lars Borin. Estimating language distances from parallel corpus. A study of Europarl
corpus NODALIDA 2011. Latvia. [pdf]
Sudheer Kolachina, Taraka Rama, Lakshmi Bai. Maximum Parsimony for subgrouping in Dravidian. QITL, Berlin 2011. [pdf]
Taraka Rama. Explorations in Phoneme N-grams for Automatic Language Classification CLT Seminar, March 2011. [pdf]
Publications
Søren Wichmann, Eric W. Holman, Taraka Rama, Robert Walker. Correlates of reticulation in linguistic phylogenies. Language Dynamics and Change [paper]
Taraka Rama, Sudheer Kolachina. Distance-based Phylogenetic Inference Algorithms
in the Subgrouping of Dravidian Languages. Submitted [paper]
Wichmann, Søren, Taraka Rama, and Eric W. Holman. Phonological diversity, word length, and population sizes across languages: The ASJP evidence. Linguistic Typology. [paper],[supplementary materials]
Taraka Rama, Lars Borin. 2011 Estimating language relationships from a parallel corpus. A study of the Europarl corpus. NODALIDA [pdf]
Sudheer Kolachina, Taraka Rama, Lakshmi Bai. 2011 Maximum parsimony method in the subgrouping of Dravidian languages. QITL-4 [pdf]
Anil Kumar Singh, Sethuramalingam Subramaniam and Taraka Rama. 2010 Transliteration as Alignment vs. Transliteration as Generation for the Purpose of Crosslingual Information Retrieval. Traitement Automatique des Langues, Special Issue on Multilingualism and NLP. Vol. 51, Number 2. 2010. [pdf][Bibtex]
Taraka Rama, Sudheer Kolachina and Lakshmi Bai B. 2009 Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian. Indian Linguistics, Vol. 70, 2009.[pdf] [Bibtex]
Karthik Gali, Sriram Venkatapathy and Taraka Rama. 2009 From Factorial to Quadtratic Time Complexity for Sentence Realization using Nearest Neighbour Algorithm. STIL 2009, Brazil
Taraka Rama, Anil Kumar Singh. 2009 From Bag of Languages to Family Trees from Noisy Corpus. RANLP 2009, Borovets, Bulgaria.[pdf]
Taraka Rama, Karthik Gali. 2009 Modeling Transliteration as a Phrase Based Statistical Machine Translation Problem, NEWS 2009, ACL-IJCNLP 2009, Singapore [pdf]
Taraka Rama, Anil Kumar Singh and Sudheer Kolachina. 2009 Modeling Letter to Phoneme Conversion as a Phrase Based Statistical Machine Translation Problem with Minimum Error Rate Training NAACL HLT 2009 Student Research Workshop, Boulder, Colorado, USA[pdf]
Taraka Rama, Karthik Gali and Avinesh PVS. 2008 Does Syntactic Knowledge help English-Hindi SMT ? Procedings of the NLP Tools contest, ICON 2008.[pdf]