Natural Language Processing
I am enrolled at GSLT as a PhD student since September 16th, 2010. My interests are in computational historical linguistics, creation of language resources, and unsupervised methods in NLP. I am also affiliated with Centre for Language Technology in Göteborg.
I was a research assistant in Digital Areal Linguistics project from 2010 till 2014.
I did my Masters in Technology in CL from IIIT-Hyderabad in 2009 and Batchelors in Technology in ICT from DA-IICT.
My Google Scholar page has full list of publications.
Here is my CV.
I am writing my thesis and am very much interested in taking up a research position by September 2015. I am broadly interested in language evolution and collecting comparative data for larger families and applying computationally intensive methods for modeling language change and investigating questions about language prehistory and contact in South Asia.
I contribute code to the Quantitative historical linguistics project. Clone it here.
A must read for any PhD student
with Søren Wichmann. Jackknifing the black sheep: ASJP classification performance and Austronesian. For the proceedings of the symposium "Let's talk about trees", National Museum of Ethnology, Osaka, Febr. 9-10, 2013. pdf
with Lars Borin. Properties of phoneme N -grams across the world’s language families. pdf
Publications before 2010
A Computational Model of the Phonetic Space and Its Applications. In process
Anil Kumar Singh, Sethuramalingam Subramaniam and Taraka Rama. 2010 Transliteration as Alignment vs. Transliteration as Generation for the Purpose of Crosslingual Information Retrieval. Traitement Automatique des Langues, Special Issue on Multilingualism and NLP. Vol. 51, Number 2. 2010. [pdf][Bibtex]
Taraka Rama, Sudheer Kolachina and Lakshmi Bai B. 2009 Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian. Indian Linguistics, Vol. 70, 2009.[pdf]
Karthik Gali, Sriram Venkatapathy and Taraka Rama. 2009 From Factorial to Quadtratic Time Complexity for Sentence Realization using Nearest Neighbour Algorithm. STIL 2009, Brazil
Taraka Rama, Anil Kumar Singh. 2009 From Bag of Languages to Family Trees from Noisy Corpus. RANLP 2009, Borovets, Bulgaria.[pdf]
Taraka Rama, Karthik Gali. 2009 Modeling Transliteration as a Phrase Based Statistical Machine Translation Problem, NEWS 2009, ACL-IJCNLP 2009, Singapore [pdf]
Taraka Rama, Anil Kumar Singh and Sudheer Kolachina. 2009 Modeling Letter to Phoneme Conversion as a Phrase Based Statistical Machine Translation Problem with Minimum Error Rate Training NAACL HLT 2009 Student Research Workshop, Boulder, Colorado, USA[pdf]
Taraka Rama, Karthik Gali and Avinesh PVS. 2008 Does Syntactic Knowledge help English-Hindi SMT ? Procedings of the NLP Tools contest, ICON 2008.[pdf]
I maintain this page on references to Computational historical linguistics.