Site map

Taraka Rama

Taraka Rama

Contact information

Taraka Rama

Språkbanken, Göteborgs universitet, Box 200
405 30 Göteborg

Visiting address:
Lennart Torstenssonsgatan 8

Mobile:
070 4194962

Contact form

PhD Student


 

I am enrolled in GSLT. My interests are in creation of language resources, unsupervised methods and computational historical linguistics. I am also a member with CLT

I worked on cognate identification and phylogeny in my Masters' thesis

I am a member of ASJP Consortium.

My first supervisor is Lars Borin.
My second supervisor is Søren Wichmann

I spent the spring of 2012 at Max Planck Institute for Evolutionary Anthropology, Leipzig.

I spent two months (March and April of 2013) as a Guest Researcher at Alpha-Informatica

There is an interesting open-source NLP toolkit for South Asian Languages Sanchay (means a Collection in Sanskrit).

A very honest PhD thesis

Presentations

Poster in COLING 2012, IIT Mumbai[pdf]

ESSLI workshop on computational approaches to typological and dialectogical variation. 2012 [pdf]

CLT fall seminar. 2012 [pdf]

SLTC talk at Lund. 2012 [pdf]

First year at GSLT. 2011 [pdf]

Taraka Rama, Sudheer Kolachina. Distance-based algorithms in the subgrouping of Dravidian languages. Workshop on comparing approaches to measuring linguistic differences. 2011 [pdf]

Søren Wichmann, Taraka Rama, Eric W. Holman. Phonological diversity, Mean word length and Population Sizes across worlds' languages. CLT Retreat 2011 [pdf]

Sudheer Kolachina, Taraka Rama. Revisiting Unchanged Cognates as criterion in Linguistic Subgrouping. ICHL, Osaka 2011.[pdf]

Taraka Rama, Lars Borin. Estimating language distances from parallel corpus. A study of Europarl corpus NODALIDA 2011. Latvia. [pdf]

Sudheer Kolachina, Taraka Rama, Lakshmi Bai. Maximum Parsimony for subgrouping in Dravidian. QITL, Berlin 2011. [pdf]

Taraka Rama. Explorations in Phoneme N-grams for Automatic Language Classification CLT Seminar, March 2011. [pdf]

Submitted

Two methods for automatic cognate identification.

Comparative evaluation of string similarity measures for automatic language classification.

Accepted

N-gram approaches to the historical dynamics of basic vocabulary.

Rejected

Properties of phoneme N -grams across the world’s language families.

Publications before 2010

A Computational Model of the Phonetic Space and Its Applications. In process

Anil Kumar Singh, Sethuramalingam Subramaniam and Taraka Rama. 2010 Transliteration as Alignment vs. Transliteration as Generation for the Purpose of Crosslingual Information Retrieval. Traitement Automatique des Langues, Special Issue on Multilingualism and NLP. Vol. 51, Number 2. 2010. [pdf][Bibtex]

Taraka Rama, Sudheer Kolachina and Lakshmi Bai B. 2009 Quantitative methods for Phylogenetic Inference in Historical Linguistics: An experimental case study of South Central Dravidian. Indian Linguistics, Vol. 70, 2009.[pdf]

Karthik Gali, Sriram Venkatapathy and Taraka Rama. 2009 From Factorial to Quadtratic Time Complexity for Sentence Realization using Nearest Neighbour Algorithm. STIL 2009, Brazil

Taraka Rama, Anil Kumar Singh. 2009 From Bag of Languages to Family Trees from Noisy Corpus. RANLP 2009, Borovets, Bulgaria.[pdf]

Taraka Rama, Karthik Gali. 2009 Modeling Transliteration as a Phrase Based Statistical Machine Translation Problem, NEWS 2009, ACL-IJCNLP 2009, Singapore [pdf]

Taraka Rama, Anil Kumar Singh and Sudheer Kolachina. 2009 Modeling Letter to Phoneme Conversion as a Phrase Based Statistical Machine Translation Problem with Minimum Error Rate Training NAACL HLT 2009 Student Research Workshop, Boulder, Colorado, USA[pdf]

Taraka Rama, Karthik Gali and Avinesh PVS. 2008 Does Syntactic Knowledge help English-Hindi SMT ? Procedings of the NLP Tools contest, ICON 2008.[pdf]

Resources

I maintain this page on references to Computational historical linguistics. It is improved and worked on everyday. http://lingphylo.wikidot.com/start


Publications

Taraka Rama, Sudheer Kolachina 2013. Distance-based Phylogenetic Inference Algorithms in the Subgrouping of Dravidian Languages Approaches to Measuring Linguistic DifferencesDe Gruyter Mouton [pdf] BibTeX

Taraka Rama 2013. Phonotactic diversity predicts the time depth of the world\'s language families PLoS ONE [pdf] BibTeX

Taraka Rama 2012. N-gram approaches to the historical dynamics of basic vocabulary. Preproceedings of Computational approaches to the study of dialectal and typological variation [pdf] BibTeX

Taraka Rama, Lars Borin 2012. Properties of phoneme N -grams across the world’s language families Proceedings of the Fourth Swedish Language Technology Conference (SLTC) BibTeX

Søren Wichmann, Eric Holman, Taraka Rama, Robert S. Walker 2012. Correlates of linguistic phylogenetic reticulation. Language Dynamics and Change. [pdf] BibTeX

Taraka Rama, Prasanth Kolachina 2012. How good are typological distances for determining genealogical relationships among languages? Proceedings of the 24th International Conference on Computational Linguistics [pdf] BibTeX

Søren Wichmann, Taraka Rama, Eric Holman 2011. Phonological diversity, word length, and population sizes across languages: The ASJP evidence. Linguistic Typology [pdf] BibTeX

Sudheer Kolachina, Taraka Rama, Lakshmi Bai 2011. Maximum parsimony method in the subgrouping of Dravidian languages. Quantitative Investigations in Theoretical Linguistics [pdf] BibTeX

Taraka Rama, Lars Borin 2011. Estimating Language Relationships from a Parallel Corpus. A Study of the Europarl Corpus NEALT Proceedings Series (NODALIDA 2011 Conference Proceedings). 161-167 [pdf] BibTeX

All publications as BibTeX

© University of Gothenburg 2009, Box 100, 405 30 Gothenburg, Sweden
Tel +46 31 786 0000, Contact

About the site

X
Loading