Linguistic Survey of India Sammanfattning Resurstyp Corpus Språk engelska Tokens 1 193 437 Sentences 72 669 Ladda ned lsi.xml.bz2 corpus (XML) licens: CC BY 4.0 (attribution)