Skip to main content
Språkbanken Text is a part of Språkbanken.

COCTAILL

Standard reference Information

Data citation Information

Elena Volodina, & Ildikó Pilán (2025). COCTAILL (updated: 2025-01-19). [Data set]. Språkbanken Text. https://doi.org/10.23695/v8xs-p564
BibTeX Additional ways to cite the dataset.
Corpus of coursebooks used for teaching L2 Swedish. Annotated manually for text structure and pedagogical/didactical categories; automatically linguistically annotated. See more here https://spraakbanken.gu.se/forskning/teman/icall/icall-l2-projects/l2-data

Caveats

Full texts are not available due to copyright restrictions.

Accessible through

Access Platform Licence
CC BY 4.0
attribution

Download

File Size Modified Licence
coctaill.xml.bz2
this file contains a scrambled version of the corpus Information (XML)
16.57 MB 2017-10-30 CC BY 4.0
attribution
stats_COCTAILL.txt
Word statistics: Information (CSV)
3.03 MB 2017-11-05 CC BY 4.0
attribution

Collection

Type

  • Corpus

Language

Swedish

Size

Sentences: 89,086
Tokens: 710,251

Keywords

  • course books texts
  • language learning
  • Swedish as a second language

Creators

  • Elena Volodina
  • Ildikó Pilán

Updated

2025-01-19

Contact

Språkbanken
sb-info@svenska.gu.se