@inproceedings{gamback-das-2016-comparing,
    title = "Comparing the Level of Code-Switching in Corpora",
    author = {Gamb{\"a}ck, Bj{\"o}rn  and
      Das, Amitava},
    editor = "Calzolari, Nicoletta  and
      Choukri, Khalid  and
      Declerck, Thierry  and
      Goggi, Sara  and
      Grobelnik, Marko  and
      Maegaard, Bente  and
      Mariani, Joseph  and
      Mazo, Helene  and
      Moreno, Asuncion  and
      Odijk, Jan  and
      Piperidis, Stelios",
    booktitle = "Proceedings of the Tenth International Conference on Language Resources and Evaluation ({LREC}'16)",
    month = may,
    year = "2016",
    address = "Portoro{\v{z}}, Slovenia",
    publisher = "European Language Resources Association (ELRA)",
    url = "https://preview.aclanthology.org/ingest-emnlp/L16-1292/",
    pages = "1850--1855",
    abstract = "Social media texts are often fairly informal and conversational, and when produced by bilinguals tend to be written in several different languages simultaneously, in the same way as conversational speech. The recent availability of large social media corpora has thus also made large-scale code-switched resources available for research. The paper addresses the issues of evaluation and comparison these new corpora entail, by defining an objective measure of corpus level complexity of code-switched texts. It is also shown how this formal measure can be used in practice, by applying it to several code-switched corpora."
}Markdown (Informal)
[Comparing the Level of Code-Switching in Corpora](https://preview.aclanthology.org/ingest-emnlp/L16-1292/) (Gambäck & Das, LREC 2016)
ACL
- Björn Gambäck and Amitava Das. 2016. Comparing the Level of Code-Switching in Corpora. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1850–1855, Portorož, Slovenia. European Language Resources Association (ELRA).