@inproceedings{li-etal-2023-comparing,
    title = "Comparing and Predicting Eye-tracking Data of {M}andarin and {C}antonese",
    author = "Li, Junlin  and
      Peng, Bo  and
      Hsu, Yu-yin  and
      Chersoni, Emmanuele",
    editor = {Scherrer, Yves  and
      Jauhiainen, Tommi  and
      Ljube{\v{s}}i{\'c}, Nikola  and
      Nakov, Preslav  and
      Tiedemann, J{\"o}rg  and
      Zampieri, Marcos},
    booktitle = "Tenth Workshop on NLP for Similar Languages, Varieties and Dialects (VarDial 2023)",
    month = may,
    year = "2023",
    address = "Dubrovnik, Croatia",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2023.vardial-1.12/",
    doi = "10.18653/v1/2023.vardial-1.12",
    pages = "121--132",
    abstract = "Eye-tracking data in Chinese languages present unique challenges due to the non-alphabetic and unspaced nature of the Chinese writing systems. This paper introduces the first deeply-annotated joint Mandarin-Cantonese eye-tracking dataset, from which we achieve a unified eye-tracking prediction system for both language varieties. In addition to the commonly studied first fixation duration and the total fixation duration, this dataset also includes the second fixation duration, expressing fixation patterns that are more relevant to higher-level, structural processing. A basic comparison of the features and measurements in our dataset revealed variation between Mandarin and Cantonese on fixation patterns related to word class and word position. The test of feature usefulness suggested that traditional features are less powerful in predicting the second-pass fixation, to which the linear distance to root makes a leading contribution in Mandarin. In contrast, Cantonese eye-movement behavior relies more on word position and part of speech."
}Markdown (Informal)
[Comparing and Predicting Eye-tracking Data of Mandarin and Cantonese](https://preview.aclanthology.org/ingest-emnlp/2023.vardial-1.12/) (Li et al., VarDial 2023)
ACL