ScanEZ: Integrating Cognitive Models with Self-Supervised Learning for Spatiotemporal Scanpath Prediction

Ekta Sood, Prajit Dhar, Enrica Troiano, Rosy Southwell, Sidney K. D’Mello


Abstract
Accurately predicting human scanpaths during reading is vital for diverse fields and downstream tasks, from educational technologies to automatic question answering. To date, however, progress in this direction remains limited by scarce gaze data. We overcome the issue with ScanEZ, a self-supervised framework grounded in cognitive models of reading. ScanEZ jointly models the spatial and temporal dimensions of scanpaths by leveraging synthetic data and a 3-D gaze objective inspired by masked language modeling. With this framework, we provide evidence that two key factors in scanpath prediction during reading are: the use of masked modeling of both spatial and temporal patterns of eye movements, and cognitive model simulations as an inductive bias to kick-start training. Our approach achieves state-of-the-art results on established datasets (e.g., up to 31.4% negative log-likelihood improvement on CELER L1), and proves portable across different experimental conditions.
Anthology ID:
2025.acl-short.89
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
1132–1142
Language:
URL:
https://preview.aclanthology.org/corrections-2025-10/2025.acl-short.89/
DOI:
10.18653/v1/2025.acl-short.89
Bibkey:
Cite (ACL):
Ekta Sood, Prajit Dhar, Enrica Troiano, Rosy Southwell, and Sidney K. D’Mello. 2025. ScanEZ: Integrating Cognitive Models with Self-Supervised Learning for Spatiotemporal Scanpath Prediction. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 1132–1142, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
ScanEZ: Integrating Cognitive Models with Self-Supervised Learning for Spatiotemporal Scanpath Prediction (Sood et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/corrections-2025-10/2025.acl-short.89.pdf