Sentence Complexity in Context

Benedetta Iavarone, Dominique Brunato, Felice Dell’Orletta


Abstract
We study the influence of context on how humans evaluate the complexity of a sentence in English. We collect a new dataset of sentences, where each sentence is rated for perceived complexity within different contextual windows. We carry out an in-depth analysis to detect which linguistic features correlate more with complexity judgments and with the degree of agreement among annotators. We train several regression models, using either explicit linguistic features or contextualized word embeddings, to predict the mean complexity values assigned to sentences in the different contextual windows, as well as their standard deviation. Results show that models leveraging explicit features capturing morphosyntactic and syntactic phenomena perform always better, especially when they have access to features extracted from all contextual sentences.
Anthology ID:
2021.cmcl-1.23
Volume:
Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics
Month:
June
Year:
2021
Address:
Online
Editors:
Emmanuele Chersoni, Nora Hollenstein, Cassandra Jacobs, Yohei Oseki, Laurent Prévot, Enrico Santus
Venue:
CMCL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
186–199
Language:
URL:
https://aclanthology.org/2021.cmcl-1.23
DOI:
10.18653/v1/2021.cmcl-1.23
Bibkey:
Cite (ACL):
Benedetta Iavarone, Dominique Brunato, and Felice Dell’Orletta. 2021. Sentence Complexity in Context. In Proceedings of the Workshop on Cognitive Modeling and Computational Linguistics, pages 186–199, Online. Association for Computational Linguistics.
Cite (Informal):
Sentence Complexity in Context (Iavarone et al., CMCL 2021)
Copy Citation:
PDF:
https://preview.aclanthology.org/improve-issue-templates/2021.cmcl-1.23.pdf
Optional supplementary code:
 2021.cmcl-1.23.OptionalSupplementaryCode.zip