Readability Reconsidered A Cross-Dataset Analysis of Reference-Free Metrics
Catarina Belem, Parker Glenn, Alfy Samuel, Anoop Kumar, Daben Liu
Abstract
Automatic readability assessment plays a key role in ensuring effective communication between humans and language models. Despite significant progress the field is hindered by inconsistent definitions of readability and measurements that rely on surface-level text properties. In this work we investigate the factors shaping human perceptions of readability through the analysis of 1.2k judgments finding that beyond surface-level cues information content and topic strongly shape text comprehensibility. Furthermore we evaluate 15 popular readability metrics across 5 datasets contrasting them with 5 more nuanced model-based metrics. Our results show that four model-based metrics consistently place among the top 4 in rank correlations with human judgments while the best performing traditional metric achieves an average rank of 7.8. These findings highlight a mismatch between current readability metrics and human perceptions pointing to model-based approaches as a more promising direction.- Anthology ID:
- 2025.tsar-1.4
- Volume:
- Proceedings of the Fourth Workshop on Text Simplification, Accessibility and Readability (TSAR 2025)
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Matthew Shardlow, Fernando Alva-Manchego, Kai North, Regina Stodden, Horacio Saggion, Nouran Khallaf, Akio Hayakawa
- Venues:
- TSAR | WS
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 47–69
- Language:
- URL:
- https://preview.aclanthology.org/ingest-emnlp/2025.tsar-1.4/
- DOI:
- Cite (ACL):
- Catarina Belem, Parker Glenn, Alfy Samuel, Anoop Kumar, and Daben Liu. 2025. Readability Reconsidered A Cross-Dataset Analysis of Reference-Free Metrics. In Proceedings of the Fourth Workshop on Text Simplification, Accessibility and Readability (TSAR 2025), pages 47–69, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- Readability Reconsidered A Cross-Dataset Analysis of Reference-Free Metrics (Belem et al., TSAR 2025)
- PDF:
- https://preview.aclanthology.org/ingest-emnlp/2025.tsar-1.4.pdf