PUCP-Metrix: An Open-source and Comprehensive Toolkit for Linguistic Analysis of Spanish Texts
Javier Alonso Villegas Luis, Marco Antonio Sobrevilla Cabezudo
Abstract
Linguistic features remain essential for interpretability and tasks that involve style, structure, and readability, but existing Spanish tools offer limited coverage. We present PUCPMetrix, an open-source and comprehensive toolkit for linguistic analysis of Spanish texts. PUCP-Metrix includes 182 linguistic metrics spanning lexical diversity, syntactic and semantic complexity, cohesion, psycholinguistics, and readability. It enables fine-grained, interpretable text analysis. We evaluate its usefulness on Automated Readability Assessment and Machine-Generated Text Detection, showing competitive performance compared to an existing repository and strong neural baselines. PUCP-Metrix offers a comprehensive and extensible resource for Spanish, supporting diverse NLP applications.- Anthology ID:
- 2026.eacl-demo.28
- Volume:
- Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations)
- Month:
- March
- Year:
- 2026
- Address:
- Rabat, Marocco
- Editors:
- Danilo Croce, Jochen Leidner, Nafise Sadat Moosavi
- Venue:
- EACL
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 407–416
- Language:
- URL:
- https://preview.aclanthology.org/ingest-eacl/2026.eacl-demo.28/
- DOI:
- Cite (ACL):
- Javier Alonso Villegas Luis and Marco Antonio Sobrevilla Cabezudo. 2026. PUCP-Metrix: An Open-source and Comprehensive Toolkit for Linguistic Analysis of Spanish Texts. In Proceedings of the 19th Conference of the European Chapter of the Association for Computational Linguistics (Volume 3: System Demonstrations), pages 407–416, Rabat, Marocco. Association for Computational Linguistics.
- Cite (Informal):
- PUCP-Metrix: An Open-source and Comprehensive Toolkit for Linguistic Analysis of Spanish Texts (Luis & Cabezudo, EACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-eacl/2026.eacl-demo.28.pdf