Semantic Networks Extracted from Students’ Think-Aloud Data are Correlated with Students’ Learning Performance
Pingjing Yang, Sullam Jeoung, Jennifer Cromley, Jana Diesner
Abstract
When students reflect on their learning from a textbook via think-aloud processes, network representations can be used to capture the concepts and relations from these data. What can we learn from the resulting network representations about students’ learning processes, knowledge acquisition, and learning outcomes? This study brings methods from entity and relation extraction using classic and LLM-based methods to the application domain of educational psychology. We built a ground-truth baseline of relational data that represents relevant (to educational science), textbook-based information as a semantic network. Among the tested models, SPN4RE and LUKE achieved the best performance in extracting concepts and relations from students’ verbal data. Network representations of students’ verbalizations varied in structure, reflecting different learning processes. Correlating the students’ semantic networks with learning outcomes revealed that denser and more interconnected semantic networks were associated with more elaborated knowledge acquisition. Structural features such as the number of edges and surface overlap with textbook networks significantly correlated with students’ posttest performance.- Anthology ID:
- 2025.emnlp-main.1309
- Volume:
- Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
- Month:
- November
- Year:
- 2025
- Address:
- Suzhou, China
- Editors:
- Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
- Venue:
- EMNLP
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 25802–25815
- Language:
- URL:
- https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1309/
- DOI:
- Cite (ACL):
- Pingjing Yang, Sullam Jeoung, Jennifer Cromley, and Jana Diesner. 2025. Semantic Networks Extracted from Students’ Think-Aloud Data are Correlated with Students’ Learning Performance. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 25802–25815, Suzhou, China. Association for Computational Linguistics.
- Cite (Informal):
- Semantic Networks Extracted from Students’ Think-Aloud Data are Correlated with Students’ Learning Performance (Yang et al., EMNLP 2025)
- PDF:
- https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1309.pdf