Abstract
This paper describes a personalized text retrieval algorithm that helps language learners select the most suitable reading material in terms of vocabulary complexity. The user first rates their knowledge of a small set of words, chosen by a graph-based active learning model. The system trains a complex word identification model on this set, and then applies the model to find texts that contain the desired proportion of new, challenging, and familiar vocabulary. In an evaluation on learners of Chinese as a foreign language, we show that this algorithm is effective in identifying simpler texts for low-proficiency learners, and more challenging ones for high-proficiency learners.- Anthology ID:
- C18-1292
- Volume:
- Proceedings of the 27th International Conference on Computational Linguistics
- Month:
- August
- Year:
- 2018
- Address:
- Santa Fe, New Mexico, USA
- Venue:
- COLING
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 3448–3455
- Language:
- URL:
- https://aclanthology.org/C18-1292
- DOI:
- Cite (ACL):
- Chak Yan Yeung and John Lee. 2018. Personalized Text Retrieval for Learners of Chinese as a Foreign Language. In Proceedings of the 27th International Conference on Computational Linguistics, pages 3448–3455, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
- Cite (Informal):
- Personalized Text Retrieval for Learners of Chinese as a Foreign Language (Yeung & Lee, COLING 2018)
- PDF:
- https://preview.aclanthology.org/nodalida-main-page/C18-1292.pdf