Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese
Jingshen Zhang, Xinglu Chen, Xinying Qiu, Zhimin Wang, Wenhe Feng
Abstract
“Chinese sentence simplification faces challenges due to the lack of large-scale labeledparallel corpora and the prevalence of idioms. To address these challenges, we pro-pose Readability-guided Idiom-aware Sentence Simplification (RISS), a novel frameworkthat combines data augmentation techniques. RISS introduces two key components: (1)Readability-guided Paraphrase Selection (RPS), a method for mining high-quality sen-tence pairs, and (2) Idiom-aware Simplification (IAS), a model that enhances the compre-hension and simplification of idiomatic expressions. By integrating RPS and IAS usingmulti-stage and multi-task learning strategies, RISS outperforms previous state-of-the-artmethods on two Chinese sentence simplification datasets. Furthermore, RISS achievesadditional improvements when fine-tuned on a small labeled dataset. Our approachdemonstrates the potential for more effective and accessible Chinese text simplification.”- Anthology ID:
- 2024.ccl-1.92
- Volume:
- Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 1: Main Conference)
- Month:
- July
- Year:
- 2024
- Address:
- Taiyuan, China
- Editors:
- Sun Maosong, Liang Jiye, Han Xianpei, Liu Zhiyuan, He Yulan
- Venue:
- CCL
- SIG:
- Publisher:
- Chinese Information Processing Society of China
- Note:
- Pages:
- 1183–1200
- Language:
- English
- URL:
- https://preview.aclanthology.org/author-degibert/2024.ccl-1.92/
- DOI:
- Cite (ACL):
- Jingshen Zhang, Xinglu Chen, Xinying Qiu, Zhimin Wang, and Wenhe Feng. 2024. Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese. In Proceedings of the 23rd Chinese National Conference on Computational Linguistics (Volume 1: Main Conference), pages 1183–1200, Taiyuan, China. Chinese Information Processing Society of China.
- Cite (Informal):
- Readability-guided Idiom-aware Sentence Simplification (RISS) for Chinese (Zhang et al., CCL 2024)
- PDF:
- https://preview.aclanthology.org/author-degibert/2024.ccl-1.92.pdf