Rina Miyata
2025
Unsupervised Sentence Readability Estimation Based on Parallel Corpora for Text Simplification
Rina Miyata
|
Toru Urakawa
|
Hideaki Tamori
|
Tomoyuki Kajiwara
Proceedings of the 20th Workshop on Innovative Use of NLP for Building Educational Applications (BEA 2025)
We train a relative sentence readability estimator from a corpus without absolute sentence readability.Since sentence readability depends on the reader’s knowledge, objective and absolute readability assessments require costly annotation by experts.Therefore, few corpora have absolute sentence readability, while parallel corpora for text simplification with relative sentence readability between two sentences are available for many languages.With multilingual applications in mind, we propose a method to estimate relative sentence readability based on parallel corpora for text simplification.Experimental results on ranking a set of English sentences by readability show that our method outperforms existing unsupervised methods and is comparable to supervised methods based on absolute sentence readability.
EhiMeNLP at TSAR 2025 Shared Task Candidate Generation via Iterative Simplification and Reranking by Readability and Semantic Similarity
Rina Miyata
|
Koki Horiguchi
|
Risa Kondo
|
Yuki Fujiwara
|
Tomoyuki Kajiwara
Proceedings of the Fourth Workshop on Text Simplification, Accessibility and Readability (TSAR 2025)
We introduce the EhiMeNLP submission, which won the TSAR 2025 Shared Task on Readability-Controlled Text Simplification. Our system employed a two-step strategy of candidate generation and reranking. For candidate generation, we simplified the given text into more readable versions by combining multiple large language models with prompts. Then, for reranking, we selected the best candidate by readability-based filtering and ranking based on semantic similarity to the original text.
Search
Fix author
Co-authors
- Tomoyuki Kajiwara 2
- Yuki Fujiwara 1
- Koki Horiguchi 1
- Risa Kondo 1
- Hideaki Tamori 1
- show all...