Tree-CoT-RT: An Explainable Multi-Path Tree-Guided Chain-of-Thought and Reinforcement Learning Framework for Aspect Sentiment Quad Prediction
Hao Zhang, Jiahao Wang, Zhenke Duan, Xin Yin, Haichuan Hu, Hualong Chen, Suyi, Congqing He, Yike Tan, Yu-N Cheah
Abstract
Aspect Sentiment Quad Prediction (ASQP) is a fundamental yet challenging task in fine-grained sentiment analysis, particularly when aspects or opinions are implicit. Existing methods often lack explainability and generalization, making it difficult to justify inference decisions and to detect implicit sentiment across domains and varied expression patterns. To address these limitations, we propose Tree-CoT-RT, an explainable multi-path tree-guided chain-of-thought and reinforcement learning framework specifically designed for ASQP. The core idea is to use sentiment tree structures to design type-specific reasoning templates that guide LLMs in generating explainable chains, including both final sentiment quadruples and intermediate inference steps for transparent implicit reasoning. However, the generated reasoning chains often vary in quality and may contain logical inconsistencies. To mitigate this, we introduce a reinforcement learning strategy with a rule-based reward function to generate high-quality reasoning traces, which are then used to fine-tune the LLM and enable controlled sampling. Experiments on benchmark datasets demonstrate that Tree-CoT-RT substantially outperforms strong baselines, particularly in scenarios involving implicit sentiment analysis.
- Anthology ID:
- 2026.findings-acl.806
- Volume:
- Findings of the Association for Computational Linguistics: ACL 2026
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- Findings
- Publisher:
- Association for Computational Linguistics
- Pages:
- 16372–16391
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.806/
- Cite (ACL):
- Hao Zhang, Jiahao Wang, Zhenke Duan, Xin Yin, Haichuan Hu, Hualong Chen, Suyi, Congqing He, Yike Tan, and Yu-N Cheah. 2026. Tree-CoT-RT: An Explainable Multi-Path Tree-Guided Chain-of-Thought and Reinforcement Learning Framework for Aspect Sentiment Quad Prediction. In Findings of the Association for Computational Linguistics: ACL 2026, pages 16372–16391, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- Tree-CoT-RT: An Explainable Multi-Path Tree-Guided Chain-of-Thought and Reinforcement Learning Framework for Aspect Sentiment Quad Prediction (Zhang et al., Findings 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.findings-acl.806.pdf
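
The abstract mentions a rule-based reward function used to score generated reasoning traces, but gives no details of its rules. As a rough illustration only, the sketch below shows one way such a reward could be computed for ASQP outputs: it checks that the output contains well-formed quadruples and then scores them against gold quadruples with F1. Everything here is an assumption made for illustration and is not taken from the paper: the `(aspect, category, opinion, sentiment)` text format, the sentiment label set, the `format_bonus` weight, and the function names `parse_quads` and `rule_based_reward`.

```python
import re

# Assumed label set; the paper's actual label inventory is not given in the abstract.
VALID_SENTIMENTS = {"positive", "negative", "neutral"}

# Matches any parenthesised group that contains no nested parentheses.
QUAD_PATTERN = re.compile(r"\(([^()]*)\)")


def parse_quads(text):
    """Extract well-formed (aspect, category, opinion, sentiment) tuples.

    Groups that do not have exactly four comma-separated fields, or whose
    last field is not a known sentiment label, are ignored.
    """
    quads = set()
    for body in QUAD_PATTERN.findall(text):
        parts = [p.strip().lower() for p in body.split(",")]
        if len(parts) == 4 and parts[3] in VALID_SENTIMENTS:
            quads.add(tuple(parts))
    return quads


def rule_based_reward(output, gold_quads, format_bonus=0.2):
    """Hypothetical reward: a format term plus an F1 term against gold quads.

    Returns 0.0 when no well-formed quadruple is found, otherwise
    format_bonus + (1 - format_bonus) * F1.
    """
    predicted = parse_quads(output)
    if not predicted:
        return 0.0
    gold = {tuple(field.strip().lower() for field in q) for q in gold_quads}
    correct = len(predicted & gold)
    precision = correct / len(predicted)
    recall = correct / len(gold) if gold else 0.0
    denom = precision + recall
    f1 = 2 * precision * recall / denom if denom else 0.0
    return format_bonus + (1 - format_bonus) * f1


if __name__ == "__main__":
    gold = [("food", "food quality", "NULL", "positive")]
    trace = (
        "Step 1: no explicit opinion word, so the opinion is implicit. "
        "Answer: (food, food quality, NULL, positive)"
    )
    print(rule_based_reward(trace, gold))
```

In this sketch an implicit opinion is represented by the placeholder string `NULL`, which is a common convention in ASQP datasets but is assumed here rather than confirmed by the source. A reward of this shape could be used to rank or filter sampled reasoning chains before fine-tuning.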