Multimodal Chemical Structure-Text Coreference in Intellectual Property via Rule-guided Reinforcement Learning

Hanmeng Zhong, Wentao Wu, Linqing Chen, Peng Zhou


Abstract
Navigating biopharmaceutical intellectual property necessitates precisely associating visual chemical structures with their textual referents across lengthy documents. Despite its critical role in drug discovery, this multimodal coreference task remains underexplored. It presents unique challenges, including handling Markush structures and distinguishing the atom-level differences between adjacent structures. To bridge this gap, we define the multimodal Chemical Structure-Text coreference and introduce CheST, the first dataset explicitly designed for the task. Furthermore, to satisfy the strict logical consistency in the task, we propose RULER, a RULE-guided multimodal Reinforcement learning framework built upon an SFT cold start. RULER utilizes rule-driven reward functions operationalizing multidimensional consistencies, acting as a domain-specific "verifier" to obtain the correct domain knowledge. Experimental results demonstrate that RULER achieves a 40% improvement over the strongest baseline–Gemini-2.5-Pro, demonstrating the superior efficacy.
Anthology ID:
2026.findings-acl.1489
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
29784–29796
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1489/
DOI:
Bibkey:
Cite (ACL):
Hanmeng Zhong, Wentao Wu, Linqing Chen, and Peng Zhou. 2026. Multimodal Chemical Structure-Text Coreference in Intellectual Property via Rule-guided Reinforcement Learning. In Findings of the Association for Computational Linguistics: ACL 2026, pages 29784–29796, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
Multimodal Chemical Structure-Text Coreference in Intellectual Property via Rule-guided Reinforcement Learning (Zhong et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1489.pdf
Checklist:
 2026.findings-acl.1489.checklist.pdf