Structural Reasoning Improves Molecular Understanding of LLM

Yunhui Jang, Jaehyung Kim, Sungsoo Ahn


Abstract
Recently, large language models (LLMs) have shown significant progress, approaching human perception levels. In this work, we demonstrate that despite these advances, LLMs still struggle to reason using molecular structural information. This gap is critical because many molecular properties, including functional groups, depend heavily on such structural details. To address this limitation, we propose an approach that sketches molecular structures for reasoning. Specifically, we introduce Molecular Structural Reasoning (MSR) framework to enhance the understanding of LLMs by explicitly incorporating the key structural features. We present two frameworks for scenarios where the target molecule is known or unknown. We verify that our MSR improves molecular understanding through extensive experiments.
Anthology ID:
2025.acl-long.1023
Volume:
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month:
July
Year:
2025
Address:
Vienna, Austria
Editors:
Wanxiang Che, Joyce Nabende, Ekaterina Shutova, Mohammad Taher Pilehvar
Venue:
ACL
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
21016–21036
Language:
URL:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-long.1023/
DOI:
Bibkey:
Cite (ACL):
Yunhui Jang, Jaehyung Kim, and Sungsoo Ahn. 2025. Structural Reasoning Improves Molecular Understanding of LLM. In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 21016–21036, Vienna, Austria. Association for Computational Linguistics.
Cite (Informal):
Structural Reasoning Improves Molecular Understanding of LLM (Jang et al., ACL 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-acl-25/2025.acl-long.1023.pdf