FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction

Zhaohan Meng, Zaiqiao Meng, Ke Yuan, Iadh Ounis


Abstract
Predicting drug-target interaction (DTI) is critical in the drug discovery process. Despite remarkable advances in recent DTI models through the integration of representations from diverse drug and target encoders, such models often struggle to capture the fine-grained interactions between drugs and protein, i.e. the binding of specific drug atoms (or substructures) and key amino acids of proteins, which is crucial for understanding the binding mechanisms and optimising drug design. To address this issue, this paper introduces a novel model, called FusionDTI, which uses a token-level **Fusion** module to effectively learn fine-grained information for **D**rug-**T**arget **I**nteraction. In particular, our FusionDTI model uses the SELFIES representation of drugs to mitigate sequence fragment invalidation and incorporates the structure-aware (SA) vocabulary of target proteins to address the limitation of amino acid sequences in structural information, additionally leveraging pre-trained language models extensively trained on large-scale biomedical datasets as encoders to capture the complex information of drugs and targets. Experiments on three well-known benchmark datasets show that our proposed FusionDTI model achieves the best performance in DTI prediction compared with eight existing state-of-the-art baselines. Furthermore, our case study indicates that FusionDTI could highlight the potential binding sites, enhancing the explainability of the DTI prediction.
Anthology ID:
2025.findings-emnlp.237
Volume:
Findings of the Association for Computational Linguistics: EMNLP 2025
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
4425–4444
Language:
URL:
https://preview.aclanthology.org/name-variant-enfa-fane/2025.findings-emnlp.237/
DOI:
10.18653/v1/2025.findings-emnlp.237
Bibkey:
Cite (ACL):
Zhaohan Meng, Zaiqiao Meng, Ke Yuan, and Iadh Ounis. 2025. FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction. In Findings of the Association for Computational Linguistics: EMNLP 2025, pages 4425–4444, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
FusionDTI: Fine-grained Binding Discovery with Token-level Fusion for Drug-Target Interaction (Meng et al., Findings 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/name-variant-enfa-fane/2025.findings-emnlp.237.pdf
Checklist:
 2025.findings-emnlp.237.checklist.pdf