MolSafeEval: A Benchmark for Uncovering Safety Risks in AI-Generated Molecules

Tong Xu, Xinzhe Cao, Zhihui Zhu, Keyan Ding, Huajun Chen


Abstract
Current molecular generation benchmarks emphasize task complexity, molecule novelty, and property alignment; they largely overlook a critical concern: the potential safety risks of AI-generated molecules. In practice, many generative models may produce molecules with toxic, reactive, or otherwise hazardous characteristics—posing hidden dangers that remain insufficiently addressed. To address this gap, we introduce MolSafeEval, a benchmark dedicated to evaluating and analyzing the safety risks of molecular generation. Unlike prior approaches that rely on narrow toxicity predictors, MolSafeEval integrates heterogeneous safety knowledge—ranging from toxicological databases to hazard rules—into a structured molecular safety knowledge graph. This graph serves as a foundation for large language model–based reasoning, enabling systematic detection and explanation of unsafe features in generated compounds. We further categorize molecular generative models into four representative task types—unconditional generation, property optimization, target protein–based design, and text-based generation—and provide standardized datasets and safety evaluation protocols for each.
Anthology ID:
2026.findings-acl.1679
Volume:
Findings of the Association for Computational Linguistics: ACL 2026
Month:
July
Year:
2026
Address:
San Diego, California, United States
Editors:
Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:
Findings
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
33621–33648
Language:
URL:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1679/
DOI:
Bibkey:
Cite (ACL):
Tong Xu, Xinzhe Cao, Zhihui Zhu, Keyan Ding, and Huajun Chen. 2026. MolSafeEval: A Benchmark for Uncovering Safety Risks in AI-Generated Molecules. In Findings of the Association for Computational Linguistics: ACL 2026, pages 33621–33648, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):
MolSafeEval: A Benchmark for Uncovering Safety Risks in AI-Generated Molecules (Xu et al., Findings 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1679.pdf
Checklist:
 2026.findings-acl.1679.checklist.pdf