AI for Science in the Era of Large Language Models

Zhenyu Bi, Minghao Xu, Jian Tang, Xuan Wang


Abstract
The capabilities of AI in the realm of science span a wide spectrum, from the atomic level, where it solves partial differential equations for quantum systems, to the molecular level, predicting chemical or protein structures, and even extending to societal predictions like infectious disease outbreaks. Recent advancements in large language models (LLMs), exemplified by models like ChatGPT, have showcased significant prowess in tasks involving natural language, such as translating languages, constructing chatbots, and answering questions. When we consider scientific data, we notice a resemblance to natural language in terms of sequences – scientific literature and health records presented as text, bio-omics data arranged in sequences, or sensor data like brain signals. The question arises: Can we harness the potential of these recent LLMs to drive scientific progress? In this tutorial, we will explore the application of large language models to three crucial categories of scientific data: 1) textual data, 2) biomedical sequences, and 3) brain signals. Furthermore, we will delve into LLMs’ challenges in scientific research, including ensuring trustworthiness, achieving personalization, and adapting to multi-modal data representation.
Anthology ID:
2024.emnlp-tutorials.5
Volume:
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts
Month:
November
Year:
2024
Address:
Miami, Florida, USA
Editors:
Jessy Li, Fei Liu
Venue:
EMNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
32–38
Language:
URL:
https://aclanthology.org/2024.emnlp-tutorials.5
DOI:
10.18653/v1/2024.emnlp-tutorials.5
Bibkey:
Cite (ACL):
Zhenyu Bi, Minghao Xu, Jian Tang, and Xuan Wang. 2024. AI for Science in the Era of Large Language Models. In Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing: Tutorial Abstracts, pages 32–38, Miami, Florida, USA. Association for Computational Linguistics.
Cite (Informal):
AI for Science in the Era of Large Language Models (Bi et al., EMNLP 2024)
Copy Citation:
PDF:
https://preview.aclanthology.org/landing_page/2024.emnlp-tutorials.5.pdf