PIEKM: ML-based Procedural Information Extraction and Knowledge Management System for Materials Science Literature

Huichen Yang, Carlos Aguirre, William Hsu


Abstract
The published materials science literature contains abundant description information about synthesis procedures that can help discover new material areas, deepen the study of materials synthesis, and accelerate its automated planning. Nevertheless, this information is expressed in unstructured text, and manually processing and assimilating useful information is expensive and time-consuming for researchers. To address this challenge, we develop a Machine Learning-based procedural information extraction and knowledge management system (PIEKM) that extracts procedural information recipe steps, figures, and tables from materials science articles, and provides information retrieval capability and the statistics visualization functionality. Our system aims to help researchers to gain insights and quickly understand the connections among massive data. Moreover, we demonstrate that the machine learning-based system performs well in low-resource scenarios (i.e., limited annotated data) for domain adaption.
Anthology ID:
2022.aacl-demo.7
Volume:
Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing: System Demonstrations
Month:
November
Year:
2022
Address:
Taipei, Taiwan
Venues:
AACL | IJCNLP
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
57–62
Language:
URL:
https://aclanthology.org/2022.aacl-demo.7
DOI:
Bibkey:
Cite (ACL):
Huichen Yang, Carlos Aguirre, and William Hsu. 2022. PIEKM: ML-based Procedural Information Extraction and Knowledge Management System for Materials Science Literature. In Proceedings of the 2nd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics and the 12th International Joint Conference on Natural Language Processing: System Demonstrations, pages 57–62, Taipei, Taiwan. Association for Computational Linguistics.
Cite (Informal):
PIEKM: ML-based Procedural Information Extraction and Knowledge Management System for Materials Science Literature (Yang et al., AACL-IJCNLP 2022)
Copy Citation:
PDF:
https://preview.aclanthology.org/emnlp-22-attachments/2022.aacl-demo.7.pdf