Seung-Cheol Lee
2022
Using Sentence-level Classification Helps Entity Extraction from Material Science Literature
Ankan Mullick
|
Shubhraneel Pal
|
Tapas Nayak
|
Seung-Cheol Lee
|
Satadeep Bhattacharjee
|
Pawan Goyal
Proceedings of the Thirteenth Language Resources and Evaluation Conference
In the last few years, several attempts have been made on extracting information from material science research domain. Material Science research articles are a rich source of information about various entities related to material science such as names of the materials used for experiments, the computational software used along with its parameters, the method used in the experiments, etc. But the distribution of these entities is not uniform across different sections of research articles. Most of the sentences in the research articles do not contain any entity. In this work, we first use a sentence-level classifier to identify sentences containing at least one entity mention. Next, we apply the information extraction models only on the filtered sentences, to extract various entities of interest. Our experiments for named entity recognition in the material science research articles show that this additional sentence-level classification step helps to improve the F1 score by more than 4%.