Measurement Extraction with Natural Language Processing: A Review
Jan Göpfert, Patrick Kuckertz, Jann Weinand, Leander Kotzur, Detlef Stolten
Abstract
Quantitative data is important in many domains. Information extraction methods draw structured data from documents. However, the extraction of quantities and their contexts has received little attention in the history of information extraction. In this review, an overview of prior work on measurement extraction is presented. We describe different approaches to measurement extraction and outline the challenges posed by this task. The review concludes with an outline of potential future research. Research strains in measurement extraction tend to be isolated and lack a common terminology. Improvements in numerical reasoning, more extensive datasets, and the consideration of wider contexts may lead to significant improvements in measurement extraction.- Anthology ID:
- 2022.findings-emnlp.161
- Volume:
- Findings of the Association for Computational Linguistics: EMNLP 2022
- Month:
- December
- Year:
- 2022
- Address:
- Abu Dhabi, United Arab Emirates
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 2191–2215
- Language:
- URL:
- https://aclanthology.org/2022.findings-emnlp.161
- DOI:
- 10.18653/v1/2022.findings-emnlp.161
- Cite (ACL):
- Jan Göpfert, Patrick Kuckertz, Jann Weinand, Leander Kotzur, and Detlef Stolten. 2022. Measurement Extraction with Natural Language Processing: A Review. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 2191–2215, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
- Cite (Informal):
- Measurement Extraction with Natural Language Processing: A Review (Göpfert et al., Findings 2022)
- PDF:
- https://preview.aclanthology.org/remove-xml-comments/2022.findings-emnlp.161.pdf