Nicholas A. Miller
2019
DEFT: A corpus for definition extraction in free- and semi-structured text
Sasha Spala
|
Nicholas A. Miller
|
Yiming Yang
|
Franck Dernoncourt
|
Carl Dockhorn
Proceedings of the 13th Linguistic Annotation Workshop
Definition extraction has been a popular topic in NLP research for well more than a decade, but has been historically limited to well-defined, structured, and narrow conditions. In reality, natural language is messy, and messy data requires both complex solutions and data that reflects that reality. In this paper, we present a robust English corpus and annotation schema that allows us to explore the less straightforward examples of term-definition structures in free and semi-structured text.