Venkata Subrahmanyan Govindarajan
2020
The Universal Decompositional Semantics Dataset and Decomp Toolkit
Aaron Steven White | Elias Stengel-Eskin | Siddharth Vashishtha | Venkata Subrahmanyan Govindarajan | Dee Ann Reisinger | Tim Vieira | Keisuke Sakaguchi | Sheng Zhang | Francis Ferraro | Rachel Rudinger | Kyle Rawlins | Benjamin Van Durme
Proceedings of the Twelfth Language Resources and Evaluation Conference
We present the Universal Decompositional Semantics (UDS) dataset (v1.0), which is bundled with the Decomp toolkit (v0.1). UDS1.0 unifies five high-quality, decompositional semantics-aligned annotation sets within a single semantic graph specification—with graph structures defined by the predicative patterns produced by the PredPatt tool and real-valued node and edge attributes constructed using sophisticated normalization procedures. The Decomp toolkit provides a suite of Python 3 tools for querying UDS graphs using SPARQL. Both UDS1.0 and Decomp0.1 are publicly available at http://decomp.io.
Help! Need Advice on Identifying Advice
Venkata Subrahmanyan Govindarajan | Benjamin Chen | Rebecca Warholic | Katrin Erk | Junyi Jessy Li
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)
Humans use language to accomplish a wide variety of tasks, and asking for and giving advice is one of them. In online advice forums, advice is mixed in with non-advice, like emotional support, and is sometimes stated explicitly, sometimes implicitly. Understanding the language of advice would equip systems with a better grasp of language pragmatics; practically, the ability to identify advice would drastically increase the efficiency of advice-seeking online, as well as advice-giving in natural language generation systems. We present a dataset in English from two Reddit advice forums, r/AskParents and r/needadvice, annotated for whether sentences in posts contain advice or not. Our analysis reveals rich linguistic phenomena in advice discourse. We present preliminary models showing that while pre-trained language models are able to capture advice better than rule-based systems, advice identification is challenging, and we identify directions for future research.