An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks

Anirban Laha, Vikas Raykar


Abstract
Several tasks in argumentation mining and debating, question-answering, and natural language inference involve classifying a sequence in the context of another sequence (referred as bi-sequence classification). For several single sequence classification tasks, the current state-of-the-art approaches are based on recurrent and convolutional neural networks. On the other hand, for bi-sequence classification problems, there is not much understanding as to the best deep learning architecture. In this paper, we attempt to get an understanding of this category of problems by extensive empirical evaluation of 19 different deep learning architectures (specifically on different ways of handling context) for various problems originating in natural language processing like debating, textual entailment and question-answering. Following the empirical evaluation, we offer our insights and conclusions regarding the architectures we have considered. We also establish the first deep learning baselines for three argumentation mining tasks.
Anthology ID:
C16-1260
Volume:
Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers
Month:
December
Year:
2016
Address:
Osaka, Japan
Venue:
COLING
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
2762–2773
Language:
URL:
https://aclanthology.org/C16-1260
DOI:
Bibkey:
Cite (ACL):
Anirban Laha and Vikas Raykar. 2016. An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks. In Proceedings of COLING 2016, the 26th International Conference on Computational Linguistics: Technical Papers, pages 2762–2773, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
An Empirical Evaluation of various Deep Learning Architectures for Bi-Sequence Classification Tasks (Laha & Raykar, COLING 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/C16-1260.pdf
Data
WikiQA