Abstract
Community Question Answering forums are very popular nowadays, as they represent an effective means for communities to share information around particular topics. But the information shared on these forums is often not authentic. This paper presents the ColumbiaNLP submission for SemEval-2019 Task 8: Fact-Checking in Community Question Answering Forums. We show how fine-tuning a language model on a large unannotated corpus of old threads from the Qatar Living forum helps us to classify question types (factual, opinion, socializing) and to judge the factuality of answers on the shared task labeled data from the same forum. Our system finished 4th and 2nd on Subtask A (question type classification) and Subtask B (answer factuality prediction), respectively, based on the official metric of accuracy.
- Anthology ID:
- S19-2200
- Volume:
- Proceedings of the 13th International Workshop on Semantic Evaluation
- Month:
- June
- Year:
- 2019
- Address:
- Minneapolis, Minnesota, USA
- Editors:
- Jonathan May, Ekaterina Shutova, Aurelie Herbelot, Xiaodan Zhu, Marianna Apidianaki, Saif M. Mohammad
- Venue:
- SemEval
- SIG:
- SIGLEX
- Publisher:
- Association for Computational Linguistics
- Pages:
- 1144–1148
- URL:
- https://aclanthology.org/S19-2200
- DOI:
- 10.18653/v1/S19-2200
- Cite (ACL):
- Tuhin Chakrabarty and Smaranda Muresan. 2019. ColumbiaNLP at SemEval-2019 Task 8: The Answer is Language Model Fine-tuning. In Proceedings of the 13th International Workshop on Semantic Evaluation, pages 1144–1148, Minneapolis, Minnesota, USA. Association for Computational Linguistics.
- Cite (Informal):
- ColumbiaNLP at SemEval-2019 Task 8: The Answer is Language Model Fine-tuning (Chakrabarty & Muresan, SemEval 2019)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-5/S19-2200.pdf