From Fake to Hyperpartisan News Detection Using Domain Adaptation
Răzvan-Alexandru Smădu, Sebastian-Vasile Echim, Dumitru-Clementin Cercel, Iuliana Marin, Florin Pop
Abstract
Unsupervised Domain Adaptation (UDA) is a popular technique that aims to reduce the domain shift between two data distributions. It was successfully applied in computer vision and natural language processing. In the current work, we explore the effects of various unsupervised domain adaptation techniques between two text classification tasks: fake and hyperpartisan news detection. We investigate the knowledge transfer from fake to hyperpartisan news detection without involving target labels during training. Thus, we evaluate UDA, cluster alignment with a teacher, and cross-domain contrastive learning. Extensive experiments show that these techniques improve performance, while including data augmentation further enhances the results. In addition, we combine clustering and topic modeling algorithms with UDA, resulting in improved performances compared to the initial UDA setup.- Anthology ID:
- 2023.ranlp-1.117
- Volume:
- Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing
- Month:
- September
- Year:
- 2023
- Address:
- Varna, Bulgaria
- Editors:
- Ruslan Mitkov, Galia Angelova
- Venue:
- RANLP
- SIG:
- Publisher:
- INCOMA Ltd., Shoumen, Bulgaria
- Note:
- Pages:
- 1095–1109
- Language:
- URL:
- https://aclanthology.org/2023.ranlp-1.117
- DOI:
- Cite (ACL):
- Răzvan-Alexandru Smădu, Sebastian-Vasile Echim, Dumitru-Clementin Cercel, Iuliana Marin, and Florin Pop. 2023. From Fake to Hyperpartisan News Detection Using Domain Adaptation. In Proceedings of the 14th International Conference on Recent Advances in Natural Language Processing, pages 1095–1109, Varna, Bulgaria. INCOMA Ltd., Shoumen, Bulgaria.
- Cite (Informal):
- From Fake to Hyperpartisan News Detection Using Domain Adaptation (Smădu et al., RANLP 2023)
- PDF:
- https://preview.aclanthology.org/ingest-bitext-workshop/2023.ranlp-1.117.pdf