@inproceedings{banerjee-etal-2025-hits,
title = "{HITS} at {DISRPT} 2025: Discourse Segmentation, Connective Detection, and Relation Classification",
author = "Banerjee, Souvik and
Fan, Yi and
Strube, Michael",
editor = "Braud, Chlo{\'e} and
Liu, Yang Janet and
Muller, Philippe and
Zeldes, Amir and
Li, Chuyuan",
booktitle = "Proceedings of the 4th Shared Task on Discourse Relation Parsing and Treebanking (DISRPT 2025)",
month = nov,
year = "2025",
address = "Suzhou, China",
publisher = "Association for Computational Linguistics",
url = "https://preview.aclanthology.org/ingest-emnlp/2025.disrpt-1.5/",
pages = "63--78",
ISBN = "979-8-89176-344-9",
    abstract = "This paper describes the submission of the HITS team to the DISRPT 2025 shared task, which comprises three sub-tasks: (1) discourse unit segmentation across formalisms, (2) cross-lingual discourse connective identification, and (3) cross-formalism discourse relation classification. For Task 1, our approach involves fine-tuning through multilingual joint training on linguistically motivated language groups. We incorporate two key techniques to improve model performance: a weighted loss function to address the task{'}s significant class imbalance, and Fast Gradient Method (FGM) adversarial training to boost the model{'}s robustness. For Task 2, we build an ensemble of three encoder models whose embeddings are fused with a multi-head attention layer. We also add Part-Of-Speech tags and dependency relations from the training files as linguistic features, and place a CRF layer after the classification layer to model dependencies between adjacent labels. To counter label imbalance, we use focal loss and label smoothing, making our model robust and flexible enough to handle different languages. For Task 3, we use a two-stage fine-tuning framework designed to transfer the nuanced reasoning capabilities of a very large ``teacher'' model to a compact ``student'' model so that the smaller model can learn complex discourse relationships. Fine-tuning follows a curriculum learning framework, in which the model learns to perform increasingly harder tasks: it first learns to predict the label from the discourse units alone, and then studies Chain-Of-Thought reasoning for harder examples. In this way it learns to internalise such reasoning and improves prediction accuracy on the harder samples."
}