Darbarer @ AutoMin2023: Transcription simplification for concise minute generation from multi-party conversations
Ismaël Rousseau, Loïc Fosse, Youness Dkhissi, Geraldine Damnati, Gwénolé Lecorvé
Abstract
This document reports the approach of our team Darbarer for the main task (Task A) of the AutoMin 2023 challenge. Our system is composed of four main modules. The first module relies on a text simplification model aiming at standardizing the utterances of the conversation and compressing the input in order to focus on informative content. The second module handles summarization by employing a straightforward segmentation strategy and a fine-tuned BART-based generative model. Then a titling module has been trained in order to propose a short description of each summarized block. Lastly, we apply a post-processing step aimed at enhancing readability through specific formatting rules. Our contributions lie in the first, third and last steps. Our system generates precise and concise minutes. We provide a detailed description of our modules, discuss the difficulty of evaluating their impact and propose an analysis of observed errors in our generated minutes.- Anthology ID:
- 2023.inlg-genchal.17
- Volume:
- Proceedings of the 16th International Natural Language Generation Conference: Generation Challenges
- Month:
- September
- Year:
- 2023
- Address:
- Prague, Czechia
- Editor:
- Simon Mille
- Venues:
- INLG | SIGDIAL
- SIG:
- SIGGEN
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 121–131
- Language:
- URL:
- https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2023.inlg-genchal.17/
- DOI:
- Cite (ACL):
- Ismaël Rousseau, Loïc Fosse, Youness Dkhissi, Geraldine Damnati, and Gwénolé Lecorvé. 2023. Darbarer @ AutoMin2023: Transcription simplification for concise minute generation from multi-party conversations. In Proceedings of the 16th International Natural Language Generation Conference: Generation Challenges, pages 121–131, Prague, Czechia. Association for Computational Linguistics.
- Cite (Informal):
- Darbarer @ AutoMin2023: Transcription simplification for concise minute generation from multi-party conversations (Rousseau et al., INLG-SIGDIAL 2023)
- PDF:
- https://preview.aclanthology.org/sigedu-bea-out-of-sync-correction/2023.inlg-genchal.17.pdf