Developing Corpus of Lecture Utterances Aligned to Slide Components

Ryo Minamiguchi, Masatoshi Tsuchiya


Abstract
The approach which formulates the automatic text summarization as a maximum coverage problem with knapsack constraint over a set of textual units and a set of weighted conceptual units is promising. However, it is quite important and difficult to determine the appropriate granularity of conceptual units for this formulation. In order to resolve this problem, we are examining to use components of presentation slides as conceptual units to generate a summary of lecture utterances, instead of other possible conceptual units like base noun phrases or important nouns. This paper explains our developing corpus designed to evaluate our proposing approach, which consists of presentation slides and lecture utterances aligned to presentation slide components.
Anthology ID:
W16-5404
Volume:
Proceedings of the 12th Workshop on Asian Language Resources (ALR12)
Month:
December
Year:
2016
Address:
Osaka, Japan
Venue:
ALR
SIG:
Publisher:
The COLING 2016 Organizing Committee
Note:
Pages:
30–37
Language:
URL:
https://aclanthology.org/W16-5404
DOI:
Bibkey:
Cite (ACL):
Ryo Minamiguchi and Masatoshi Tsuchiya. 2016. Developing Corpus of Lecture Utterances Aligned to Slide Components. In Proceedings of the 12th Workshop on Asian Language Resources (ALR12), pages 30–37, Osaka, Japan. The COLING 2016 Organizing Committee.
Cite (Informal):
Developing Corpus of Lecture Utterances Aligned to Slide Components (Minamiguchi & Tsuchiya, ALR 2016)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingestion-script-update/W16-5404.pdf