MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields
Jiaying Lu, Yongchen Qian, Shifan Zhao, Yuanzhe Xi, Carl Yang
Abstract
Previous research has demonstrated the advantages of integrating data from multiple sources over traditional unimodal data, leading to the emergence of numerous novel multimodal applications. We propose MuG, a multimodal classification benchmark with eight datasets that allows researchers to evaluate and improve their models. The datasets are collected from four genres of games and cover tabular, textual, and visual modalities. We conduct multi-aspect data analysis to provide insights into the benchmark, including label balance ratios, percentages of missing features, distributions of data within each modality, and correlations between labels and input modalities. We further present experimental results obtained with several state-of-the-art unimodal and multimodal classifiers, which demonstrate the challenging and multimodal-dependent properties of the benchmark. MuG is released at https://github.com/lujiaying/MUG-Bench with the data, tutorials, and implemented baselines.
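As a minimal sketch of the per-dataset statistics mentioned in the abstract (label balance ratio and missing-feature percentage), assuming each MuG dataset can be loaded as a table with one categorical label column; the file name and column name below are hypothetical, and the actual release layout is documented in the repository tutorials:

```python
import pandas as pd

def label_balance_ratio(labels: pd.Series) -> float:
    """Count of the rarest class divided by the count of the most frequent class."""
    counts = labels.value_counts()
    return counts.min() / counts.max()

def missing_feature_pct(features: pd.DataFrame) -> float:
    """Percentage of missing cells over all feature columns."""
    return 100.0 * features.isna().sum().sum() / features.size

# Hypothetical file name and label column; consult the MUG-Bench repository
# for the actual per-dataset layout.
df = pd.read_csv("train.csv")
y, X = df["label"], df.drop(columns=["label"])
print(f"label balance ratio:     {label_balance_ratio(y):.3f}")
print(f"missing feature cells %: {missing_feature_pct(X):.1f}")
```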
- Anthology ID: 2023.findings-emnlp.354
- Volume: Findings of the Association for Computational Linguistics: EMNLP 2023
- Month: December
- Year: 2023
- Address: Singapore
- Editors: Houda Bouamor, Juan Pino, Kalika Bali
- Venue: Findings
- Publisher: Association for Computational Linguistics
- Pages: 5332–5346
- URL: https://aclanthology.org/2023.findings-emnlp.354
- DOI: 10.18653/v1/2023.findings-emnlp.354
- Cite (ACL): Jiaying Lu, Yongchen Qian, Shifan Zhao, Yuanzhe Xi, and Carl Yang. 2023. MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 5332–5346, Singapore. Association for Computational Linguistics.
- Cite (Informal): MuG: A Multimodal Classification Benchmark on Game Data with Tabular, Textual, and Visual Fields (Lu et al., Findings 2023)
- PDF: https://aclanthology.org/2023.findings-emnlp.354.pdf