Fanheng Kong
2025
TUNA: Comprehensive Fine-grained Temporal Understanding Evaluation on Dense Dynamic Videos
Fanheng Kong
|
Jingyuan Zhang
|
Hongzhi Zhang
|
Shi Feng
|
Daling Wang
|
Linhao Yu
|
Xingguang Ji
|
Yu Tian
|
V. W.
|
Fuzheng Zhang
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Evaluating Multimodal Large Language Models on Video Captioning via Monte Carlo Tree Search
Linhao Yu
|
Xingguang Ji
|
Yahui Liu
|
Fanheng Kong
|
Chenxi Sun
|
Jingyuan Zhang
|
Hongzhi Zhang
|
V. W.
|
Fuzheng Zhang
|
Deyi Xiong
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
2024
STICKERCONV: Generating Multimodal Empathetic Responses from Scratch
Yiqun Zhang
|
Fanheng Kong
|
Peidong Wang
|
Shuang Sun
|
SWangLing SWangLing
|
Shi Feng
|
Daling Wang
|
Yifei Zhang
|
Kaisong Song
Proceedings of the 62nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
TIGER: A Unified Generative Model Framework for Multimodal Dialogue Response Generation
Fanheng Kong
|
Peidong Wang
|
Shi Feng
|
Daling Wang
|
Yifei Zhang
Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024)