Dong Zhang
Other people with similar names: Dong Zhang
Unverified author pages with similar names: Dong Zhang
2026
Don’t Just Listen, Try Planning: Graph-based Retrieval-Generation Agent for Long-form Audio Meeting Understanding
Quanwei Tang | Dong Zhang | Shoushan Li | Guodong Zhou
Findings of the Association for Computational Linguistics: ACL 2026
Quanwei Tang | Dong Zhang | Shoushan Li | Guodong Zhou
Findings of the Association for Computational Linguistics: ACL 2026
Long-form audio meeting understanding (LAMU) is gaining attention, but dedicated question answering (QA) datasets are lacking. Previous tailored speech QA and existing Speech LLMs suffer from acoustic information loss and poor long-term dependency capture. We construct the LongAudioQA dataset and propose the GRGA model, which models heterogeneous audio features into a multi-dimensional graph and leverages agent planning for retrieval and answer generation, effectively addressing existing limitations.
2025
Zero-shot Cross-lingual NER via Mitigating Language Difference: An Entity-aligned Translation Perspective
Zhihao Zhang | Sophia Yat Mei Lee | Dong Zhang | Shoushan Li | Guodong Zhou
Findings of the Association for Computational Linguistics: EMNLP 2025
Zhihao Zhang | Sophia Yat Mei Lee | Dong Zhang | Shoushan Li | Guodong Zhou
Findings of the Association for Computational Linguistics: EMNLP 2025
Cross-lingual Named Entity Recognition (CL-NER) aims to transfer knowledge from high-resource languages to low-resource languages. However, existing zero-shot CL-NER (ZCL-NER) approaches primarily focus on Latin script language (LSL), where shared linguistic features facilitate effective knowledge transfer. In contrast, for non-Latin script language (NSL), such as Chinese and Japanese, performance often degrades due to deep structural differences. To address these challenges, we propose an entity-aligned translation (EAT) approach. Leveraging large language models (LLMs), EAT employs a dual-translation strategy to align entities between NSL and English. In addition, we fine-tune LLMs using multilingual Wikipedia data to enhance the entity alignment from source to target languages.