Jian Gao
2026
LaoBench: A Large-Scale Multidimensional Lao Benchmark for Large Language Models
Jian Gao | Richeng Xuan | Zhaolu Kang | Dingshi Liao | Wenxin Huang | Zongmou Huang | Yangdi Xu | Bowen Qin | Zheqi He | Xi Yang | Changjinli | Yonghua Lin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Jian Gao | Richeng Xuan | Zhaolu Kang | Dingshi Liao | Wenxin Huang | Zongmou Huang | Yangdi Xu | Bowen Qin | Zheqi He | Xi Yang | Changjinli | Yonghua Lin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The rapid advancement of large language models (LLMs) has not been matched by their evaluation in low-resource languages, especially Southeast Asian languages like Lao. To fill this gap, we introduce LaoBench, the first large-scale, high-quality, and multidimensional benchmark for assessing LLM language understanding and reasoning in Lao. LaoBench contains 17,000+ expert-curated samples across three dimensions: culturally grounded knowledge application, curriculum-aligned K12 education, and bilingual translation among Lao, Chinese, and English. It includes open-source and held-out subsets, where the held-out portion enables secure black-box evaluation via a controlled service to improve fairness and data security. We construct LaoBench with a hybrid pipeline that combines expert authoring with agent-assisted verification, ensuring linguistic accuracy, cultural relevance, and educational validity. We evaluate diverse state-of-the-art open-source and closed-source LLMs, and find that even strong multilingual models lag behind human experts, particularly in culturally grounded reasoning and translation fidelity. We hope LaoBench will catalyze research on Lao and other underrepresented Southeast Asian languages for more inclusive multilingual evaluation.
2025
Taming Text-to-Image Synthesis for Novices: User-centric Prompt Generation via Multi-turn Guidance
Yilun Liu | Minggui He | Feiyu Yao | Yuhe Ji | Shimin Tao | Jingzhou Du | Justin Li | Jian Gao | Zhang Li | Hao Yang | Boxing Chen | Osamu Yoshie
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Yilun Liu | Minggui He | Feiyu Yao | Yuhe Ji | Shimin Tao | Jingzhou Du | Justin Li | Jian Gao | Zhang Li | Hao Yang | Boxing Chen | Osamu Yoshie
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
The emergence of text-to-image synthesis (TIS) models has significantly influenced digital image creation by producing high-quality visuals from written descriptions. Yet these models are sensitive on textual prompts, posing a challenge for novice users who may not be familiar with TIS prompt writing. Existing solutions relieve this via automatic prompt expansion or generation from a user query. However, this single-turn manner suffers from limited user-centricity in terms of result interpretability and user interactivity. Thus, we propose DialPrompt, a dialogue-based TIS prompt generation model that emphasizes user experience for novice users. DialPrompt is designed to follow a multi-turn workflow, where in each round of dialogue the model guides user to express their preferences on possible optimization dimensions before generating the final TIS prompt. To achieve this, we mined 15 essential dimensions for high-quality prompts from advanced users and curated a multi-turn dataset. Through training on this dataset, DialPrompt improves user-centricity by allowing users to perceive and control the creation process of TIS prompts. Experiments indicate that DialPrompt improves significantly in user-centricity score compared with existing approaches while maintaining a competitive quality of synthesized images. In our user evaluation, DialPrompt is highly rated by 19 human reviewers (especially novices).
2022
Composition-based Heterogeneous Graph Multi-channel Attention Network for Multi-aspect Multi-sentiment Classification
Hao Niu | Yun Xiong | Jian Gao | Zhongchen Miao | Xiaosu Wang | Hongrun Ren | Yao Zhang | Yangyong Zhu
Proceedings of the 29th International Conference on Computational Linguistics
Hao Niu | Yun Xiong | Jian Gao | Zhongchen Miao | Xiaosu Wang | Hongrun Ren | Yao Zhang | Yangyong Zhu
Proceedings of the 29th International Conference on Computational Linguistics
Aspect-based sentiment analysis (ABSA) has drawn more and more attention because of its extensive applications. However, towards the sentence carried with more than one aspect, most existing works generate an aspect-specific sentence representation for each aspect term to predict sentiment polarity, which neglects the sentiment relationship among aspect terms. Besides, most current ABSA methods focus on sentences containing only one aspect term or multiple aspect terms with the same sentiment polarity, which makes ABSA degenerate into sentence-level sentiment analysis. In this paper, to deal with this problem, we construct a heterogeneous graph to model inter-aspect relationships and aspect-context relationships simultaneously and propose a novel Composition-based Heterogeneous Graph Multi-channel Attention Network (CHGMAN) to encode the constructed heterogeneous graph. Meanwhile, we conduct extensive experiments on three datasets: MAMSATSA, Rest14, and Laptop14, experimental results show the effectiveness of our method.
2020
Financial News Annotation by Weakly-Supervised Hierarchical Multi-label Learning
Hang Jiang | Zhongchen Miao | Yuefeng Lin | Chenyu Wang | Mengjun Ni | Jian Gao | Jidong Lu | Guangwei Shi
Proceedings of the Second Workshop on Financial Technology and Natural Language Processing
Hang Jiang | Zhongchen Miao | Yuefeng Lin | Chenyu Wang | Mengjun Ni | Jian Gao | Jidong Lu | Guangwei Shi
Proceedings of the Second Workshop on Financial Technology and Natural Language Processing
2010
Search
Fix author
Co-authors
- Zhongchen Miao 2
- Changjinli 1
- Boxing Chen 1
- Jingzhou Du 1
- Minggui He 1
- Zheqi He 1
- He-Yan Huang (黄河燕) 1
- Wenxin Huang 1
- Zongmou Huang 1
- Yuhe Ji 1
- Hang Jiang 1
- Zhaolu Kang 1
- Justin Li 1
- Zhang Li 1
- Dingshi Liao 1
- Yonghua Lin 1
- Yuefeng Lin 1
- Yilun Liu 1
- Jidong Lu 1
- Qian Mo 1
- Mengjun Ni 1
- Hao Niu 1
- Bowen Qin 1
- Hongrun Ren 1
- Guangwei Shi 1
- Shimin Tao 1
- Chenyu Wang 1
- Xiaosu Wang 1
- Yun Xiong 1
- Yangdi Xu 1
- Richeng Xuan 1
- Hao Yang 1
- Xi Yang 1
- Feiyu Yao 1
- Osamu Yoshie 1
- Hua-Ping Zhang 1
- Yao Zhang 1
- Yangyong Zhu 1