Weihao Han
2026
DUET: Joint Exploration of User–Item Profiles in Recommendation System
Yue Chen | Yifei Sun | Lu Wang | Fangkai Yang | Pu Zhao | Minjie Hong | Yifei Dong | Minghua He | Nan Hu | Jianjin Zhang | Zhiwei Dai | Yuefeng Zhan | Weihao Han | Hao Sun | Qingwei Lin | Weiwei Deng | Feng Sun | Qi Zhang | Saravan Rajmohan | Dongmei Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Yue Chen | Yifei Sun | Lu Wang | Fangkai Yang | Pu Zhao | Minjie Hong | Yifei Dong | Minghua He | Nan Hu | Jianjin Zhang | Zhiwei Dai | Yuefeng Zhan | Weihao Han | Hao Sun | Qingwei Lin | Weiwei Deng | Feng Sun | Qi Zhang | Saravan Rajmohan | Dongmei Zhang
Findings of the Association for Computational Linguistics: ACL 2026
Traditional recommendation systems represent users and items as dense vectors and learn to align them in a shared latent space for relevance estimation. Recent LLM-based recommenders instead leverage natural-language representations that are easier to interpret and integrate with downstream reasoning modules. This paper studies how to construct effective textual profiles for users and items, and how to align them for recommendation.A central difficulty is that the best profile format is not known a priori: manually designed templates can be brittle and misaligned with task objectives. Moreover, generating user and item profiles independently may produce descriptions that are individually plausible yet semantically inconsistent for a specific user–item pair. We propose Duet, an interaction-aware profile generator that jointly produces user and item profiles conditioned on both user history and item evidence. Duet follows a three-stage procedure: it first turns raw histories and metadata into compact cues, then expands these cues into paired profile prompts and then generate profiles, and finally optimizes the generation policy with reinforcement learning using downstream recommendation performance as feedback. Experiments on three real-world datasets show that Duet consistently outperforms strong baselines, demonstrating the benefits of template-free profile exploration and joint user–item textual alignment. Project page: https://duet-rec.github.io/.
2025
Alleviating Performance Degradation Caused by Out-of-Distribution Issues in Embedding-Based Retrieval
Haotong Bao | Jianjin Zhang | Qi Chen | Weihao Han | Zhengxin Zeng | Ruiheng Chang | Mingzheng Li | Hao Sun | Weiwei Deng | Feng Sun | Qi Zhang
Findings of the Association for Computational Linguistics: EMNLP 2025
Haotong Bao | Jianjin Zhang | Qi Chen | Weihao Han | Zhengxin Zeng | Ruiheng Chang | Mingzheng Li | Hao Sun | Weiwei Deng | Feng Sun | Qi Zhang
Findings of the Association for Computational Linguistics: EMNLP 2025
In Embedding Based Retrieval (EBR), Approximate Nearest Neighbor (ANN) algorithms are widely adopted for efficient large-scale search. However, recent studies reveal a query out-of-distribution (OOD) issue, where query and base embeddings follow mismatched distributions, significantly degrading ANN performance. In this work, we empirically verify the generality of this phenomenon and provide a quantitative analysis. To mitigate the distributional gap, we introduce a distribution regularizer into the encoder training objective, encouraging alignment between query and base embeddings. Extensive experiments across multiple datasets, encoders, and ANN indices show that our method consistently improves retrieval performance.
2023
To Copy Rather Than Memorize: A Vertical Learning Paradigm for Knowledge Graph Completion
Rui Li | Xu Chen | Chaozhuo Li | Yanming Shen | Jianan Zhao | Yujing Wang | Weihao Han | Hao Sun | Weiwei Deng | Qi Zhang | Xing Xie
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Rui Li | Xu Chen | Chaozhuo Li | Yanming Shen | Jianan Zhao | Yujing Wang | Weihao Han | Hao Sun | Weiwei Deng | Qi Zhang | Xing Xie
Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Embedding models have shown great power in knowledge graph completion (KGC) task. By learning structural constraints for each training triple, these methods implicitly memorize intrinsic relation rules to infer missing links. However, this paper points out that the multi-hop relation rules are hard to be reliably memorized due to the inherent deficiencies of such implicit memorization strategy, making embedding models underperform in predicting links between distant entity pairs. To alleviate this problem, we present Vertical Learning Paradigm (VLP), which extends embedding models by allowing to explicitly copy target information from related factual triples for more accurate prediction. Rather than solely relying on the implicit memory, VLP directly provides additional cues to improve the generalization ability of embedding models, especially making the distant link prediction significantly easier. Moreover, we also propose a novel relative distance based negative sampling technique (ReD) for more effective optimization. Experiments demonstrate the validity and generality of our proposals on two standard benchmarks. Our code is available at https://github.com/rui9812/VLP.
Search
Fix author
Co-authors
- Weiwei Deng 3
- Hao Sun 2
- Feng Sun 2
- Qi Zhang 2
- Jianjin Zhang 2
- Haotong Bao 1
- Ruiheng Chang 1
- Xu Chen 1
- Yue Chen 1
- Qi Chen 1
- Zhiwei Dai 1
- Yifei Dong 1
- Minghua He 1
- Minjie Hong 1
- Nan Hu 1
- Rui Li 1
- Chaozhuo Li 1
- Mingzheng Li 1
- Qingwei Lin 1
- Saravan Rajmohan 1
- Yanming Shen 1
- Hao Sun 1
- Yifei Sun 1
- Yujing Wang 1
- Lu Wang 1
- Xing Xie 1
- Fangkai Yang 1
- Zhengxin Zeng 1
- Yuefeng Zhan 1
- Qi Zhang 1
- Dongmei Zhang 1
- Jianan Zhao 1
- Pu Zhao 1