Junzhe Jiang
Other people with similar names: Junzhe Jiang
Unverified author pages with similar names: Junzhe Jiang
2025
PQR: Improving Dense Retrieval via Potential Query Modeling
Junfeng Kang | Rui Li | Qi Liu | Yanjiang Chen | Zheng Zhang | Junzhe Jiang | Heng Yu | Yu Su
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Junfeng Kang | Rui Li | Qi Liu | Yanjiang Chen | Zheng Zhang | Junzhe Jiang | Heng Yu | Yu Su
Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Dense retrieval has now become the mainstream paradigm in information retrieval. The core idea of dense retrieval is to align document embeddings with their corresponding query embeddings by maximizing their dot product. The current training data is quite sparse, with each document typically associated with only one or a few labeled queries. However, a single document can be retrieved by multiple different queries. Aligning a document with just one or a limited number of labeled queries results in a loss of its semantic information. In this paper, we propose a training-free Potential Query Retrieval (PQR) framework to address this issue. Specifically, we use a Gaussian mixture distribution to model all potential queries for a document, aiming to capture its comprehensive semantic information. To obtain this distribution, we introduce three sampling strategies to sample a large number of potential queries for each document and encode them into a semantic space. Using these sampled queries, we employ the Expectation-Maximization algorithm to estimate parameters of the distribution. Finally, we also propose a method to calculate similarity scores between user queries and documents under the PQR framework. Extensive experiments demonstrate the effectiveness of the proposed method.
2024
ARM: An Alignment-and-Replacement Module for Chinese Spelling Check Based on LLMs
Changchun Liu | Kai Zhang | Junzhe Jiang | Zirui Liu | Hanqing Tao | Min Gao | Enhong Chen
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Changchun Liu | Kai Zhang | Junzhe Jiang | Zirui Liu | Hanqing Tao | Min Gao | Enhong Chen
Proceedings of the 2024 Conference on Empirical Methods in Natural Language Processing
Chinese Spelling Check (CSC) aims to identify and correct spelling errors in Chinese texts, where enhanced semantic understanding of a sentence can significantly improve correction accuracy. Recently, Large Language Models (LLMs) have demonstrated exceptional mastery of world knowledge and semantic understanding, rendering them more robust against spelling errors. However, the application of LLMs in CSC is a double-edged sword, as they tend to unnecessarily alter sentence length and modify rare but correctly used phrases. In this paper, by leveraging the capabilities of LLMs while mitigating their limitations, we propose a novel plug-and-play Alignment-and-Replacement Module ARM that enhances the performance of existing CSC models and without the need for retraining or fine-tuning. Experiment results and analysis on three benchmark datasets demonstrate the effectiveness and competitiveness of the proposed module.