Yu-An Liu
2025
The Silent Saboteur: Imperceptible Adversarial Attacks against Black-Box Retrieval-Augmented Generation Systems
Hongru Song
|
Yu-An Liu
|
Ruqing Zhang
|
Jiafeng Guo
|
Jianming Lv
|
Maarten de Rijke
|
Xueqi Cheng
Findings of the Association for Computational Linguistics: ACL 2025
We explore adversarial attacks against retrieval-augmented generation (RAG) systems to identify their vulnerabilities. We focus on generating human-imperceptible adversarial examples and introduce a novel imperceptible retrieve-to-generate attack against RAG. This task aims to find imperceptible perturbations that retrieve a target document, originally excluded from the initial top-k candidate set, in order to influence the final answer generation. To address this task, we propose ReGENT, a reinforcement learning-based framework that tracks interactions between the attacker and the target RAG and continuously refines attack strategies based on relevance-generation-naturalness rewards. Experiments on newly constructed factual and non-factual question-answering benchmarks demonstrate that ReGENT significantly outperforms existing attack methods in misleading RAG systems with small imperceptible text perturbations.
A Generative Framework for Personalized Sticker Retrieval
Changjiang Zhou
|
Ruqing Zhang
|
Jiafeng Guo
|
Yu-An Liu
|
Fan Zhang
|
Ganyuan Luo
|
Xueqi Cheng
Findings of the Association for Computational Linguistics: EMNLP 2025
Formulating information retrieval as a variant of generative modeling, specifically using autoregressive models to generate relevant identifiers for a given query, has recently attracted considerable attention. However, its application to personalized sticker retrieval remains largely unexplored and presents unique challenges: existing relevance-based generative retrieval methods typically lack personalization, leading to a mismatch between diverse user expectations and the retrieved results. To address this gap, we propose PEARL, a novel generative framework for personalized sticker retrieval, and make two key contributions: (i) To encode user-specific sticker preferences, we design a representation learning model to learn discriminative user representations. It is trained on three prediction tasks that leverage personal information and click history; and (ii) To generate stickers aligned with a user’s query intent, we propose a novel intent-aware learning objective that prioritizes stickers associated with higher-ranked intents. Empirical results from both offline evaluations and online tests demonstrate that PEARL significantly outperforms state-of-the-art methods.
Search
Fix author
Co-authors
- Xueqi Cheng (程学旗) 2
- Jiafeng Guo (嘉丰 郭) 2
- Ruqing Zhang (儒清 张) 2
- Ganyuan Luo 1
- Jianming Lv 1
- show all...