Ting Liu
Other people with similar names: Ting Liu, Ting Liu, Ting Liu
Unverified author pages with similar names: Ting Liu
2026
Exploring and Distilling Multi-Dimensional Clues for Interpretable Social Bot Detection
Yi Han | Haiqi Lu | Lizi Liao | Shuhan Zhou | Yuanxing Liu | Weinan Zhang | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yi Han | Haiqi Lu | Lizi Liao | Shuhan Zhou | Yuanxing Liu | Weinan Zhang | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Social bot accounts have long been disseminating disinformation and engaging in malicious activities on social media platforms. Detecting these social bots has become a critical and urgent task, essential for maintaining a healthy online ecosystem. Existing social bot detection research usually provides detection results directly without corresponding supportive explanations, making it difficult to assess the extent to which such predictions are trustworthy. This is a key concern for online moderation. In this work, we explore the detection interpretation and summarize a four-dimensional clue framework from individual and social perspectives. We propose CDRBot, which primarily employs outcome-reward reinforcement learning to train inspectors to generate faithful, grounded, and readable clues from the *User Information*, *Semantic Features*, *Interactive Situation*, and *Behavioral Pattern*. These clues are then integrated to make final predictions. Experimental results demonstrate that our approach outperforms other baselines in detection performance. The generated clues are faithful, grounded, and readable, and can significantly enhance the performance of large language models in social bot detection.
Question Tells You Where the Answer Is: Intention-aware Long-Context KV Cache Compression
Liang Zhao | Xiaocheng Feng | Weihong Zhong | Lei Huang | Kun Zhu | Baoxin Wang | Dayong Wu | Guoping Hu | Ting Liu | Bing Qin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Liang Zhao | Xiaocheng Feng | Weihong Zhong | Lei Huang | Kun Zhu | Baoxin Wang | Dayong Wu | Guoping Hu | Ting Liu | Bing Qin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
The increasing context window greatly extends the capabilities of large language models, but on the other hand, it incurs an unaffordable memory overhead and computational latency due to the increasing Key-Value (KV) cache size. Recent KV cache compression methods manage to reduce the cache size by dropping irrelevant KVs. However, these methods often fail to identify crucial KVs for generation while excluding others accurately, resulting in severe information loss. To address this gap, we propose **IntentKV**, an intention-aware KV cache eviction method that identifies and retains crucial KVs according to the attention distribution of intention, which semantically reflects the user’s goal and determines which part of the context is relevant. The consistency between the semantics and attention distribution is further substantiated through meticulously designed experiments. On this basis, IntentKV first distinguishes intention tokens from the vanilla context tokens based on their attention distribution distances. Then, the block-wise cumulative attention is calculated via aggregating the intention token attention. Finally, blocks that acquire high cumulative attention are picked and stored in KV cache. We evaluate our method across diverse long-context tasks and models. Results demonstrate that IntentKV can effectively maintain the model performance while reducing the KV cache size from 128K to 2K, leading to a 6.3x increase in decoding speed and 7.8x enhancement in memory efficiency compared to the default setting.
TinyJudge: Unverifiable Constraint Alignment via Lightweight Specialist Ensembles
Yirong Zeng | Yufei Liu | Xiao Ding | Yutai Hou | Yuxian Wang | Wu Ning | Haonan Song | Dandan Tu | Qixun Zhang | Yuxiang He | Bibo Cai | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yirong Zeng | Yufei Liu | Xiao Ding | Yutai Hou | Yuxian Wang | Wu Ning | Haonan Song | Dandan Tu | Qixun Zhang | Yuxiang He | Bibo Cai | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Instruction Following (IF) is a core capability of LLMs, requiring strict adherence to diverse constraints, ranging from verifiable ones (e.g., output length) to unverifiable ones (e.g., tone). Reinforcement learning with verifiable rewards has emerged as a paradigm for IF tasks, leveraging LLM-as-a-judge to assess unverifiable constraints. However, we empirically find that this approach remains a significant bottleneck, suffering from severe reward hacking and higher computational overhead. In this work, we first analyze the generalization capabilities of unverifiable constraints and discover that specific constraints exhibit distinct, high-generalization patterns. Motivated by this, we propose TinyJudge, a framework that employs an ensemble of specialized tiny language models (e.g., 0.6B) to provide rewards for soft constraints. By distilling expertise from frontier models into these tiny models, it achieves high-precision, lightweight evaluation. Extensive evaluations across five benchmarks demonstrate that TinyJudge outperforms the baselines by ~10% in average performance and 12% in reward precision. Crucially, it also achieves a 3× speedup in total training time. Our work provides a scalable and robust path for aligning LLMs with unverifiable human instructions.
Toward Secure Tuning: Mitigating Security Risks from Instruction Fine-Tuning
Yanrui Du | Fenglei Fan | Sendong Zhao | Jiawei Cao | Ming Ma | Danyang Zhao | Shuren Qi | Ting Liu | Bing Qin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yanrui Du | Fenglei Fan | Sendong Zhao | Jiawei Cao | Ming Ma | Danyang Zhao | Shuren Qi | Ting Liu | Bing Qin
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Instruction Fine-Tuning (IFT) has emerged as a critical technique for customizing Large Language Models (LLMs) to meet diverse downstream applications. However, recent studies have revealed that IFT can compromise the built-in security mechanisms of LLMs, thereby posing significant security risks. Although defense methods targeting various training stages have been proposed, they either face challenges in practical deployment or exhibit instability and limited performance gains. In our study, we propose a novel SWAT method that introduces a key idea: shifting more of the learning burden onto security-robust parameters. To this end, our study investigates how module-level parameters affect LLMs’ internal security feature space, aiming to uncover robustness patterns in parameters. Guided by this analysis, we identify a robust module set (Mods_Rob) that exhibits minimal effects on LLMs’ security feature space. Leveraging this insight, SWAT proceeds in two phases: (1) a warm-up phase that preferentially trains Mods_Rob to learn low-level features with minimal security risk, followed by (2) standard tuning to achieve optimal task performance. Across diverse knowledge-intensive datasets, scenarios, and LLMs, SWAT substantially reduces security risks without sacrificing task performance gains.
Consolidation or Adaptation? PRISM: Disentangling SFT and RL Data via Gradient Concentration
Yang Zhao | Yangou Ouyang | Xiao Ding | Hepeng Wang | Bibo Cai | Kai Xiong | Jinglong Gao | Zhouhao Sun | Li Du | Bing Qin | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yang Zhao | Yangou Ouyang | Xiao Ding | Hepeng Wang | Bibo Cai | Kai Xiong | Jinglong Gao | Zhouhao Sun | Li Du | Bing Qin | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
While Hybrid Supervised Fine-Tuning (SFT) followed by Reinforcement Learning (RL) has become the standard paradigm for training LLM agents, effective mechanisms for data allocation between these stages remain largely underexplored. Current data arbitration strategies often rely on surface-level heuristics that fail to diagnose intrinsic learning needs. Since SFT targets pattern consolidation through imitation while RL drives structural adaptation via exploration, misaligning data with these functional roles causes severe optimization interference. We propose PRISM, a dynamics-aware framework grounded in Schema Theory that arbitrates data based on its degree of cognitive conflict with the model’s existing knowledge. By analyzing the spatial geometric structure of gradients, PRISM identifies data triggering high spatial concentration as high-conflict signals that require RL for structural restructuring. In contrast, data yielding diffuse updates is routed to SFT for efficient consolidation. Extensive experiments on WebShop and ALFWorld demonstrate that PRISM achieves a Pareto improvement, outperforming state-of-the-art hybrid methods while reducing computational costs by up to 3.22 ×. Our findings suggest that disentangling data based on internal optimization regimes is crucial for scalable and robust agent alignment.
MAESTRO: Meta-learning Adaptive Estimation of Scalarization Trade-offs for Reward Optimization
Yang Zhao | Hepeng Wang | Xiao Ding | Yangou Ouyang | Bibo Cai | Kai Xiong | Jinglong Gao | Zhouhao Sun | Li Du | Bing Qin | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yang Zhao | Hepeng Wang | Xiao Ding | Yangou Ouyang | Bibo Cai | Kai Xiong | Jinglong Gao | Zhouhao Sun | Li Du | Bing Qin | Ting Liu
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Group-Relative Policy Optimization (GRPO) has emerged as an efficient paradigm for aligning Large Language Models (LLMs), yet its efficacy is primarily confined to domains with verifiable ground truths. Extending GRPO to **open-domain settings** remains a critical challenge, as **unconstrained generation** entails multi-faceted and often conflicting objectives—such as creativity versus factuality—where rigid, static reward scalarization is inherently suboptimal. To address this, we propose **MAESTRO** (**M**eta-learning **A**daptive **E**stimation of **S**calarization **T**rade-offs for **R**eward **O**ptimization), which introduces a meta-cognitive orchestration layer that treats reward scalarization as a dynamic latent policy, leveraging the model’s terminal hidden states as a semantic bottleneck to perceive task-specific priorities. We formulate this as a contextual bandit problem within a bi-level optimization framework, where a lightweight Conductor network co-evolves with the policy by utilizing group-relative advantages as a meta-reward signal. Across seven benchmarks, MAESTRO consistently outperforms single-reward and static multi-objective baselines, while preserving the efficiency advantages of GRPO, and in some settings even reducing redundant generation.
Search
Fix author
Co-authors
- Bing Qin (秦兵) 4
- Bibo Cai 3
- Xiao Ding 3
- Li Du 2
- Jinglong Gao 2
- Yangou Ouyang 2
- Zhouhao Sun 2
- Hepeng Wang 2
- Kai Xiong 2
- Yang Zhao 2
- Jiawei Cao 1
- Yanrui Du 1
- Fenglei Fan 1
- Xiaocheng Feng (冯骁骋) 1
- Yi Han 1
- Yuxiang He 1
- Yutai Hou 1
- Guoping Hu 1
- Lei Huang (黄磊) 1
- Lizi Liao 1
- Yuanxing Liu 1
- Yufei Liu 1
- Haiqi Lu 1
- Ming Ma 1
- Wu Ning 1
- Shuren Qi 1
- Haonan Song 1
- Dandan Tu 1
- Baoxin Wang 1
- Yuxian Wang 1
- Dayong Wu 1
- Yirong Zeng 1
- Qixun Zhang 1
- Weinan Zhang 1
- Danyang Zhao 1
- Liang Zhao (赵亮) 1
- Sendong Zhao 1
- Weihong Zhong 1
- Shuhan Zhou 1
- Kun Zhu (朱坤) 1
Venues
- ACL6