Lingxiang Wu
2026
C3D: Enhancing LLM Reasoning via Confidence-Guided Contrastive Decoding
Yufeng Zhang | Xuepeng Wang | Lingxiang Wu | Jinqiao Wang
Findings of the Association for Computational Linguistics: ACL 2026
Yufeng Zhang | Xuepeng Wang | Lingxiang Wu | Jinqiao Wang
Findings of the Association for Computational Linguistics: ACL 2026
Large language models (LLMs) are prone to distraction by contextual information during reasoning. Previous work primarily focuses on improving the generation of the next token while overlooking the potential bias introduced by existing premises. We propose a novel decoding method to mitigate such biases. Our framework uses predicted logits to estimate the model’s confidence. By decomposing the full context into multiple premises, we gain a clearer understanding of the relevance of each premise to the question. During next-token prediction, we refine the output by contrasting the logits with the highest and lowest confidence. Our method effectively reveals how the model dynamically activates and adjusts its consideration of each premise as reasoning progresses.
2025
OpenS2S: Advancing Fully Open-Source End-to-End Empathetic Large Speech Language Model
Chen Wang | Tianyu Peng | Wen Yang | YiNan Bai | Guangfu Wang | Jun Lin | Lanpeng Jia | Lingxiang Wu | Jinqiao Wang | Chengqing Zong | Jiajun Zhang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Chen Wang | Tianyu Peng | Wen Yang | YiNan Bai | Guangfu Wang | Jun Lin | Lanpeng Jia | Lingxiang Wu | Jinqiao Wang | Chengqing Zong | Jiajun Zhang
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing: System Demonstrations
Empathetic interaction is a cornerstone of human-machine communication, due to the need for understanding speech enriched with paralinguistic cues and generating emotional and expressive responses. However, the most powerful empathetic LSLMs are increasingly closed off, leaving the crucial details about the architecture, data and development opaque to researchers. Given the critical need for transparent research into the LSLMs and empathetic behavior, we present OpenS2S, a fully open-source, transparent and end-to-end LSLM designed to enable empathetic speech interactions. Based on our empathetic speech-to-text model BLSP-Emo, OpenS2S further employs a streaming interleaved decoding architecture to achieve low-latency speech generation. To facilitate end-to-end training, OpenS2S incorporates an automated data construction pipeline that synthesizes diverse, high-quality empathetic speech dialogues at low cost. By leveraging large language models to generate empathetic content and controllable text-to-speech systems to introduce speaker and emotional variation, we construct a scalable training corpus with rich paralinguistic diversity and minimal human supervision. We release the fully open-source OpenS2S model, including the dataset, model weights, pre-training and fine-tuning codes, to empower the broader research community and accelerate innovation in empathetic speech systems.