Yiwei Wang
Other people with similar names: Yiwei Wang
Unverified author pages with similar names: Yiwei Wang
2026
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models
Hengyuan Zhang | Zhihao Zhang | Ercong Nie | Mingyang Wang | Zunhai Su | Yiwei Wang | Qianli Wang | Shuzhou Yuan | Xufeng Duan | Qibo Xue | Zeping Yu | Chenming Shang | Xiao Liang | Jing Xiong | Hui Shen | Chaofan Tao | Zhengwu Liu | Senjie Jin | Zhiheng Xi | Dongdong Zhang | Sophia Ananiadou | Tao Gui | Ruobing Xie | Hayden Kwok-Hay So | Hinrich Schuetze | Xuanjing Huang | Qi Zhang | Ngai Wong
Findings of the Association for Computational Linguistics: ACL 2026
Hengyuan Zhang | Zhihao Zhang | Ercong Nie | Mingyang Wang | Zunhai Su | Yiwei Wang | Qianli Wang | Shuzhou Yuan | Xufeng Duan | Qibo Xue | Zeping Yu | Chenming Shang | Xiao Liang | Jing Xiong | Hui Shen | Chaofan Tao | Zhengwu Liu | Senjie Jin | Zhiheng Xi | Dongdong Zhang | Sophia Ananiadou | Tao Gui | Ruobing Xie | Hayden Kwok-Hay So | Hinrich Schuetze | Xuanjing Huang | Qi Zhang | Ngai Wong
Findings of the Association for Computational Linguistics: ACL 2026
Mechanistic Interpretability (MI) has emerged as a vital approach to demystify the opaque decision-making of Large Language Models (LLMs). However, existing reviews primarily treat MI as an observational science, summarizing analytical insights while lacking a systematic framework for actionable intervention. To bridge this gap, we present a practical survey structured around the pipeline: "Locate, Steer, and Improve." We formally categorize Localizing (diagnosis) and Steering (intervention) methods based on specific Interpretable Objects to establish a rigorous intervention protocol. Furthermore, we demonstrate how this framework enables tangible improvements in Alignment, Capability, and Efficiency, effectively operationalizing MI as a practical engineering toolkit for model optimization. The curated paper list of this work is available at https://anonymous.4open.science/r/Act-MI-F068.