Yi Hu
2026
SubTokenTest: A Practical Benchmark for Real-World Sub-token Understanding
Shuyang Hou | Yi Hu | Muhan Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Shuyang Hou | Yi Hu | Muhan Zhang
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Recent advancements in large language models (LLMs) have significantly enhanced their reasoning capabilities. However, they continue to struggle with basic character-level tasks, such as counting letters in words—a problem rooted in their tokenization process. While existing benchmarks have highlighted this weakness through basic character operations, such failures are often dismissed due to lacking practical relevance. Yet, many real-world applications, such as navigating text-based maps or interpreting structured tables, rely heavily on precise sub-token understanding. In this regard, we introduce SubTokenTest, a comprehensive benchmark that assesses sub-token understanding through **practical, utility-driven** tasks. Our benchmark includes ten tasks across four domains and isolates tokenization-related failures by decoupling performance from complex reasoning. We provide a comprehensive evaluation of nine advanced LLMs. Additionally, we investigate the impact of test-time scaling on sub-token reasoning and explore how character-level information is encoded within the hidden states.
Towards a Mechanistic Understanding of Large Reasoning Models: A Survey of Training, Inference, and Failures
Yi Hu | Jiaqi Gu | Ruxin Wang | Zijun Yao | Hao Peng | Xiaobao Wu | Jianhui Chen | Muhan Zhang | Liangming Pan
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Yi Hu | Jiaqi Gu | Ruxin Wang | Zijun Yao | Hao Peng | Xiaobao Wu | Jianhui Chen | Muhan Zhang | Liangming Pan
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Reinforcement learning (RL) has catalyzed the emergence of Large Reasoning Models (LRMs) that have pushed reasoning capabilities to new heights. While their performance has garnered significant excitement, exploring the internal mechanisms driving these behaviors has become an equally critical research frontier. This paper provides a comprehensive survey of the mechanistic understanding of LRMs, organizing recent findings into three core dimensions: 1) training dynamics, 2) reasoning mechanisms, and 3) unintended behaviors. By synthesizing these insights, we aim to bridge the gap between black-box performance and mechanistic transparency. Finally, we discuss under-explored challenges to outline a roadmap for future mechanistic studies, including the need for applied interpretability, improved methodologies, and a unified theoretical framework.
2023
DeRisk: An Effective Deep Learning Framework for Credit Risk Prediction over Real-World Financial Data
Yancheng Liang | Jiajie Zhang | Hui Li | Xiaochen Liu | Yi Hu | Yong Wu | Jinyao Zhang | Yongyan Liu | Yi Wu
Proceedings of the Fifth Workshop on Financial Technology and Natural Language Processing and the Second Multimodal AI For Financial Forecasting
Yancheng Liang | Jiajie Zhang | Hui Li | Xiaochen Liu | Yi Hu | Yong Wu | Jinyao Zhang | Yongyan Liu | Yi Wu
Proceedings of the Fifth Workshop on Financial Technology and Natural Language Processing and the Second Multimodal AI For Financial Forecasting
2007
Using a Generative Model for Sentiment Analysis
Yi Hu | Ruzhan Lu | Yuquan Chen | Jianyong Duan
International Journal of Computational Linguistics & Chinese Language Processing, Volume 12, Number 2, June 2007
Yi Hu | Ruzhan Lu | Yuquan Chen | Jianyong Duan
International Journal of Computational Linguistics & Chinese Language Processing, Volume 12, Number 2, June 2007
2006
A Bio-Inspired Approach for Multi-Word Expression Extraction
Jianyong Duan | Ruzhan Lu | Weilin Wu | Yi Hu | Yan Tian
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions
Jianyong Duan | Ruzhan Lu | Weilin Wu | Yi Hu | Yan Tian
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions