Miao Fan
2026
SADA: Bridging In-Context Learning and Fine-Tuning via State-Aligned Distillation Adapters
Wenhao Gao | Tianlong Wang | Wei Jia | Linhao Zhang | Aiwei Liu | Miao Fan | Zhou Xiao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Wenhao Gao | Tianlong Wang | Wei Jia | Linhao Zhang | Aiwei Liu | Miao Fan | Zhou Xiao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Prompt-based in-context learning (ICL) and parameter fine-tuning are two dominant paradigms for incorporating external information into large language models (LLMs), but they incur high inference costs or require expensive retraining. To bridge this gap, context-to-parameter mapping converts prompts into temporary adapter weights. However, we identify a critical failure mode in existing methods: *hidden-state collapse*, where the adapter-augmented model’s internal states diverge sharply from the full-context oracle in deeper layers. We trace this failure to two coupled gaps: suboptimal **Input-Selection** and inadequate **Supervision-Signal**. To address these issues, we propose SADA (**S**tate-**A**ligned **D**istillation **A**dapters). We establish the *attention-block output* as a principled feature interface to improve input selection and introduce *state-alignment distillation* to enforce consistency between the adapter-augmented model and the full-context oracle. Experiments on long-context language modeling (PG19) and downstream NLU and summarization benchmarks show that SADA consistently outperforms strong baselines like *StreamAdapter* and *GenerativeAdapter*, achieving performance comparable to ICL while significantly reducing memory footprint and latency. We further analyze when parameterized context compression is effective and when explicit context retention remains preferable. Our code is available at [https://github.com/Taylor-Gavel/SADA.git](https://github.com/Taylor-Gavel/SADA.git).
2019
Reinforced Product Metadata Selection for Helpfulness Assessment of Customer Reviews
Miao Fan | Chao Feng | Mingming Sun | Ping Li
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
Miao Fan | Chao Feng | Mingming Sun | Ping Li
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)
To automatically assess the helpfulness of a customer review online, conventional approaches generally acquire various linguistic and neural embedding features solely from the textual content of the review itself as the evidence. We, however, find out that a helpful review is largely concerned with the metadata (such as the name, the brand, the category, etc.) of its target product. It leaves us with a challenge of how to choose the correct key-value product metadata to help appraise the helpfulness of free-text reviews more precisely. To address this problem, we propose a novel framework composed of two mutual-benefit modules. Given a product, a selector (agent) learns from both the keys in the product metadata and one of its reviews to take an action that selects the correct value, and a successive predictor (network) makes the free-text review attend to this value to obtain better neural representations for helpfulness assessment. The predictor is directly optimized by SGD with the loss of helpfulness prediction, and the selector could be updated via policy gradient rewarded with the performance of the predictor. We use two real-world datasets from Amazon.com and Yelp.com, respectively, to compare the performance of our framework with other mainstream methods under two application scenarios: helpfulness identification and regression of customer reviews. Extensive results demonstrate that our framework can achieve state-of-the-art performance with substantial improvements.
2015
Distant Supervision for Entity Linking
Miao Fan | Qiang Zhou | Thomas Fang Zheng
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation
Miao Fan | Qiang Zhou | Thomas Fang Zheng
Proceedings of the 29th Pacific Asia Conference on Language, Information and Computation
Jointly Embedding Relations and Mentions for Knowledge Population
Miao Fan | Kai Cao | Yifan He | Ralph Grishman
Proceedings of the International Conference Recent Advances in Natural Language Processing
Miao Fan | Kai Cao | Yifan He | Ralph Grishman
Proceedings of the International Conference Recent Advances in Natural Language Processing
Improving Event Detection with Active Learning
Kai Cao | Xiang Li | Miao Fan | Ralph Grishman
Proceedings of the International Conference Recent Advances in Natural Language Processing
Kai Cao | Xiang Li | Miao Fan | Ralph Grishman
Proceedings of the International Conference Recent Advances in Natural Language Processing
2014
Transition-based Knowledge Graph Embedding with Relational Mapping Properties
Miao Fan | Qiang Zhou | Emily Chang | Thomas Fang Zheng
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing
Miao Fan | Qiang Zhou | Emily Chang | Thomas Fang Zheng
Proceedings of the 28th Pacific Asia Conference on Language, Information and Computing
Distant Supervision for Relation Extraction with Matrix Completion
Miao Fan | Deli Zhao | Qiang Zhou | Zhiyuan Liu | Thomas Fang Zheng | Edward Y. Chang
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Miao Fan | Deli Zhao | Qiang Zhou | Zhiyuan Liu | Thomas Fang Zheng | Edward Y. Chang
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)