Jian Huang
2026
DOS: Dependency-Oriented Sampler for Masked Diffusion Language Models
Xueyu Zhou | Yangrong Hu | Jian Huang
Findings of the Association for Computational Linguistics: ACL 2026
Xueyu Zhou | Yangrong Hu | Jian Huang
Findings of the Association for Computational Linguistics: ACL 2026
Masked diffusion language models (MDLMs) have recently emerged as a new paradigm in language modeling, offering flexible generation dynamics and enabling efficient parallel decoding. However, existing decoding strategies for pre-trained MDLMs predominantly rely on token-level uncertainty criteria, while largely overlooking sequence-level information and inter-token dependencies. To address this limitation, we propose Dependency-Oriented Sampler (DOS), a training-free decoding strategy that leverages inter-token dependencies to inform token updates during generation. Specifically, DOS exploits attention matrices from transformer blocks to approximate inter-token dependencies, emphasizing information from unmasked tokens when updating masked positions. Empirical results demonstrate that DOS consistently achieves superior performance on both code generation and mathematical reasoning tasks. Moreover, DOS can be seamlessly integrated with existing parallel sampling methods, leading to improved generation efficiency without sacrificing generation quality.
2010
SEERLAB: A System for Extracting Keyphrases from Scholarly Documents
Pucktada Treeratpituk | Pradeep Teregowda | Jian Huang | C. Lee Giles
Proceedings of the 5th International Workshop on Semantic Evaluation
Pucktada Treeratpituk | Pradeep Teregowda | Jian Huang | C. Lee Giles
Proceedings of the 5th International Workshop on Semantic Evaluation
Enhancing Cross Document Coreference of Web Documents with Context Similarity and Very Large Scale Text Categorization
Jian Huang | Pucktada Treeratpituk | Sarah Taylor | C. Lee Giles
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
Jian Huang | Pucktada Treeratpituk | Sarah Taylor | C. Lee Giles
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)
2009
Solving the “Who’s Mark Johnson Puzzle”: Information Extraction Based Cross Document Coreference
Jian Huang | Sarah M. Taylor | Jonathan L. Smith | Konstantinos A. Fotiadis | C. Lee Giles
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium
Jian Huang | Sarah M. Taylor | Jonathan L. Smith | Konstantinos A. Fotiadis | C. Lee Giles
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics, Companion Volume: Student Research Workshop and Doctoral Consortium
Profile Based Cross-Document Coreference Using Kernelized Fuzzy Relational Clustering
Jian Huang | Sarah M. Taylor | Jonathan L. Smith | Konstantinos A. Fotiadis | C. Lee Giles
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP
Jian Huang | Sarah M. Taylor | Jonathan L. Smith | Konstantinos A. Fotiadis | C. Lee Giles
Proceedings of the Joint Conference of the 47th Annual Meeting of the ACL and the 4th International Joint Conference on Natural Language Processing of the AFNLP