TLSA: LLM-Guided Text-Label Space Alignment with Contrastive Learning for Generalized Category Discovery
Wenxi Xu, Chuan Qin, Xi Chen, Chuyu Fang, Yuanchun Zhou, Hengshu Zhu
Abstract
Generalized Category Discovery (GCD) aims to classify data from partially labeled datasets by jointly recognizing known categories and discovering novel ones. Despite recent advances, existing methods still suffer from weak text-label alignment, inconsistent objectives across known and novel categories, and poor discrimination of semantically similar clusters. To mitigate these issues, we propose TLSA, a unified framework that enforces contrastive alignment between text and label representations within a shared semantic space. Specifically, we first design a label-semantic aware dual-encoder equipped with a symmetric contrastive objective to achieve text-label alignment. Then, we leverage LLM-based label induction to generate explicit and semantically meaningful names for previously unseen categories, followed by a graph-based refinement strategy that disambiguates semantically overlapping clusters through forced renaming. Finally, a confidence-aware sampling strategy ensures balanced learning across both easy and hard instances. Extensive experiments on four benchmark datasets show that TLSA consistently outperforms state-of-the-art GCD methods. The code is available at https://github.com/Wenxi-Xu/TLSA.
- Anthology ID:
- 2026.acl-long.869
- Volume:
- Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
- Month:
- July
- Year:
- 2026
- Address:
- San Diego, California, United States
- Editors:
- Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
- Venue:
- ACL
- Publisher:
- Association for Computational Linguistics
- Pages:
- 19030–19046
- URL:
- https://preview.aclanthology.org/ingest-acl/2026.acl-long.869/
- Cite (ACL):
- Wenxi Xu, Chuan Qin, Xi Chen, Chuyu Fang, Yuanchun Zhou, and Hengshu Zhu. 2026. TLSA: LLM-Guided Text-Label Space Alignment with Contrastive Learning for Generalized Category Discovery. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 19030–19046, San Diego, California, United States. Association for Computational Linguistics.
- Cite (Informal):
- TLSA: LLM-Guided Text-Label Space Alignment with Contrastive Learning for Generalized Category Discovery (Xu et al., ACL 2026)
- PDF:
- https://preview.aclanthology.org/ingest-acl/2026.acl-long.869.pdf
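
The abstract mentions a symmetric contrastive objective for text-label alignment but does not give its form. The sketch below illustrates one common way such an objective is written: a generic symmetric InfoNCE-style loss over paired text and label embeddings. It is an assumption for illustration, not the authors' implementation; the function name `symmetric_contrastive_loss`, the `temperature` parameter, and the pairing convention (row i of the texts matches row i of the labels, with distinct labels in a batch) are all hypothetical.

```python
import math


def _normalize(vec):
    """Scale a non-zero vector to unit length so dot products are cosines."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def _diagonal_cross_entropy(sim):
    """Mean of -log softmax(row_i)[i] over the rows of a square matrix."""
    total = 0.0
    for i, row in enumerate(sim):
        row_max = max(row)  # subtract the max for numerical stability
        log_z = row_max + math.log(sum(math.exp(v - row_max) for v in row))
        total += log_z - row[i]
    return total / len(sim)


def symmetric_contrastive_loss(text_emb, label_emb, temperature=0.1):
    """Generic symmetric InfoNCE loss (assumed form, not taken from the paper).

    text_emb[i] and label_emb[i] are treated as a positive pair; every other
    label in the batch acts as a negative for text i, and vice versa.
    """
    texts = [_normalize(v) for v in text_emb]
    labels = [_normalize(v) for v in label_emb]
    # Temperature-scaled cosine similarity between every text and every label.
    sim = [
        [sum(a * b for a, b in zip(t, l)) / temperature for l in labels]
        for t in texts
    ]
    sim_transposed = [list(col) for col in zip(*sim)]
    text_to_label = _diagonal_cross_entropy(sim)
    label_to_text = _diagonal_cross_entropy(sim_transposed)
    return 0.5 * (text_to_label + label_to_text)


if __name__ == "__main__":
    texts = [[1.0, 0.0], [0.0, 1.0]]
    aligned_labels = [[1.0, 0.0], [0.0, 1.0]]
    swapped_labels = [[0.0, 1.0], [1.0, 0.0]]
    # The aligned pairing should yield the smaller loss of the two.
    print(symmetric_contrastive_loss(texts, aligned_labels))
    print(symmetric_contrastive_loss(texts, swapped_labels))
```

Averaging the text-to-label and label-to-text directions is what makes the loss symmetric: both encoders of a dual-encoder setup receive a gradient signal from the same batch. A batch that contains several texts with the same label would need the duplicate labels treated as positives rather than negatives, which this sketch does not handle.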