TLSA: LLM-Guided Text-Label Space Alignment with Contrastive Learning for Generalized Category Discovery

Wenxi Xu, Chuan Qin, Xi Chen, Chuyu Fang, Yuanchun Zhou, Hengshu Zhu


Abstract
Generalized Category Discovery (GCD) aims to classify data from partially labeled datasets by jointly recognizing known categories and discovering novel ones. Despite recent advances, existing methods still suffer from weak text–label alignment, inconsistent objectives across known and novel categories, and poor discrimination of semantically similar clusters. To mitigate these issues, we propose TLSA, a unified framework that enforces contrastive alignment between text and label representations within a shared semantic space. Specifically, we first design a label-semantic aware dual-encoder equipped with a symmetric contrastive objective to achieve text-label alignment. Then, we leverage LLM-based label induction to generate explicit and semantically meaningful names for previously unseen categories, followed by a graph-based refinement strategy that disambiguates semantically overlapping clusters through forced renaming. Finally, a confidence-aware sampling strategy ensures balanced learning across both easy and hard instances. Extensive experiments on four benchmark datasets show that TLSA consistently outperforms state-of-the-art GCD methods. The code is available at https://github.com/Wenxi-Xu/TLSA.
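For readers unfamiliar with the term, the following is a minimal sketch of what a symmetric contrastive objective between text and label embeddings can look like. It is not taken from the TLSA code base: the function names, the temperature value, and the assumption that row i of `text_emb` is paired with row i of `label_emb` are all illustrative choices, and the paper's actual loss may differ (for example, in how texts sharing a label within a batch are handled).

```python
import math


def _normalize(vec):
    """Scale a vector to unit length (leave all-zero vectors unchanged)."""
    norm = math.sqrt(sum(x * x for x in vec)) or 1.0
    return [x / norm for x in vec]


def _diagonal_cross_entropy(logits):
    """Mean cross-entropy where the correct column for row i is column i."""
    total = 0.0
    for i, row in enumerate(logits):
        m = max(row)  # subtract the max for a numerically stable log-sum-exp
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[i]
    return total / len(logits)


def symmetric_contrastive_loss(text_emb, label_emb, temperature=0.1):
    """Hypothetical symmetric InfoNCE-style loss over paired embeddings.

    Assumption: text_emb[i] and label_emb[i] form a positive pair, and every
    other label in the batch acts as a negative for text i (and vice versa).
    """
    texts = [_normalize(v) for v in text_emb]
    labels = [_normalize(v) for v in label_emb]
    # Cosine similarity matrix scaled by the temperature.
    logits = [
        [sum(a * b for a, b in zip(t, l)) / temperature for l in labels]
        for t in texts
    ]
    logits_t = [list(col) for col in zip(*logits)]  # label-to-text direction
    text_to_label = _diagonal_cross_entropy(logits)
    label_to_text = _diagonal_cross_entropy(logits_t)
    return 0.5 * (text_to_label + label_to_text)


if __name__ == "__main__":
    aligned = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                         [[1.0, 0.0], [0.0, 1.0]])
    swapped = symmetric_contrastive_loss([[1.0, 0.0], [0.0, 1.0]],
                                         [[0.0, 1.0], [1.0, 0.0]])
    print(f"aligned pairs: {aligned:.5f}, swapped pairs: {swapped:.5f}")
```

Averaging the text-to-label and label-to-text terms is what makes the objective symmetric: each text is pulled toward its own label embedding, and each label embedding is pulled toward its own text, so neither side of the shared space is treated as a fixed anchor.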
Anthology ID: 2026.acl-long.869
Volume: Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Month: July
Year: 2026
Address: San Diego, California, United States
Editors: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue: ACL
Publisher: Association for Computational Linguistics
Pages: 19030–19046
URL: https://preview.aclanthology.org/ingest-acl/2026.acl-long.869/
Cite (ACL): Wenxi Xu, Chuan Qin, Xi Chen, Chuyu Fang, Yuanchun Zhou, and Hengshu Zhu. 2026. TLSA: LLM-Guided Text-Label Space Alignment with Contrastive Learning for Generalized Category Discovery. In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 19030–19046, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal): TLSA: LLM-Guided Text-Label Space Alignment with Contrastive Learning for Generalized Category Discovery (Xu et al., ACL 2026)
PDF: https://preview.aclanthology.org/ingest-acl/2026.acl-long.869.pdf
Checklist: 2026.acl-long.869.checklist.pdf