Rank-Awareness and Angular Constraints: A New Perspective on Learning Sentence Embeddings from NLI Data

Zicheng Zhou, Min Huang, Qinghai Miao


Abstract
Learning high-quality sentence embeddings from Natural Language Inference (NLI) data is often challenged by a critical signal conflict between discrete labels and the continuous spectrum of semantic similarity, as well as information loss from discarded neutral sentence pairs during training. To address this, we introduce Rank-Awareness and Angular Optimization Embeddings (RAOE), a framework that leverages the full NLI dataset (Entailment, Neutral, Contradiction) augmented with pre-computed continuous similarity scores (S). RAOE employs a novel composite objective which features: (1) a Rank Margin objective that enforces rank consistency against S using an explicit margin, and (2) a Gated Angular objective that conditionally refines embedding geometry based on NLI label (L) and S score agreement. Extensive evaluations on STS tasks and the MTEB benchmark demonstrate RAOE’s effectiveness. Our general-purpose RAOE-S1 model (BERT-base) significantly outperforms strong baselines, achieving an average Spearman’s correlation of 85.11 (vs. SimCSE’s 81.57 and AnglE’s 82.43), and shows consistent improvements on MTEB. Further STS-specialized fine-tuning (RAOE-S2) establishes new state-of-the-art performance on STS (88.17 with BERT-base). These results confirm RAOE’s ability to efficiently learn robust and nuanced sentence representations through the synergy of rank-awareness and conditional angular constraints. Code is available at https://github.com/Shengjingwa/RAOE.
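The rank-consistency idea behind the Rank Margin objective can be illustrated with a generic pairwise hinge loss. The sketch below is an illustration under stated assumptions, not the authors' formulation: the function names `cosine` and `rank_margin_loss`, the default margin of 0.1, and the enumeration over all ordered pairs in a batch are hypothetical choices made here for clarity. The abstract states only that the objective enforces rank consistency against S using an explicit margin.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length, non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def rank_margin_loss(pred_sims, target_scores, margin=0.1):
    """Hypothetical pairwise hinge loss (an assumption, not the paper's loss).

    For every two sentence pairs (i, j) whose reference scores satisfy
    S_i > S_j, the model is penalized unless its predicted similarity for i
    exceeds its predicted similarity for j by at least `margin`.
    """
    total, count = 0.0, 0
    for i in range(len(pred_sims)):
        for j in range(len(pred_sims)):
            if target_scores[i] > target_scores[j]:
                total += max(0.0, margin - (pred_sims[i] - pred_sims[j]))
                count += 1
    # Average over the ordered pairs that carry a ranking constraint.
    return total / count if count else 0.0


# Predicted similarities that already respect the reference ranking incur no loss.
well_ranked = rank_margin_loss([0.9, 0.5, 0.1], [0.8, 0.4, 0.2])

# Predicted similarities in the reverse order are penalized.
mis_ranked = rank_margin_loss([0.1, 0.5, 0.9], [0.8, 0.4, 0.2])
```

In a training setup, `pred_sims` would come from cosine similarities between sentence embeddings and `target_scores` from the pre-computed scores S. How RAOE combines this term with the Gated Angular objective is not specified in the abstract and is left out of the sketch.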
Anthology ID:
2025.emnlp-main.1129
Volume:
Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:
November
Year:
2025
Address:
Suzhou, China
Editors:
Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:
EMNLP
Publisher:
Association for Computational Linguistics
Pages:
22206–22220
URL:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1129/
Cite (ACL):
Zicheng Zhou, Min Huang, and Qinghai Miao. 2025. Rank-Awareness and Angular Constraints: A New Perspective on Learning Sentence Embeddings from NLI Data. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 22206–22220, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):
Rank-Awareness and Angular Constraints: A New Perspective on Learning Sentence Embeddings from NLI Data (Zhou et al., EMNLP 2025)
PDF:
https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.1129.pdf
Checklist:
 2025.emnlp-main.1129.checklist.pdf