Quantifying Social Sentiment in Hostels Using A Domain-Specific Transformer Pipeline

Ian W. McMurry


Abstract
This paper presents a domain-specific transformer pipeline for quantifying social atmosphere in hostel reviews, an experiential dimension that travelers consistently prioritize but that existing NLP methods and booking platforms fail to capture. We train a cross-encoder on 4,994 manually annotated reviews and use it to pseudo-label 162,840 additional reviews; these labels are then distilled into a sentence-transformer bi-encoder, producing embeddings where proximity reflects social interaction level rather than generic sentiment. On held-out human-labeled data, the domain-adapted embeddings achieve F1 = 0.826, outperforming generic sentence embeddings (0.671) and zero-shot GPT-4o (0.774), with a 40-fold improvement in intra-class versus inter-class similarity. Aggregating predictions to the property level reveals that hostel socialness follows an approximate exponential distribution, confirming that highly social hostels are rare. This work formalizes socialness as a measurable semantic construct and provides a general template for extracting implicit experiential attributes from text at scale.
Anthology ID:
2026.wassa-1.3
Volume:
The Proceedings for the 15th Workshop on Computational Approaches to Subjectivity, Sentiment Social Media Analysis (WASSA 2026)
Month:
March
Year:
2026
Address:
Rabat, Morocco
Editors:
Jeremy Barnes, Valentin Barriere, Orphée De Clercq, Roman Klinger, Célia Nouri, Debora Nozza, Pranaydeep Singh
Venues:
WASSA | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
24–36
Language:
URL:
https://preview.aclanthology.org/ingest-eacl/2026.wassa-1.3/
DOI:
Bibkey:
Cite (ACL):
Ian W. McMurry. 2026. Quantifying Social Sentiment in Hostels Using A Domain-Specific Transformer Pipeline. In The Proceedings for the 15th Workshop on Computational Approaches to Subjectivity, Sentiment Social Media Analysis (WASSA 2026), pages 24–36, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):
Quantifying Social Sentiment in Hostels Using A Domain-Specific Transformer Pipeline (McMurry, WASSA 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-eacl/2026.wassa-1.3.pdf