Mapping Semantic Domains Across India’s Social Media: Networks, Geography, and Social Factors

Gunjan Anand, Jonathan Dunn


Abstract
This study examines socially-conditioned variation within semantic domains like kinship and weather using thirteen Indian cities as a case-study. Using bilingual social media data, we infer six semantic domains from corpora representing individual cities with a lexicon including terms from English, Hindi and Transliterated Hindi. The process of inferring semantic domains uses character-based embeddings to retrieve nearest neighbors and Jaccard similarity to operationalize the edge weights between lexical items within each domain. These representations reveal distinct regional variation across all six domains. We then examine the relationship between variation in semantic domains and external social factors such as literacy rates and local demographics. The results show that semantic domains exhibit systematic influences from sociolinguistic factors, a finding that has significant implications for the idea that semantic domains can be studied as abstractions distinct from specific speech communities.
Anthology ID:
2025.iwcs-1.28
Volume:
Proceedings of the 16th International Conference on Computational Semantics
Month:
September
Year:
2025
Address:
Düsseldorf, Germany
Editors:
Kilian Evang, Laura Kallmeyer, Sylvain Pogodalla
Venues:
IWCS | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
317–330
Language:
URL:
https://preview.aclanthology.org/iwcs-25-ingestion/2025.iwcs-1.28/
DOI:
Bibkey:
Cite (ACL):
Gunjan Anand and Jonathan Dunn. 2025. Mapping Semantic Domains Across India’s Social Media: Networks, Geography, and Social Factors. In Proceedings of the 16th International Conference on Computational Semantics, pages 317–330, Düsseldorf, Germany. Association for Computational Linguistics.
Cite (Informal):
Mapping Semantic Domains Across India’s Social Media: Networks, Geography, and Social Factors (Anand & Dunn, IWCS 2025)
Copy Citation:
PDF:
https://preview.aclanthology.org/iwcs-25-ingestion/2025.iwcs-1.28.pdf