What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence

Manodyna K H

What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence

Abstract

Field linguistics increasingly relies on computational tools to organize, analyze, and preserve linguistic data, yet the classificatory assumptions embedded in these tools are rarely examined. A pervasive assumption is that languages can be treated as discrete, genealogically defined units, with relatedness modeled as tree-structured descent. We argue that this assumption misrepresents linguistic evidence in contact-heavy regions and risks distorting the computational mediation of field linguistic data. Focusing on South Asia, we show that widely assumed boundaries—such as the Indo-Aryan–Dravidian divide—collapse in long-standing contact zones characterized by convergence, dialect continua, and institutional multilingualism. Through historically grounded case studies including Kannada–Telugu and Tamil–Malayalam, we demonstrate how convergence, script-mediated distance, and post-hoc standardization reshape how field data is segmented, compared, and interpreted when organized through genealogical labels. We argue that contact-aware, relational models of linguistic relatedness are necessary if NLP tools are to support, rather than distort, the documentation and analysis of linguistic diversity.

Anthology ID:: 2026.fieldmatters-1.2
Volume:: Proceedings of the Fifth Workshop on NLP Applications to Field Linguistics
Month:: March
Year:: 2026
Address:: Rabat, Morocco
Venues:: FieldMatters | WS
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 8–15
Language:
URL:: https://preview.aclanthology.org/manual-author-scripts/2026.fieldmatters-1.2/
DOI:
Bibkey:
Cite (ACL):: Manodyna K H. 2026. What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence. In Proceedings of the Fifth Workshop on NLP Applications to Field Linguistics, pages 8–15, Rabat, Morocco. Association for Computational Linguistics.
Cite (Informal):: What NLP Gets Wrong About Contact: Implications for Field Linguistic Evidence (K H, FieldMatters 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/manual-author-scripts/2026.fieldmatters-1.2.pdf

PDF Cite Search Fix data