@inproceedings{cooper-etal-2025-lies,
    title = "The Lies Characters Tell: Utilizing Large Language Models to Normalize Adversarial {Unicode} Perturbations",
    author = "Cooper, Portia and
      Blanco, Eduardo and
      Surdeanu, Mihai",
    editor = "Che, Wanxiang and
      Nabende, Joyce and
      Shutova, Ekaterina and
      Pilehvar, Mohammad Taher",
    booktitle = "Findings of the Association for Computational Linguistics: ACL 2025",
    month = jul,
    year = "2025",
    address = "Vienna, Austria",
    publisher = "Association for Computational Linguistics",
    url = "https://aclanthology.org/2025.findings-acl.969/",
    doi = "10.18653/v1/2025.findings-acl.969",
    pages = "18932--18944",
    isbn = "979-8-89176-256-5",
    abstract = "Homoglyphs, Unicode characters that are visually homogeneous to Latin letters, are widely used to mask offensive content. Dynamic strategies are needed to combat homoglyphs as the Unicode library is ever-expanding and new substitution possibilities for Latin letters continuously emerge. The present study investigated two novel mitigation approaches that do not rely on strict mappings but instead harness the power of large language models to neutralize both known and unknown homoglyphs: (1) indirectly normalizing homoglyphs by replacing non-Latin characters with a delimiter and prompting large language models to ``fill in the blanks'' and (2) directly normalizing homoglyphs by using large language models to determine which characters should be replaced with Latin letters. We found that GPT-4o-mini constructed normalized text with an average cosine similarity score of 0.91 to the original tweets when applying our indirect method and 0.96 to the original tweets when applying our direct method. This study indicates that large language model-based normalization techniques can effectively unmask offensive content concealed by homoglyphs. Code and data are available in our GitHub repository: https://github.com/pcoopercoder/The-Lies-Characters-Tell."
}
Markdown (Informal)
[The Lies Characters Tell: Utilizing Large Language Models to Normalize Adversarial Unicode Perturbations](https://aclanthology.org/2025.findings-acl.969/) (Cooper et al., Findings 2025)
ACL