Spectral Gravity Formant Estimation for Phonetic Segmentation

Michael S. Yantosca; Albert M. K. Cheng

Spectral Gravity Formant Estimation for Phonetic Segmentation

Abstract

Recent automated transcription systems have focused on end-to-end orthographic approaches driven by deep neural networks and sequence-to-sequence transformers. Growing public interest in transcription at the phonemic or phonetic level has led to re-purposing these systems to segment and identify phones, the basic sounds which comprise human speech. However, they miss the mark on a fundamental component of time-series analysis, namely time. For linguistic applications which require high fidelity in the temporal domain, the loss of timing information is untenable. Our work proposes a deadline-bounded expectation maximization (EM) algorithm with a novel initialization method to estimate formants, i.e., salient speech frequencies, for enhanced phonetic segmentation. Based on the concept of spectral gravity, i.e., treating spectral energy as mass attenuated by the square of frequency distance across the spectrum, our technique outperforms the recent state of the art on key clustering metrics, generating reasonable alignments across multiple languages with no a priori training.

Anthology ID:: 2026.findings-acl.1775
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 35639–35652
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1775/
DOI:
Bibkey:
Cite (ACL):: Michael S. Yantosca and Albert M. K. Cheng. 2026. Spectral Gravity Formant Estimation for Phonetic Segmentation. In Findings of the Association for Computational Linguistics: ACL 2026, pages 35639–35652, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: Spectral Gravity Formant Estimation for Phonetic Segmentation (Yantosca & Cheng, Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1775.pdf
Checklist:: 2026.findings-acl.1775.checklist.pdf

PDF Cite Search Checklist Fix data