Gina-Anne Levow

Also published as: Gina Levow


2023

pdf
Investigating Speaker Diarization of Endangered Language Data
Gina-Anne Levow
Proceedings of the Sixth Workshop on the Use of Computational Methods in the Study of Endangered Languages

2022

pdf
A Masked Segmental Language Model for Unsupervised Natural Language Segmentation
C.m. Downey | Fei Xia | Gina-Anne Levow | Shane Steinert-Threlkeld
Proceedings of the 19th SIGMORPHON Workshop on Computational Research in Phonetics, Phonology, and Morphology

We introduce a Masked Segmental Language Model (MSLM) for joint language modeling and unsupervised segmentation. While near-perfect supervised methods have been developed for segmenting human-like linguistic units in resource-rich languages such as Chinese, many of the world’s languages are both morphologically complex, and have no large dataset of “gold” segmentations for supervised training. Segmental Language Models offer a unique approach by conducting unsupervised segmentation as the byproduct of a neural language modeling objective. However, current SLMs are limited in their scalability due to their recurrent architecture. We propose a new type of SLM for use in both unsupervised and lightly supervised segmentation tasks. The MSLM is built on a span-masking transformer architecture, harnessing a masked bidirectional modeling context and attention, as well as adding the potential for model scalability. In a series of experiments, our model outperforms the segmentation quality of recurrent SLMs on Chinese, and performs similarly to the recurrent model on English.

2021

pdf bib
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue
Haizhou Li | Gina-Anne Levow | Zhou Yu | Chitralekha Gupta | Berrak Sisman | Siqi Cai | David Vandyke | Nina Dethlefs | Yan Wu | Junyi Jessy Li
Proceedings of the 22nd Annual Meeting of the Special Interest Group on Discourse and Dialogue

pdf
Prosody: Models, Methods, and Applications
Nigel Ward | Gina-Anne Levow
Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing: Tutorial Abstracts

Prosody is essential in human interaction, enabling people to show interest, establish rapport, efficiently convey nuances of attitude or intent, and so on. Some applications that exploit prosodic knowledge have recently shown superhuman performance, and in many respects our ability to effectively model prosody is rapidly advancing. This tutorial will overview the computational modeling of prosody, including recent advances and diverse actual and potential applications.

pdf
Developing a Shared Task for Speech Processing on Endangered Languages
Gina-Anne Levow | Emily Ahn | Emily M. Bender
Proceedings of the 4th Workshop on the Use of Computational Methods in the Study of Endangered Languages Volume 1 (Papers)

2018

pdf
Discovering Phonesthemes with Sparse Regularization
Nelson F. Liu | Gina-Anne Levow | Noah A. Smith
Proceedings of the Second Workshop on Subword/Character LEvel Models

We introduce a simple method for extracting non-arbitrary form-meaning representations from a collection of semantic vectors. We treat the problem as one of feature selection for a model trained to predict word vectors from subword features. We apply this model to the problem of automatically discovering phonesthemes, which are submorphemic sound clusters that appear in words with similar meaning. Many of our model-predicted phonesthemes overlap with those proposed in the linguistics literature, and we validate our approach with human judgments.

pdf
Automatic Identification of Basic-Level Categories
Chad Mills | Francis Bond | Gina-Anne Levow
Proceedings of the 9th Global Wordnet Conference

Basic-level categories have been shown to be both psychologically significant and useful in a wide range of practical applications. We build a rule-based system to identify basic-level categories in WordNet, achieving 77% accuracy on a test set derived from prior psychological experiments. With additional annotations we found our system also has low precision, in part due to the existence of many categories that do not fit into the three classes (superordinate, basic-level, and subordinate) relied on in basic-level category research.

2017

pdf
STREAMLInED Challenges: Aligning Research Interests with Shared Tasks
Gina-Anne Levow | Emily M. Bender | Patrick Littell | Kristen Howell | Shobhana Chelliah | Joshua Crowgey | Dan Garrette | Jeff Good | Sharon Hargus | David Inman | Michael Maxwell | Michael Tjalve | Fei Xia
Proceedings of the 2nd Workshop on the Use of Computational Methods in the Study of Endangered Languages

2015

pdf
CMILLS: Adapting Semantic Role Labeling Features to Dependency Parsing
Chad Mills | Gina-Anne Levow
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)

pdf
Analysis of Dysarthric Speech using Distinctive Feature Recognition
Ka Ho Wong | Yu Ting Yeung | Patrick C. M. Wong | Gina-Anne Levow | Helen Meng
Proceedings of SLPAT 2015: 6th Workshop on Speech and Language Processing for Assistive Technologies

2014

pdf bib
Proceedings of the Third CIPS-SIGHAN Joint Conference on Chinese Language Processing
Le Sun | Chengqing Zong | Min Zhang | Gina-Anne Levow
Proceedings of the Third CIPS-SIGHAN Joint Conference on Chinese Language Processing

2012

pdf
Bridging Gaps for Spoken Dialog System Frameworks in Instructional Settings
Gina-Anne Levow
NAACL-HLT Workshop on Future directions and needs in the Spoken Dialog Community: Tools and Data (SDCTD 2012)

2011

pdf
Contrasting Multi-Lingual Prosodic Cues to Predict Verbal Feedback for Rapport
Siwei Wang | Gina-Anne Levow
Proceedings of the 49th Annual Meeting of the Association for Computational Linguistics: Human Language Technologies

2009

pdf
Investigating Pitch Accent Recognition in Non-native Speech
Gina-Anne Levow
Proceedings of the ACL-IJCNLP 2009 Conference Short Papers

2008

pdf
Automatic Prosodic Labeling with Conditional Random Fields and Rich Acoustic Features
Gina-Anne Levow
Proceedings of the Third International Joint Conference on Natural Language Processing: Volume-I

pdf
Studying Discourse and Dialogue with SIDGrid
Gina-Anne Levow
Proceedings of the Third Workshop on Issues in Teaching Computational Linguistics

2007

pdf
SIDGRID: A Framework for Distributed and Integrated Multimodal Annotation and Archiving and and Analysis
Gina-Anne Levow | Bennett Bertenthal | Mark Hereld | Sarah Kenny | David McNeill | Michael Papka | Sonjia Waxmonsky
Proceedings of the 8th SIGdial Workshop on Discourse and Dialogue

pdf
Topic Segmentation with Hybrid Document Indexing
Irina Matveeva | Gina-Anne Levow
Proceedings of the 2007 Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning (EMNLP-CoNLL)

pdf
Hybrid Document Indexing with Spectral Embedding
Irina Matveeva | Gina-Anne Levow
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Companion Volume, Short Papers

pdf bib
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Tutorial Abstracts
Marti Hearst | Gina-Anne Levow | James Allan
Proceedings of the Human Language Technology Conference of the NAACL, Companion Volume: Tutorial Abstracts

2006

pdf
Unsupervised and Semi-supervised Learning of Tone and Pitch Accent
Gina-Anne Levow
Proceedings of the Human Language Technology Conference of the NAACL, Main Conference

pdf
Computing Term Translation Probabilities with Generalized Latent Semantic Analysis
Irina Matveeva | Gina-Anne Levow
Demonstrations

pdf
The Third International Chinese Language Processing Bakeoff: Word Segmentation and Named Entity Recognition
Gina-Anne Levow
Proceedings of the Fifth SIGHAN Workshop on Chinese Language Processing

pdf
Graph-based Generalized Latent Semantic Analysis for Document Representation
Irina Matveeva | Gina-Anne Levow
Proceedings of TextGraphs: the First Workshop on Graph Based Methods for Natural Language Processing

2005

pdf
Turn-taking in Mandarin Dialogue: Interactions of Tone and Intonation
Gina-Anne Levow
Proceedings of the Fourth SIGHAN Workshop on Chinese Language Processing

2004

pdf
Prosody-based Topic Segmentation for Mandarin Broadcast News
Gina-Anne Levow
Proceedings of HLT-NAACL 2004: Short Papers

pdf
Combining Prosodic and Text Features for Segmentation of Mandarin Broadcast News
Gina-Anne Levow
Proceedings of the Third SIGHAN Workshop on Chinese Language Processing

pdf
Prosodic Cues to Discourse Segment Boundaries in Human-Computer Dialogue
Gina-Anne Levow
Proceedings of the 5th SIGdial Workshop on Discourse and Dialogue at HLT-NAACL 2004

pdf
Assessing Prosodic and Text Features for Segmentation of Mandarin Broadcast News
Gina-Anne Levow
Proceedings of the Workshop on Interdisciplinary Approaches to Speech Indexing and Retrieval at HLT-NAACL 2004

2003

pdf
Issues in Pre- and Post-translation Document Expansion: Untranslatable Cognates and Missegmented Words
Gina-Anne Levow
Proceedings of the Sixth International Workshop on Information Retrieval with Asian Languages

pdf
Learning to Speak to a Spoken Language System: Vocabulary Convergence in Novice Users
Gina-Anne Levow
Proceedings of the Fourth SIGdial Workshop of Discourse and Dialogue

2001

pdf
Improved Cross-Language Retrieval using Backoff Translation
Philip Resnik | Douglas Oard | Gina Levow
Proceedings of the First International Conference on Human Language Technology Research

pdf
Mandarin-English Information: Investigating Translingual Speech Retrieval
Helen Meng | Berlin Chen | Sanjeev Khudanpur | Gina-Anne Levow | Wai-Kit Lo | Douglas Oard | Patrick Shone | Karen Tang | Hsin-Min Wang | Jianqiang Wang
Proceedings of the First International Conference on Human Language Technology Research

pdf
Rapidly Retargetable Interactive Translingual Retrieval
Gina-Anne Levow | Douglas W. Oard | Philip Resnik
Proceedings of the First International Conference on Human Language Technology Research

2000

pdf bib
Building a Chinese-English mapping between verb concepts for multilingual applications
Bonnie J. Dorr | Gina-Anne Levow | Dekang Lin
Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Technical Papers

This paper addresses the problem of building conceptual resources for multilingual applications. We describe new techniques for large-scale construction of a Chinese-English lexicon for verbs, using thematic-role information to create links between Chinese and English conceptual information. We then present an approach to compensating for gaps in the existing resources. The resulting lexicon is used for multilingual applications such as machine translation and cross-language information retrieval.

pdf
Chinese-English Semantic Resource Construction
Bonnie J. Dorr | Gina-Anne Levow | Dekang Lin | Scott Thomas
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)

pdf
Mandarin-English Information (MEI): Investigating Translingual Speech Retrieval
Helen Meng | Sanjeev Khudanpur | Gina Levow | Douglas W. Oard | Hsin-Min Wang
ANLP-NAACL 2000 Workshop: Embedded Machine Translation Systems

1999

pdf
Modeling the language assessment process and result: Proposed architecture for automatic oral proficiency assessment
Gina-Anne Levow | Mari Broman Olsen
Computer Mediated Language Assessment and Evaluation in Natural Language Processing

1998

pdf
Characterizing and Recognizing Spoken Corrections in Human-Computer Dialogue
Gina-Anne Levow
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 1

pdf
Characterizing and Recognizing Spoken Corrections in Human-Computer Dialogue
Gina-Anne Levow
COLING 1998 Volume 1: The 17th International Conference on Computational Linguistics