Joyce Chai

Also published as: Joyce Y. Chai, Joyce Yue Chai


2022

Learning to Mediate Disparities Towards Pragmatic Communication
Yuwei Bao | Sayan Ghosh | Joyce Chai
Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Human communication is a collaborative process. Speakers, on top of conveying their own intent, adjust the content and language expressions by taking the listeners into account, including their knowledge background, personalities, and physical capabilities. Towards building AI agents with similar abilities in language communication, we propose a novel rational reasoning framework, Pragmatic Rational Speaker (PRS), where the speaker attempts to learn the speaker-listener disparity and adjust the speech accordingly, by adding a lightweight disparity adjustment layer into working memory on top of the speaker’s long-term memory system. By fixing the long-term memory, the PRS only needs to update its working memory to learn and adapt to different types of listeners. To validate our framework, we create a dataset that simulates different types of speaker-listener disparities in the context of referential games. Our empirical results demonstrate that the PRS is able to shift its output towards the language that listeners are able to understand, significantly improve the collaborative task outcome, and learn the disparity more efficiently than joint training.

2021

Hierarchical Task Learning from Language Instructions with Unified Transformers and Self-Monitoring
Yichi Zhang | Joyce Chai
Findings of the Association for Computational Linguistics: ACL-IJCNLP 2021

Beyond the Tip of the Iceberg: Assessing Coherence of Text Classifiers
Shane Storks | Joyce Chai
Findings of the Association for Computational Linguistics: EMNLP 2021

As large-scale, pre-trained language models achieve human-level and superhuman accuracy on existing language understanding tasks, statistical bias in benchmark data and probing studies have recently called into question their true capabilities. For a more informative evaluation than accuracy on text classification tasks can offer, we propose evaluating systems through a novel measure of prediction coherence. We apply our framework to two existing language understanding benchmarks with different properties to demonstrate its versatility. Our experimental results show that this evaluation framework, although simple in ideas and implementation, is a quick, effective, and versatile measure to provide insight into the coherence of machines’ predictions.

Tiered Reasoning for Intuitive Physics: Toward Verifiable Commonsense Language Understanding
Shane Storks | Qiaozi Gao | Yichi Zhang | Joyce Chai
Findings of the Association for Computational Linguistics: EMNLP 2021

Large-scale, pre-trained language models (LMs) have achieved human-level performance on a breadth of language understanding tasks. However, evaluations only based on end task performance shed little light on machines’ true ability in language understanding and reasoning. In this paper, we highlight the importance of evaluating the underlying reasoning process in addition to end performance. Toward this goal, we introduce Tiered Reasoning for Intuitive Physics (TRIP), a novel commonsense reasoning dataset with dense annotations that enable multi-tiered evaluation of machines’ reasoning process. Our empirical results show that while large LMs can achieve high end performance, they struggle to support their predictions with valid supporting evidence. The TRIP dataset and our baseline results will motivate verifiable evaluation of commonsense reasoning and facilitate future research toward developing better language understanding and reasoning models.

MindCraft: Theory of Mind Modeling for Situated Dialogue in Collaborative Tasks
Cristian-Paul Bara | Sky CH-Wang | Joyce Chai
Proceedings of the 2021 Conference on Empirical Methods in Natural Language Processing

An ideal integration of autonomous agents in a human world implies that they are able to collaborate on human terms. In particular, theory of mind plays an important role in maintaining common ground during human collaboration and communication. To enable theory of mind modeling in situated interactions, we introduce a fine-grained dataset of collaborative tasks performed by pairs of human subjects in the 3D virtual blocks world of Minecraft. It provides information that captures partners’ beliefs of the world and of each other as an interaction unfolds, bringing abundant opportunities to study human collaborative behaviors in situated language communication. As a first step towards our goal of developing embodied AI agents able to infer belief states of collaborative partners in situ, we build and present results on computational models for several theory of mind tasks.

Zero-Shot Compositional Concept Learning
Guangyue Xu | Parisa Kordjamshidi | Joyce Chai
Proceedings of the 1st Workshop on Meta Learning and Its Applications to Natural Language Processing

In this paper, we study the problem of recognizing compositional attribute-object concepts within the zero-shot learning (ZSL) framework. We propose an episode-based cross-attention (EpiCA) network that combines the merits of a cross-attention mechanism and an episode-based training strategy to recognize novel compositional concepts. First, EpiCA uses cross-attention to correlate concept-visual information and a gated pooling layer to build contextualized representations for both images and concepts. The updated representations are used for a more in-depth multi-modal relevance calculation for concept recognition. Second, a two-phase episode training strategy, in particular its transductive phase, is adopted to utilize unlabeled test examples and alleviate the low-resource learning problem. Experiments on two widely used zero-shot compositional learning (ZSCL) benchmarks demonstrate the effectiveness of the model compared with recent approaches in both conventional and generalized ZSCL settings.

2020

Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics
Dan Jurafsky | Joyce Chai | Natalie Schluter | Joel Tetreault
Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics

Experience Grounds Language
Yonatan Bisk | Ari Holtzman | Jesse Thomason | Jacob Andreas | Yoshua Bengio | Joyce Chai | Mirella Lapata | Angeliki Lazaridou | Jonathan May | Aleksandr Nisnevich | Nicolas Pinto | Joseph Turian
Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing (EMNLP)

Language understanding research is held back by a failure to relate language to the physical world it describes and to the social interactions it facilitates. Despite the incredible effectiveness of language processing models to tackle tasks after being trained on text alone, successful linguistic communication relies on a shared experience of the world. It is this shared experience that makes utterances meaningful. Natural language processing is a diverse field, and progress throughout its development has come from new representational theories, modeling techniques, data collection paradigms, and tasks. We posit that the present success of representation learning approaches trained on large, text-only corpora requires the parallel tradition of research on the broader physical and social context of language to address the deeper questions of communication.

2018

Commonsense Justification for Action Explanation
Shaohua Yang | Qiaozi Gao | Sari Sadiya | Joyce Chai
Proceedings of the 2018 Conference on Empirical Methods in Natural Language Processing

To enable collaboration and communication between humans and agents, this paper investigates learning to acquire commonsense evidence for action justification. In particular, we have developed an approach based on the generative Conditional Variational Autoencoder (CVAE) that models object relations/attributes of the world as latent variables and jointly learns a performer that predicts actions and an explainer that gathers commonsense evidence to justify the action. Our empirical results have shown that, compared to a typical attention-based model, CVAE achieves significantly higher performance in both action prediction and justification. A human subject study further shows that the commonsense evidence gathered by CVAE can be communicated to humans to achieve a significantly higher common ground between humans and agents.

What Action Causes This? Towards Naive Physical Action-Effect Prediction
Qiaozi Gao | Shaohua Yang | Joyce Chai | Lucy Vanderwende
Proceedings of the 56th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Despite recent advances in knowledge representation, automated reasoning, and machine learning, artificial agents still lack the ability to understand basic action-effect relations regarding the physical world, for example, the action of cutting a cucumber most likely leads to the state where the cucumber is broken apart into smaller pieces. If artificial agents (e.g., robots) ever become our partners in joint tasks, it is critical to empower them with such action-effect understanding so that they can reason about the state of the world and plan for actions. Towards this goal, this paper introduces a new task on naive physical action-effect prediction, which addresses the relations between concrete actions (expressed in the form of verb-noun pairs) and their effects on the state of the physical world as depicted by images. We collected a dataset for this task and developed an approach that harnesses web image data through distant supervision to facilitate learning for action-effect prediction. Our empirical results have shown that web data can be used to complement a small number of seed examples (e.g., three examples for each action) for model learning. This opens up possibilities for agents to learn physical action-effect relations for tasks at hand through communication with humans with a few examples.

2017

Interactive Learning of Grounded Verb Semantics towards Human-Robot Communication
Lanbo She | Joyce Chai
Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

To enable human-robot communication and collaboration, previous works represent grounded verb semantics as the potential change of state to the physical world caused by these verbs. Grounded verb semantics are acquired mainly based on the parallel data of the use of a verb phrase and its corresponding sequences of primitive actions demonstrated by humans. The rich interaction between teachers and students that is considered important in learning new skills has not yet been explored. To address this limitation, this paper presents a new interactive learning approach that allows robots to proactively engage in interaction with human partners by asking good questions to learn models for grounded verb semantics. The proposed approach uses reinforcement learning to allow the robot to acquire an optimal policy for its question-asking behaviors by maximizing the long-term reward. Our empirical results have shown that the interactive learning approach leads to more reliable models for grounded verb semantics, especially in the noisy environment which is full of uncertainties. Compared to previous work, the models acquired from interactive learning result in a 48% to 145% performance gain when applied in new situations.

2016

Incremental Acquisition of Verb Hypothesis Space towards Physical World Interaction
Lanbo She | Joyce Chai
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Physical Causality of Action Verbs in Grounded Language Understanding
Qiaozi Gao | Malcolm Doering | Shaohua Yang | Joyce Chai
Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)

Grounded Semantic Role Labeling
Shaohua Yang | Qiaozi Gao | Changsong Liu | Caiming Xiong | Song-Chun Zhu | Joyce Y. Chai
Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Jointly Learning Grounded Task Structures from Language Instruction and Visual Demonstration
Changsong Liu | Shaohua Yang | Sari Saba-Sadiya | Nishant Shukla | Yunzhong He | Song-Chun Zhu | Joyce Chai
Proceedings of the 2016 Conference on Empirical Methods in Natural Language Processing

2015

Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies
Rada Mihalcea | Joyce Chai | Anoop Sarkar
Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

2014

Probabilistic Labeling for Efficient Referential Grounding based on Collaborative Discourse
Changsong Liu | Lanbo She | Rui Fang | Joyce Y. Chai
Proceedings of the 52nd Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers)

Back to the Blocks World: Learning New Actions through Situated Human-Robot Dialogue
Lanbo She | Shaohua Yang | Yu Cheng | Yunyi Jia | Joyce Chai | Ning Xi
Proceedings of the 15th Annual Meeting of the Special Interest Group on Discourse and Dialogue (SIGDIAL)

2013

Modeling Collaborative Referring for Situated Referential Grounding
Changsong Liu | Rui Fang | Lanbo She | Joyce Chai
Proceedings of the SIGDIAL 2013 Conference

Towards Situated Dialogue: Revisiting Referring Expression Generation
Rui Fang | Changsong Liu | Lanbo She | Joyce Y. Chai
Proceedings of the 2013 Conference on Empirical Methods in Natural Language Processing

2012

Autonomous Self-Assessment of Autocorrections: Exploring Text Message Dialogues
Tyler Baldwin | Joyce Chai
Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies

Towards Mediating Shared Perceptual Basis in Situated Dialogue
Changsong Liu | Rui Fang | Joyce Chai
Proceedings of the 13th Annual Meeting of the Special Interest Group on Discourse and Dialogue

Semantic Role Labeling of Implicit Arguments for Nominal Predicates
Matthew Gerber | Joyce Y. Chai
Computational Linguistics, Volume 38, Issue 4 - December 2012

2011

Beyond Normalization: Pragmatics of Word Form in Text Messages
Tyler Baldwin | Joyce Chai
Proceedings of 5th International Joint Conference on Natural Language Processing

A Joint Model of Implicit Arguments for Nominal Predicates
Matthew Gerber | Joyce Chai | Robert Bart
Proceedings of the ACL 2011 Workshop on Relational Models of Semantics

Proceedings of the SIGDIAL 2011 Conference
Joyce Y. Chai | Johanna D. Moore | Rebecca J. Passonneau | David R. Traum
Proceedings of the SIGDIAL 2011 Conference

2010

Hand Gestures in Disambiguating Types of You Expressions in Multiparty Meetings
Tyler Baldwin | Joyce Chai | Katrin Kirchhoff
Proceedings of the SIGDIAL 2010 Conference

Fusing Eye Gaze with Speech Recognition Hypotheses to Resolve Exophoric References in Situated Dialogue
Zahar Prasov | Joyce Y. Chai
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing

Towards Conversation Entailment: An Empirical Investigation
Chen Zhang | Joyce Chai
Proceedings of the 2010 Conference on Empirical Methods in Natural Language Processing

Beyond NomBank: A Study of Implicit Arguments for Nominal Predicates
Matthew Gerber | Joyce Chai
Proceedings of the 48th Annual Meeting of the Association for Computational Linguistics

2009

The Role of Interactivity in Human-Machine Conversation for Automatic Word Acquisition
Shaolin Qu | Joyce Chai
Proceedings of the SIGDIAL 2009 Conference

What do We Know about Conversation Participants: Experiments on Conversation Entailment
Chen Zhang | Joyce Chai
Proceedings of the SIGDIAL 2009 Conference

The Role of Implicit Argumentation in Nominal SRL
Matthew Gerber | Joyce Chai | Adam Meyers
Proceedings of Human Language Technologies: The 2009 Annual Conference of the North American Chapter of the Association for Computational Linguistics

2008

Incorporating Temporal and Semantic Information with Eye Gaze for Automatic Word Acquisition in Multimodal Conversational Systems
Shaolin Qu | Joyce Chai
Proceedings of the 2008 Conference on Empirical Methods in Natural Language Processing

2007

An Exploration of Eye Gaze in Spoken Language Processing for Multimodal Conversational Interfaces
Shaolin Qu | Joyce Chai
Human Language Technologies 2007: The Conference of the North American Chapter of the Association for Computational Linguistics; Proceedings of the Main Conference

Automated Vocabulary Acquisition and Interpretation in Multimodal Conversational Systems
Yi Liu | Joyce Chai | Rong Jin
Proceedings of the 45th Annual Meeting of the Association of Computational Linguistics

2006

Towards Conversational QA: Automatic Identification of Problematic Situations and User Intent
Joyce Y. Chai | Chen Zhang | Tyler Baldwin
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions

2005

A Salience Driven Approach to Robust Input Interpretation in Multimodal Conversational Systems
Joyce Y. Chai | Shaolin Qu
Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing

2004

Discourse Structure for Context Question Answering
Joyce Y. Chai | Rong Jin
Proceedings of the Workshop on Pragmatics of Question Answering at HLT-NAACL 2004

Optimization in Multimodal Interpretation
Joyce Y. Chai | Pengyu Hong | Michelle X. Zhou | Zahar Prasov
Proceedings of the 42nd Annual Meeting of the Association for Computational Linguistics (ACL-04)

Performance Evaluation and Error Analysis for Multimodal Reference Resolution in a Conversation System
Joyce Y. Chai | Zahar Prasov | Pengyu Hong
Proceedings of HLT-NAACL 2004: Short Papers

2003

Combining Semantic and Temporal Constraints for Multimodal Integration in Conversation Systems
Joyce Y. Chai | Pengyu Hong | Michelle X. Zhou
Proceedings of the HLT-NAACL 2003 Workshop on Research Directions in Dialogue Processing

2002

Semantics-based Representation for Multimodal Interpretation in Conversational Systems
Joyce Chai
COLING 2002: The 19th International Conference on Computational Linguistics

2001

A Conversational Interface for Online Shopping
Joyce Chai | Veronika Horvath | Nanda Kambhatla | Nicolas Nicolov | Margo Stys-Budzikowska
Proceedings of the First International Conference on Human Language Technology Research

Conversational Sales Assistant for Online Shopping
Margo Budzikowska | Joyce Chai | Sunil Govindappa | Veronika Horvath | Nanda Kambhatla | Nicolas Nicolov | Wlodek Zadrozny
Proceedings of the First International Conference on Human Language Technology Research

2000

Dynamic User Level and Utility Measurement for Adaptive Dialog in a Help-Desk System
Preetam Maloor | Joyce Chai
1st SIGdial Workshop on Discourse and Dialogue

Evaluation of a Generic Lexical Semantic Resource in Information Extraction
Joyce Yue Chai
Proceedings of the Second International Conference on Language Resources and Evaluation (LREC’00)

1997

Duke’s Trainable Information and Meaning Extraction System (Duke TIMES)
Amit Bagga | Joyce Yue Chai
Fifth Conference on Applied Natural Language Processing: Descriptions of System Demonstrations and Videos

Corpus Based Statistical Generalization Tree in Rule Optimization
Joyce Yue Chai | Alan W. Biermann
Fifth Workshop on Very Large Corpora

The Use of Lexical Semantics in Information Extraction
Joyce Yue Chai | Alan W. Biermann
Automatic Information Extraction and Building of Lexical Semantic Resources for NLP Applications

A Trainable Message Understanding System
Amit Bagga | Joyce Yue Chai
CoNLL97: Computational Natural Language Learning