Zhoujin Tian
2022
RAPO: An Adaptive Ranking Paradigm for Bilingual Lexicon Induction
Zhoujin Tian | Chaozhuo Li | Shuo Ren | Zhiqiang Zuo | Zengxuan Wen | Xinyue Hu | Xiao Han | Haizhen Huang | Denvy Deng | Qi Zhang | Xing Xie
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Bilingual lexicon induction induces word translations by aligning independently trained word embeddings in two languages. Existing approaches generally focus on minimizing the distances between words in aligned pairs, but suffer from limited discriminative capability to distinguish the relative order of positive and negative candidates. In addition, the mapping function is globally shared by all words, so its performance may be hindered by deviations in the distributions of the two languages. In this work, we propose a novel ranking-oriented induction model, RAPO, which learns a personalized mapping function for each word. RAPO enjoys the merits of both the unique characteristics of a single word and the cross-language isomorphism simultaneously. Extensive experimental results on public datasets including both rich-resource and low-resource languages demonstrate the superiority of our proposal. Our code is publicly available at https://github.com/Jlfj345wf/RAPO.
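The abstract describes a per-word ("personalized") mapping trained with a ranking objective rather than a pure distance objective. The sketch below is an illustrative toy version only, not the released RAPO code: a shared linear mapping plus a hypothetical per-word offset embedding, trained with a margin ranking loss that scores gold translations above negative candidates.

```python
# Illustrative sketch (PyTorch), not the authors' implementation.
import torch
import torch.nn as nn
import torch.nn.functional as F

class PersonalizedMapper(nn.Module):
    def __init__(self, vocab_size: int, dim: int):
        super().__init__()
        self.shared = nn.Linear(dim, dim, bias=False)  # global mapping shared by all words
        self.offset = nn.Embedding(vocab_size, dim)    # hypothetical per-word adaptation term
        nn.init.zeros_(self.offset.weight)

    def forward(self, src_ids, src_emb):
        # Map source embeddings into the target space, with a word-specific correction.
        return self.shared(src_emb) + self.offset(src_ids)

def ranking_loss(mapped_src, pos_tgt, neg_tgt, margin: float = 0.1):
    # Encourage the positive translation to outrank the negative candidate by a margin.
    pos = F.cosine_similarity(mapped_src, pos_tgt, dim=-1)
    neg = F.cosine_similarity(mapped_src, neg_tgt, dim=-1)
    return F.relu(margin - pos + neg).mean()

# Toy usage with random embeddings.
vocab, dim, batch = 1000, 300, 32
mapper = PersonalizedMapper(vocab, dim)
src_ids = torch.randint(0, vocab, (batch,))
src_emb, pos_tgt, neg_tgt = (torch.randn(batch, dim) for _ in range(3))
loss = ranking_loss(mapper(src_ids, src_emb), pos_tgt, neg_tgt)
loss.backward()
```

A ranking loss of this kind optimizes the relative order of candidates directly, which is what distinguishes a ranking-oriented objective from simply minimizing distances between aligned pairs.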
TANet: Thread-Aware Pretraining for Abstractive Conversational Summarization
Ze Yang | Christian Wang | Zhoujin Tian | Wei Wu | Zhoujun Li
Findings of the Association for Computational Linguistics: NAACL 2022
Although pre-trained language models (PLMs) have achieved great success and become a milestone in NLP, abstractive conversational summarization remains a challenging and less studied task. The difficulty lies in two aspects. One is the lack of large-scale conversational summary data. The other is that applying existing pre-trained models to this task is difficult because of the structural dependencies within a conversation and its informal expressions. In this work, we first build a large-scale (11M) pretraining dataset called RCSum, based on multi-person discussions in the Reddit community. We then present TANet, a thread-aware Transformer-based network. Unlike existing pre-trained models that treat a conversation as a sequence of sentences, we argue that the inherent contextual dependency among utterances plays an essential role in understanding the entire conversation, and we propose two new techniques to incorporate this structural information into our model. The first is thread-aware attention, which is computed by taking the contextual dependency among utterances into account. The second is a thread prediction loss that predicts the relations between utterances. We evaluate our model on four datasets of real conversations, covering meeting transcripts, customer-service records, and forum threads. Experimental results demonstrate that TANet achieves new state-of-the-art results in terms of both automatic evaluation and human judgment.
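The abstract mentions attention that respects the contextual dependency among utterances. One plausible reading, shown below as an illustrative sketch only (not the released TANet code), is to mask utterance-level self-attention so each utterance attends to itself and its ancestors in the reply tree; the tree structure, parent indices, and scoring are assumptions for demonstration.

```python
# Illustrative sketch (PyTorch), not the authors' implementation.
import torch
import torch.nn.functional as F

def thread_mask(parents):
    # parents[i] = index of the utterance that utterance i replies to (-1 for the root).
    n = len(parents)
    mask = torch.zeros(n, n, dtype=torch.bool)
    for i in range(n):
        j = i
        while j != -1:          # an utterance sees itself and all of its ancestors
            mask[i, j] = True
            j = parents[j]
    return mask

def thread_aware_attention(utt_repr, parents):
    # utt_repr: (n_utterances, dim) vector per utterance; off-thread scores are masked out.
    scores = utt_repr @ utt_repr.T / utt_repr.size(-1) ** 0.5
    scores = scores.masked_fill(~thread_mask(parents), float("-inf"))
    return F.softmax(scores, dim=-1) @ utt_repr

# Toy forum thread: u1 replies to the root, u2 replies to u1, u3 replies to the root.
parents = [-1, 0, 1, 0]
out = thread_aware_attention(torch.randn(4, 64), parents)
```

Restricting attention along reply chains is one simple way to encode the conversation's structure directly in the attention pattern, complementing an auxiliary loss that predicts relations between utterances.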
Co-authors
- Chaozhuo Li 1
- Shuo Ren 1
- Zhiqiang Zuo 1
- Zengxuan Wen 1
- Xinyue Hu 1