Talaat Khalil


2022

pdf
HuaAMS at SemEval-2022 Task 8: Combining Translation and Domain Pre-training for Cross-lingual News Article Similarity
Sai Sandeep Sharma Chittilla | Talaat Khalil
Proceedings of the 16th International Workshop on Semantic Evaluation (SemEval-2022)

This paper describes our submission to SemEval-2022 Multilingual News Article Similarity task. We experiment with different approaches that utilize a pre-trained language model fitted with a regression head to predict similarity scores for a given pair of news articles. Our best performing systems include 2 key steps: 1) pre-training with in-domain data 2) training data enrichment through machine translation. Our final submission is an ensemble of predictions from our top systems. While we show the significance of pre-training and augmentation, we believe the issue of language coverage calls for more attention.

pdf
Empirical Evaluation of Language Agnostic Filtering of Parallel Data for Low Resource Languages
Praveen Dakwale | Talaat Khalil | Brandon Denis
Proceedings of the 36th Pacific Asia Conference on Language, Information and Computation

2019

pdf
Cross-lingual intent classification in a low resource industrial setting
Talaat Khalil | Kornel Kiełczewski | Georgios Christos Chouliaras | Amina Keldibek | Maarten Versteegh
Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP)

This paper explores different approaches to multilingual intent classification in a low resource setting. Recent advances in multilingual text representations promise cross-lingual transfer for classifiers. We investigate the potential for this transfer in an applied industrial setting and compare to multilingual classification using machine translated text. Our results show that while the recently developed methods show promise, practical application calls for a combination of techniques for useful results.

2017

pdf
Toward a full-scale neural machine translation in production: the Booking.com use case
Pavel Levin | Nishikant Dhanuka | Talaat Khalil | Fedor Kovalev | Maxim Khalilov
Proceedings of Machine Translation Summit XVI: Commercial MT Users and Translators Track

2016

pdf
NileTMRG at SemEval-2016 Task 5: Deep Convolutional Neural Networks for Aspect Category and Sentiment Extraction
Talaat Khalil | Samhaa R. El-Beltagy
Proceedings of the 10th International Workshop on Semantic Evaluation (SemEval-2016)