Bao-Tran Pham-Hong


Fixing paper assignments

  1. Please select all papers that belong to the same person.
  2. Indicate below which author they should be assigned to.
Provide a valid ORCID iD here. This will be used to match future papers to this author.
Provide the name of the school or the university where the author has received or will receive their highest degree (e.g., Ph.D. institution for researchers, or current affiliation for students). This will be used to form the new author page ID, if needed.

TODO: "submit" and "cancel" buttons here


2020

pdf bib
PGSG at SemEval-2020 Task 12: BERT-LSTM with Tweets’ Pretrained Model and Noisy Student Training Method
Bao-Tran Pham-Hong | Setu Chokshi
Proceedings of the Fourteenth Workshop on Semantic Evaluation

The paper presents a system developed for the SemEval-2020 competition Task 12 (OffensEval-2): Multilingual Offensive Language Identification in Social Media. We achieve the second place (2nd) in sub-task B: Automatic categorization of offense types and are ranked 55th with a macro F1-score of 90.59 in sub-task A: Offensive language identification. Our solution is using a stack of BERT and LSTM layers, training with the Noisy Student method. Since the tweets data contains a large number of noisy words and slang, we update the vocabulary of the BERT large model pre-trained by the Google AI Language team. We fine-tune the model with tweet sentences provided in the challenge.

2018

pdf bib
Genre-Oriented Web Content Extraction with Deep Convolutional Neural Networks and Statistical Methods
Bao-Dai Nguyen-Hoang | Bao-Tran Pham-Hong | Yiping Jin | Phu T. V. Le
Proceedings of the 32nd Pacific Asia Conference on Language, Information and Computation