SubmissionNumber#=%=#228 FinalPaperTitle#=%=#USTC-BUPT at SemEval-2024 Task 8: Enhancing Machine-Generated Text Detection via Domain Adversarial Neural Networks and LLM Embeddings ShortPaperTitle#=%=# NumberOfPages#=%=#12 CopyrightSigned#=%=#Zikang Guo JobTitle#==# Organization#==# Abstract#==#This paper introduces the system developed by USTC-BUPT for SemEval-2024 Task 8. The shared task comprises three subtasks across four tracks, aiming to develop automatic systems to distinguish between human-written and machine-generated text across various domains, languages and generators. Our system comprises four components: DATeD, LLAM, TLE, and AuDM, which empower us to effectively tackle all subtasks posed by the challenge. In the monolingual track, DATeD improves machine-generated text detection by incorporating a gradient reversal layer and integrating additional domain labels through Domain Adversarial Neural Networks, enhancing adaptation to diverse text domains. In the multilingual track, LLAM employs different strategies based on language characteristics. For English text, the LLM Embeddings approach utilizes embeddings from a proxy LLM followed by a two-stage CNN for classification, leveraging the broad linguistic knowledge captured during pre-training to enhance performance. For text in other languages, the LLM Sentinel approach transforms the classification task into a next-token prediction task, which facilitates easier adaptation to texts in various languages, especially low-resource languages. TLE utilizes the LLM Embeddings method with a minor modification in the classification strategy for subtask B. AuDM employs data augmentation and fine-tunes the DeBERTa model specifically for subtask C. Our system wins the multilingual track and ranks second in the monolingual track. Additionally, it achieves third place in both subtask B and C. Author{1}{Firstname}#=%=#Zikang Author{1}{Lastname}#=%=#Guo Author{1}{Username}#=%=#guozk Author{1}{Email}#=%=#gzk170401@mail.ustc.edu.cn Author{1}{Affiliation}#=%=#University of Science and Technology of China Author{2}{Firstname}#=%=#Kaijie Author{2}{Lastname}#=%=#Jiao Author{2}{Username}#=%=#shizha Author{2}{Email}#=%=#jiaokaijie@mail.ustc.cn Author{2}{Affiliation}#=%=#University of Science and Technology of China Author{3}{Firstname}#=%=#Xingyu Author{3}{Lastname}#=%=#Yao Author{3}{Username}#=%=#yaoxyu Author{3}{Email}#=%=#yaoxyu777@gmail.com Author{3}{Affiliation}#=%=#Beijing University of Posts and Telecommunications Author{4}{Firstname}#=%=#Yuning Author{4}{Lastname}#=%=#Wan Author{4}{Username}#=%=#wanyn Author{4}{Email}#=%=#peterwan1102@gmail.com Author{4}{Affiliation}#=%=#University of Science and Technology of China Author{5}{Firstname}#=%=#Haoran Author{5}{Lastname}#=%=#Li Author{5}{Username}#=%=#lihaoran01 Author{5}{Email}#=%=#lihaoran01@bupt.edu.cn Author{5}{Affiliation}#=%=#Beijing University of Posts and Telecommunications Author{6}{Firstname}#=%=#Benfeng Author{6}{Lastname}#=%=#Xu Author{6}{Username}#=%=#quentinzzz Author{6}{Email}#=%=#benfeng@mail.ustc.edu.cn Author{6}{Affiliation}#=%=#University of Science and Technology of China Author{7}{Firstname}#=%=#Licheng Author{7}{Lastname}#=%=#Zhang Author{7}{Username}#=%=#zhanglc06 Author{7}{Email}#=%=#zlczlc@mail.ustc.edu.cn Author{7}{Affiliation}#=%=#University of Science and Technology of China Author{8}{Firstname}#=%=#Quan Author{8}{Lastname}#=%=#Wang Author{8}{Username}#=%=#quanwang Author{8}{Email}#=%=#wangquan@bupt.edu.cn Author{8}{Affiliation}#=%=#Beijing University of Posts and Telecommunications Author{9}{Firstname}#=%=#Yongdong Author{9}{Lastname}#=%=#Zhang Author{9}{Username}#=%=#zhangyongdong Author{9}{Email}#=%=#zhyd73@ustc.edu.cn Author{9}{Affiliation}#=%=#University of Science and Technology of China Author{10}{Firstname}#=%=#Zhendong Author{10}{Lastname}#=%=#Mao Author{10}{Username}#=%=#maozhendong Author{10}{Email}#=%=#zdmao@ustc.edu.cn Author{10}{Affiliation}#=%=#University of Science and Technology of China ========== èéáğö