Xudong Shen
Papers on this page may belong to the following people: Xudong Shen, Xudong Shen
2025
CODEMENV: Benchmarking Large Language Models on Code Migration
Keyuan Cheng | Xudong Shen | Yihao Yang | Tengyue Wang | Yang Cao | Muhammad Asif Ali | Hanbin Wang | Lijie Hu | Di Wang
Findings of the Association for Computational Linguistics: ACL 2025
Keyuan Cheng | Xudong Shen | Yihao Yang | Tengyue Wang | Yang Cao | Muhammad Asif Ali | Hanbin Wang | Lijie Hu | Di Wang
Findings of the Association for Computational Linguistics: ACL 2025
Large language models (LLMs) have demonstrated remarkable proficiency in handling a wide range of tasks within the software engineering domain, but their ability to perform code migration—adapting code to different environments—remains underexplored. In this work, we propose a novel benchmark, : Code Migration Across Environment, designed to evaluate LLMs’ performance in handling code migration tasks. The benchmark comprises 922 data points across 19 Python and Java packages, offering three tasks to systematically evaluate code migration: identifying version-incompatible functions, determining function changes, and adapting code to target environments. Experimental evaluation of across seven LLMs revealed an average pass@1 rate of 26.50%, with GPT-4o performing best at 43.84%. We highlight our key findings as follows: (i) LLMs are more familiar with newer function versions, making them better at migrating legacy code, and (ii) a logical inconsistency where LLMs sometimes identify irrelevant function changes for the target migration environment.
2023
NL-Augmenter: A Framework for Task-Sensitive Natural Language Augmentation
Kaustubh Dhole | Varun Gangal | Sebastian Gehrmann | Aadesh Gupta | Zhenhao Li | Saad Mahamood | Abinaya Mahadiran | Simon Mille | Ashish Shrivastava | Samson Tan | Tongshang Wu | Jascha Sohl-Dickstein | Jinho Choi | Eduard Hovy | Ondřej Dušek | Sebastian Ruder | Sajant Anand | Nagender Aneja | Rabin Banjade | Lisa Barthe | Hanna Behnke | Ian Berlot-Attwell | Connor Boyle | Caroline Brun | Marco Antonio Sobrevilla Cabezudo | Samuel Cahyawijaya | Emile Chapuis | Wanxiang Che | Mukund Choudhary | Christian Clauss | Pierre Colombo | Filip Cornell | Gautier Dagan | Mayukh Das | Tanay Dixit | Thomas Dopierre | Paul-Alexis Dray | Suchitra Dubey | Tatiana Ekeinhor | Marco Di Giovanni | Tanya Goyal | Rishabh Gupta | Louanes Hamla | Sang Han | Fabrice Harel-Canada | Antoine Honoré | Ishan Jindal | Przemysław Joniak | Denis Kleyko | Venelin Kovatchev | Kalpesh Krishna | Ashutosh Kumar | Stefan Langer | Seungjae Ryan Lee | Corey James Levinson | Hualou Liang | Kaizhao Liang | Zhexiong Liu | Andrey Lukyanenko | Vukosi Marivate | Gerard de Melo | Simon Meoni | Maxine Meyer | Afnan Mir | Nafise Sadat Moosavi | Niklas Meunnighoff | Timothy Sum Hon Mun | Kenton Murray | Marcin Namysl | Maria Obedkova | Priti Oli | Nivranshu Pasricha | Jan Pfister | Richard Plant | Vinay Prabhu | Vasile Pais | Libo Qin | Shahab Raji | Pawan Kumar Rajpoot | Vikas Raunak | Roy Rinberg | Nicholas Roberts | Juan Diego Rodriguez | Claude Roux | Vasconcellos Samus | Ananya Sai | Robin Schmidt | Thomas Scialom | Tshephisho Sefara | Saqib Shamsi | Xudong Shen | Yiwen Shi | Haoyue Shi | Anna Shvets | Nick Siegel | Damien Sileo | Jamie Simon | Chandan Singh | Roman Sitelew | Priyank Soni | Taylor Sorensen | William Soto | Aman Srivastava | Aditya Srivatsa | Tony Sun | Mukund Varma | A Tabassum | Fiona Tan | Ryan Teehan | Mo Tiwari | Marie Tolkiehn | Athena Wang | Zijian Wang | Zijie Wang | Gloria Wang | Fuxuan Wei | Bryan Wilie | Genta Indra Winata | Xinyu Wu | Witold Wydmanski | Tianbao Xie | Usama Yaseen | Michael Yee | Jing Zhang | Yue Zhang
Northern European Journal of Language Technology, Volume 9
Kaustubh Dhole | Varun Gangal | Sebastian Gehrmann | Aadesh Gupta | Zhenhao Li | Saad Mahamood | Abinaya Mahadiran | Simon Mille | Ashish Shrivastava | Samson Tan | Tongshang Wu | Jascha Sohl-Dickstein | Jinho Choi | Eduard Hovy | Ondřej Dušek | Sebastian Ruder | Sajant Anand | Nagender Aneja | Rabin Banjade | Lisa Barthe | Hanna Behnke | Ian Berlot-Attwell | Connor Boyle | Caroline Brun | Marco Antonio Sobrevilla Cabezudo | Samuel Cahyawijaya | Emile Chapuis | Wanxiang Che | Mukund Choudhary | Christian Clauss | Pierre Colombo | Filip Cornell | Gautier Dagan | Mayukh Das | Tanay Dixit | Thomas Dopierre | Paul-Alexis Dray | Suchitra Dubey | Tatiana Ekeinhor | Marco Di Giovanni | Tanya Goyal | Rishabh Gupta | Louanes Hamla | Sang Han | Fabrice Harel-Canada | Antoine Honoré | Ishan Jindal | Przemysław Joniak | Denis Kleyko | Venelin Kovatchev | Kalpesh Krishna | Ashutosh Kumar | Stefan Langer | Seungjae Ryan Lee | Corey James Levinson | Hualou Liang | Kaizhao Liang | Zhexiong Liu | Andrey Lukyanenko | Vukosi Marivate | Gerard de Melo | Simon Meoni | Maxine Meyer | Afnan Mir | Nafise Sadat Moosavi | Niklas Meunnighoff | Timothy Sum Hon Mun | Kenton Murray | Marcin Namysl | Maria Obedkova | Priti Oli | Nivranshu Pasricha | Jan Pfister | Richard Plant | Vinay Prabhu | Vasile Pais | Libo Qin | Shahab Raji | Pawan Kumar Rajpoot | Vikas Raunak | Roy Rinberg | Nicholas Roberts | Juan Diego Rodriguez | Claude Roux | Vasconcellos Samus | Ananya Sai | Robin Schmidt | Thomas Scialom | Tshephisho Sefara | Saqib Shamsi | Xudong Shen | Yiwen Shi | Haoyue Shi | Anna Shvets | Nick Siegel | Damien Sileo | Jamie Simon | Chandan Singh | Roman Sitelew | Priyank Soni | Taylor Sorensen | William Soto | Aman Srivastava | Aditya Srivatsa | Tony Sun | Mukund Varma | A Tabassum | Fiona Tan | Ryan Teehan | Mo Tiwari | Marie Tolkiehn | Athena Wang | Zijian Wang | Zijie Wang | Gloria Wang | Fuxuan Wei | Bryan Wilie | Genta Indra Winata | Xinyu Wu | Witold Wydmanski | Tianbao Xie | Usama Yaseen | Michael Yee | Jing Zhang | Yue Zhang
Northern European Journal of Language Technology, Volume 9
Data augmentation is an important method for evaluating the robustness of and enhancing the diversity of training data for natural language processing (NLP) models. In this paper, we present NL-Augmenter, a new participatory Python-based natural language (NL) augmentation framework which supports the creation of transformations (modifications to the data) and filters (data splits according to specific features). We describe the framework and an initial set of 117 transformations and 23 filters for a variety of NL tasks annotated with noisy descriptive tags. The transformations incorporate noise, intentional and accidental human mistakes, socio-linguistic variation, semantically-valid style, syntax changes, as well as artificial constructs that are unambiguous to humans. We demonstrate the efficacy of NL-Augmenter by using its transformations to analyze the robustness of popular language models. We find different models to be differently challenged on different tasks, with quasi-systematic score decreases. The infrastructure, datacards, and robustness evaluation results are publicly available on GitHub for the benefit of researchers working on paraphrase generation, robustness analysis, and low-resource NLP.
2022
Super-NaturalInstructions: Generalization via Declarative Instructions on 1600+ NLP Tasks
Yizhong Wang | Swaroop Mishra | Pegah Alipoormolabashi | Yeganeh Kordi | Amirreza Mirzaei | Atharva Naik | Arjun Ashok | Arut Selvan Dhanasekaran | Anjana Arunkumar | David Stap | Eshaan Pathak | Giannis Karamanolakis | Haizhi Lai | Ishan Purohit | Ishani Mondal | Jacob Anderson | Kirby Kuznia | Krima Doshi | Kuntal Kumar Pal | Maitreya Patel | Mehrad Moradshahi | Mihir Parmar | Mirali Purohit | Neeraj Varshney | Phani Rohitha Kaza | Pulkit Verma | Ravsehaj Singh Puri | Rushang Karia | Savan Doshi | Shailaja Keyur Sampat | Siddhartha Mishra | Sujan Reddy A | Sumanta Patro | Tanay Dixit | Xudong Shen
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
Yizhong Wang | Swaroop Mishra | Pegah Alipoormolabashi | Yeganeh Kordi | Amirreza Mirzaei | Atharva Naik | Arjun Ashok | Arut Selvan Dhanasekaran | Anjana Arunkumar | David Stap | Eshaan Pathak | Giannis Karamanolakis | Haizhi Lai | Ishan Purohit | Ishani Mondal | Jacob Anderson | Kirby Kuznia | Krima Doshi | Kuntal Kumar Pal | Maitreya Patel | Mehrad Moradshahi | Mihir Parmar | Mirali Purohit | Neeraj Varshney | Phani Rohitha Kaza | Pulkit Verma | Ravsehaj Singh Puri | Rushang Karia | Savan Doshi | Shailaja Keyur Sampat | Siddhartha Mishra | Sujan Reddy A | Sumanta Patro | Tanay Dixit | Xudong Shen
Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing
How well can NLP models generalize to a variety of unseen tasks when provided with task instructions? To address this question, we first introduce Super-NaturalInstructions, a benchmark of 1,616 diverse NLP tasks and their expert-written instructions. Our collection covers 76 distinct task types, including but not limited to classification, extraction, infilling, sequence tagging, text rewriting, and text composition. This large and diverse collection of tasks enables rigorous benchmarking of cross-task generalization under instructions—training models to follow instructions on a subset of tasks and evaluating them on the remaining unseen ones.Furthermore, we build Tk-Instruct, a transformer model trained to follow a variety of in-context instructions (plain language task definitions or k-shot examples). Our experiments show that Tk-Instruct outperforms existing instruction-following models such as InstructGPT by over 9% on our benchmark despite being an order of magnitude smaller. We further analyze generalization as a function of various scaling parameters, such as the number of observed tasks, the number of instances per task, and model sizes. We hope our dataset and model facilitate future progress towards more general-purpose NLP models.
Search
Fix author
Co-authors
- Tanay Dixit 2
- Sujan Reddy A 1
- Muhammad Asif Ali 1
- Pegah Alipoormolabashi 1
- Sajant Anand 1
- Jacob Anderson 1
- Nagender Aneja 1
- Anjana Arunkumar 1
- Arjun Ashok 1
- Rabin Banjade 1
- Lisa Barthe 1
- Hanna Behnke 1
- Ian Berlot-Attwell 1
- Connor Boyle 1
- Caroline Brun 1
- Samuel Cahyawijaya 1
- Yang Cao 1
- Emile Chapuis 1
- Wanxiang Che (车万翔) 1
- Keyuan Cheng 1
- Jinho D. Choi 1
- Mukund Choudhary 1
- Christian Clauss 1
- Pierre Colombo 1
- Filip Cornell 1
- Gautier Dagan 1
- Mayukh Das 1
- Gerard De Melo 1
- Arut Selvan Dhanasekaran 1
- Kaustubh Dhole 1
- Marco Di Giovanni 1
- Thomas Dopierre 1
- Krima Doshi 1
- Savan Doshi 1
- Paul-Alexis Dray 1
- Suchitra Dubey 1
- Ondřej Dušek 1
- Tatiana Ekeinhor 1
- Varun Gangal 1
- Sebastian Gehrmann 1
- Tanya Goyal 1
- Aadesh Gupta 1
- Rishabh Gupta 1
- Louanes Hamla 1
- Sang Han 1
- Fabrice Harel-Canada 1
- Antoine Honoré 1
- Eduard Hovy 1
- Lijie Hu 1
- Ishan Jindal 1
- Przemysław Joniak 1
- Giannis Karamanolakis 1
- Rushang Karia 1
- Phani Rohitha Kaza 1
- Denis Kleyko 1
- Yeganeh Kordi 1
- Venelin Kovatchev 1
- Kalpesh Krishna 1
- Ashutosh Kumar 1
- Kirby Kuznia 1
- Haizhi Lai 1
- Stefan Langer 1
- Seungjae Ryan Lee 1
- Corey James Levinson 1
- Zhenhao Li 1
- Hualou Liang 1
- Kaizhao Liang 1
- Zhexiong Liu 1
- Andrey Lukyanenko 1
- Abinaya Mahadiran 1
- Saad Mahamood 1
- Vukosi Marivate 1
- Simon Meoni 1
- Niklas Meunnighoff 1
- Maxine Meyer 1
- Simon Mille 1
- Afnan Mir 1
- Amirreza Mirzaei 1
- Siddhartha Mishra 1
- Swaroop Mishra 1
- Ishani Mondal 1
- Nafise Sadat Moosavi 1
- Mehrad Moradshahi 1
- Timothy Sum Hon Mun 1
- Kenton Murray 1
- Atharva Naik 1
- Marcin Namysl 1
- Maria Obedkova 1
- Priti Oli 1
- Vasile Pais 1
- Kuntal Kumar Pal 1
- Mihir Parmar 1
- Nivranshu Pasricha 1
- Maitreya Patel 1
- Eshaan Pathak 1
- Sumanta Patro 1
- Jan Pfister 1
- Richard Plant 1
- Vinay Prabhu 1
- Ravsehaj Singh Puri 1
- Ishan Purohit 1
- Mirali Purohit 1
- Libo Qin 1
- Shahab Raji 1
- Pawan Kumar Rajpoot 1
- Vikas Raunak 1
- Roy Rinberg 1
- Nicholas Roberts 1
- Juan Diego Rodriguez 1
- Claude Roux 1
- Sebastian Ruder 1
- Ananya Sai 1
- Shailaja Keyur Sampat 1
- Vasconcellos Samus 1
- Robin Schmidt 1
- Thomas Scialom 1
- Tshephisho Sefara 1
- Saqib Shamsi 1
- Freda Shi 1
- Yiwen Shi 1
- Ashish Shrivastava 1
- Anna Shvets 1
- Nick Siegel 1
- Damien Sileo 1
- Jamie Simon 1
- Chandan Singh 1
- Roman Sitelew 1
- Marco Antonio Sobrevilla Cabezudo 1
- Jascha Sohl-Dickstein 1
- Priyank Soni 1
- Taylor Sorensen 1
- William Soto Martinez 1
- Aman Srivastava 1
- Aditya Srivatsa 1
- David Stap 1
- Tony Sun 1
- A Tabassum 1
- Fiona Tan 1
- Samson Tan 1
- Ryan Teehan 1
- Mo Tiwari 1
- Marie Tolkiehn 1
- Mukund Varma 1
- Neeraj Varshney 1
- Pulkit Verma 1
- Athena Wang 1
- Di Wang 1
- Gloria Wang 1
- Hanbin Wang 1
- Tengyue Wang 1
- Yizhong Wang 1
- Zijian Wang 1
- Zijie Wang 1
- Fuxuan Wei 1
- Bryan Wilie 1
- Genta Indra Winata 1
- Tongshang Wu 1
- Xinyu Wu 1
- Witold Wydmanski 1
- Tianbao Xie 1
- Yihao Yang 1
- Usama Yaseen 1
- Michael Yee 1
- Jing Zhang 1
- Yue Zhang 1