LiveCANNBench: Benchmark SWE AI Coding for Ascend CANN

Sijie Wang; Kai Zhao; Wee Peng Tay; Shuo Zhang; Chengwen Liu; Quanjiang Guo; Ren Junhao; Xin Li; Heng Lian; Jingdi Lei; Rui She; Huacan Wang; Ronghao Chen

LiveCANNBench: Benchmark SWE AI Coding for Ascend CANN

Sijie Wang, Kai Zhao, Wee Peng Tay, Shuo Zhang, Chengwen Liu, Quanjiang Guo, Ren Junhao, Xin Li, Heng Lian, Jingdi Lei, Rui She, Huacan Wang, Ronghao Chen

Abstract

AI coding has emerged as a core application of large language models (LLMs), evolving from single-file coding tasks towards complex software engineering (SWE) scenarios. Recent advances in agents have enabled multi-file, multi-language, and dependency-aware AI coding, significantly expanding the scope of AI-assisted software development. While a variety of benchmarks have been proposed to evaluate coding capabilities in general-purpose or GPU coding ecosystems such as CUDA and ROCm, systematic evaluation for Huawei Ascend CANN remains largely underexplored. In this work, we propose LiveCANNBench, an SWE-level benchmark designed for AI coding in the CANN software stack. LiveCANNBench is constructed from real-world CANN repositories and consists of over 400 task instances spanning multi-file, multi-language, and execution-aware coding challenges. Unlike existing static benchmarks that primarily focus on kernel-level code generation, LiveCANNBench adopts a live benchmarking paradigm, effectively mitigating data leakage and enabling more reliable evaluation of modern coding agents.

Anthology ID:: 2026.findings-acl.1143
Volume:: Findings of the Association for Computational Linguistics: ACL 2026
Month:: July
Year:: 2026
Address:: San Diego, California, United States
Editors:: Maria Liakata, Viviane P. Moreira, Jiajun Zhang, David Jurgens
Venue:: Findings
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 22788–22803
Language:
URL:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1143/
DOI:
Bibkey:
Cite (ACL):: Sijie Wang, Kai Zhao, Wee Peng Tay, Shuo Zhang, Chengwen Liu, Quanjiang Guo, Ren Junhao, Xin Li, Heng Lian, Jingdi Lei, Rui She, Huacan Wang, and Ronghao Chen. 2026. LiveCANNBench: Benchmark SWE AI Coding for Ascend CANN. In Findings of the Association for Computational Linguistics: ACL 2026, pages 22788–22803, San Diego, California, United States. Association for Computational Linguistics.
Cite (Informal):: LiveCANNBench: Benchmark SWE AI Coding for Ascend CANN (Wang et al., Findings 2026)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-acl/2026.findings-acl.1143.pdf
Checklist:: 2026.findings-acl.1143.checklist.pdf

PDF Cite Search Checklist Fix data