New Author System in the ACL Anthology
January 17, 2026
A basic task of the ACL Anthology is to correctly match papers to authors. When a new volume is ingested, the Anthology receives author information as textual metadata along with each PDF. When the site is built, we assemble author pages from all papers that share the same name.
This may sound straightforward, but in practice it’s difficult, for two reasons: the same person may publish under multiple name variants (diacritics, middle initials, name changes), and many names are shared by multiple people. In practice, we resolve both of these issues using a manual process: one mechanism to group papers for a single author with multiple names, and another to explicitly assign a paper to a specific author using a name ID. This process is time-consuming and labor-intensive.
The following histogram of name counts shows that most names are unique, but the number of ambiguous names is still significant. In the past few years, especially, the number of ambiguous names has increased sharply, which increases the amount of human effort in resolving these ambiguities.
| People sharing a name | Name count | Examples |
|---|---|---|
| 1 | 122,122 | — (unique entries) |
| 2 | 121 | Ai, Wei; Amini, Massih R |
| 3 | 297 | Agarwal, Shubham; Bartelt, Christian |
| 4 | 63 | Zhang, Dongyu; Zhang, Han |
| 5 | 22 | Chen, Wei; Li, Yang |
| 6 | 9 | Li, Lei; Wang, Di |
| 7 | 11 | Chen, Chen; Li, Chen |
| 8 | 2 | Liu, Wei; Zhang, Yu |
| 9 | 2 | Li, Bo; Wang, Hao |
| 10 | 2 | Chen, Hao; Li, Xiang |
| 14 | 1 | Zhang, Li (only known example) |
| 25 | 1 | Liu, Yang (only known example) |
To address this, we are introducing a new author system centered on ORCID iDs, a widely used persistent identifier for researchers. Papers with ORCID iDs will be automatically linked together and used to create “verified” author pages. Authors without ORCID iDs will still have author pages, but these will be marked as “unverified” to indicate that the papers listed may not all belong to the same person.
The best way to ensure your papers are correctly attributed is to create an ORCID iD and add it everywhere you submit papers: in particular, to your OpenReview profile and to conference submission systems such as Softconf. If you notice papers that don’t belong together, you can use the “Fix author” link on the author page to report it.
For more information, you can read about how we verify authors and how we use ORCID iDs.