Frequently Asked Questions

Important information about the ACL Anthology


Using the Anthology

  • How can I programmatically access the Anthology's data?

    The Anthology’s data is hosted in our Github repo, which contains all the metadata for all its papers (under /data/xml) and their authors (data/yaml/people.yaml), the volumes those papers are organized into, and the real-world events that presented those volumes. The PDFs are hosted on our servers.

    All of this data is accessible via the ACL Anthology Python module on PyPI, which you can install with pip:

    pip install acl-anthology
    

    Please see our repository documentation or the Python module documentation for more information.

    You may also be interested in learning how easy it is to cite our papers in a variety of citation formats.

  • How do I cite papers in the ACL Anthology?

    Citing papers in the ACL Anthology is simple. We provide

    • bulk bibliographic exports
    • click-to-copy citation keys on each paper page
    • per-paper citation downloads

    Our primary supported format is BibTeX. The simplest way to cite papers is download the bulk BibTeX exports, and then use the citation keys, which are often inferrable from the paper’s author list and title: {authors}-{year}-{title-word}, where

    • {authors} is the last names of the first one or two authors, separated by hyphens; if more than two authors, etal is used for the second author
    • {year} is the four-digit year of publication
    • {title-word} is the first significant word of the paper’s title; additional words are added if needed to create a unique bibkey

    Some examples are galley-etal-2004-whats and huang-chiang-2005-better. For convenience, a button on each paper page provides click-to-copy access to the bibkey.

    Bulk downloads for consolidated BibTeX files are available in the following variations.

    • Overleaf-friendly: anthology-1.bib, anthology-2.bib etc. are sharded variants that are under 50 MB each, suitable for direct import into Overleaf repositories.
    • Full, with abstracts: anthology+abstracts.bib.gz contains citations for all papers that exist in the Anthology, including abstracts.
    • No abstracts: anthology.bib.gz contains all citations but removes abstracts, to save on space.
    • No abstracts, uncompressed: anthology.bib is the same as the above, but provided uncompressed, for convenience.

    For individual papers, buttons are provided to download citation data in a number of formats, including BibTeX, MODS XML, Endnote, and an informal citation string. These formats can be downloaded as files or copied to the clipboard via convenient buttons.

    Finally, we also offer an XML paper feed, which is useful in tools like Zotero and Mendeley.

  • How do I link to publications in the ACL Anthology?

    Every paper in the Anthology is assigned an Anthology ID. After 2020, identifiers are of the form {year}.{venue}-{volume}.{#}, where {year} is the four-digit year, {venue} a lowercased venue code comprising ASCII letters and digits, {volume} is a volume name or number, and {#} is the paper number. Prior to 2020, this identifier took the form CYY-VPPP or CYY-VVPP, where C is a collection, YY a two-digit year, V a volume, and P a paper ID. The canonical URL of an Anthology paper is given by appending this identifier to the Anthology’s base URL https://www.aclanthology.org/; e.g., https://www.aclanthology.org/2020.iwclul-1.4 (new style) or https://www.aclanthology.org/E91-1001 (old style). This is the paper’s landing page, which includes (among other things) a link to the PDF.

    Many papers in the Anthology also have Digital Object Identifiers (DOIs). Both the DOIs and the canonical Anthology URLs embed the 8-character ACL Anthology Identifier. When available, DOI URLs will redirect to the Anthology canonical URL, and will be listed on that page.

    Variations of the canonical URL can be used to access the PDF and citation format files directly:

    and so on.

Submitting and correcting content

  • How can I submit corrections to papers?

    There are a number of types of corrections. For an overview, you can see our our corrections page.

    In summary, there are four main types of corrections we handle:

    • PDF corrections. This includes changes such as revisions or retractions. For more information, please see see details on the corrections page.
    • Metadata corrections. This includes corrections to the title, author list, or abstract, for situations where they do not match the PDF. These can be initiated by clicking on the yellow “Fix data” button on any paper page in the Anthology.
    • Author profile merging. This is required when an author’s papers are split over multiple author profiles reflecting different name variants of a single author. Example: Aravind Joshi. After ensuring that all papers have metadata that matches the PDF, Anthology staff can merge profiles under a canonical name variant.
    • Author splitting. In this situation, multiple different people have published under the same name. To address this, we assign a manual ID to one or more of the authors, based on the author’s preferred name and institution of highest (anticipate) degree.

    Important: The Anthology treats the data on the PDF itself as the authoritative source of information, so all metadata corrections should conform to the information found there. If that information is wrong, a PDF correction is needed.

  • How can I submit my proceedings or journal to the Anthology?

    Please see our submission guide.

  • How can I submit new a new venue to the Anthology?

    A venue in the ACL Anthology is a conference, workshop, journal, or other publication series that publishes papers in volumes. We regularly accept new venues subject to certain conditions. Please see our information page on new venues.

  • How can I submit attachments to papers?

    Anthology paper pages can include attachments submitted at the time of ingestion. This typically includes supplementary materials uploaded with the final (“camera-ready”) version of the paper, such as code or data tarballs or submission checklists.

    We do not have a formal process for accepting attachments submitted after initial publication. If you have an important artifact that you would like to be archived with a paper (e.g., a unique poster or sought-after presentation), please create a Github issue and we will deal with it on a case-by-case basis.

    Around 2020, we had a more formal process for accepting attachments to papers, but we retired this ability due to the cost in time to Anthology volunteers. We may restore this ability in the future.

Community & other info