@inproceedings{cai-etal-2025-handling,
    title = "Handling Missing Entities in Zero-Shot Named Entity Recognition: Integrated Recall and Retrieval Augmentation",
    author = "Cai, Ruichu  and
      Lu, Junhao  and
      Chen, Zhongjie  and
      Xu, Boyan  and
      Hao, Zhifeng",
    editor = "Chiruzzo, Luis  and
      Ritter, Alan  and
      Wang, Lu",
    booktitle = "Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)",
    month = apr,
    year = "2025",
    address = "Albuquerque, New Mexico",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.naacl-long.540/",
    doi = "10.18653/v1/2025.naacl-long.540",
    pages = "10790--10802",
    ISBN = "979-8-89176-189-6",
    abstract = "Zero-shot Named Entity Recognition (ZS-NER) aims to recognize entities in unseen domains without specific annotated data. A key challenge is handling missing entities while ensuring accurate type recognition, hindered by: 1) the pre-training assumption that each entity has a single type, overlooking diversity, and 2) insufficient contextual knowledge for type reasoning. To address this, we propose IRRA (Integrated Recall and Retrieval Augmentation), a novel two-stage framework leveraging large language model techniques. In the \textit{Recall Augmented Entity Extracting} stage, we built a perturbed dataset to induce the model to exhibit missing or erroneous extracted entities. Based on this, we trained an enhanced model to correct these errors. This approach can improve the ZS-NER{'}s recall rate. In the \textit{Retrieval Augmented Type Correcting} stage, we employ Retrieval-Augmented Generation techniques to locate entity-related unannotated contexts, with the additional contextual information significantly improving the accuracy of type correcting. Extensive evaluations demonstrate the state-of-the-art performance of our IRRA, with significant improvements in zero-shot cross-domain settings validated through both auto-evaluated metrics and analysis. Our implementation will be open-sourced at\url{https://github.com/DMIRLAB-Group/IRRA}."
}Markdown (Informal)
[Handling Missing Entities in Zero-Shot Named Entity Recognition: Integrated Recall and Retrieval Augmentation](https://preview.aclanthology.org/ingest-emnlp/2025.naacl-long.540/) (Cai et al., NAACL 2025)
ACL