@inproceedings{lei-etal-2021-keyphrase,
    title = "Keyphrase Extraction with Incomplete Annotated Training Data",
    author = "Lei, Yanfei  and
      Hu, Chunming  and
      Ma, Guanghui  and
      Zhang, Richong",
    editor = "Xu, Wei  and
      Ritter, Alan  and
      Baldwin, Tim  and
      Rahimi, Afshin",
    booktitle = "Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021)",
    month = nov,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2021.wnut-1.4/",
    doi = "10.18653/v1/2021.wnut-1.4",
    pages = "26--34",
    abstract = "Extracting keyphrases that summarize the main points of a document is a fundamental task in natural language processing. Supervised approaches to keyphrase extraction(KPE) are largely developed based on the assumption that the training data is fully annotated. However, due to the difficulty of keyphrase annotating, KPE models severely suffer from incomplete annotated problem in many scenarios. To this end, we propose a more robust training method that learns to mitigate the misguidance brought by unlabeled keyphrases. We introduce negative sampling to adjust training loss, and conduct experiments under different scenarios. Empirical studies on synthetic datasets and open domain dataset show that our model is robust to incomplete annotated problem and surpasses prior baselines. Extensive experiments on five scientific domain datasets of different scales demonstrate that our model is competitive with the state-of-the-art method."
}Markdown (Informal)
[Keyphrase Extraction with Incomplete Annotated Training Data](https://preview.aclanthology.org/ingest-emnlp/2021.wnut-1.4/) (Lei et al., WNUT 2021)
ACL