脑卒中疾病电子病历实体及实体关系标注语料库构建(Corpus Construction for Named-Entity and Entity Relations for Electronic Medical Records of Stroke Disease)
Hongyang Chang (常洪阳), Hongying Zan (昝红英), Yutuan Ma (马玉团), Kunli Zhang (张坤丽)
Abstract
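This paper examines the annotation of entities and inter-entity relations in Chinese electronic medical record (EMR) text for stroke, and proposes an annotation scheme and guidelines for entities and entity relations suited to stroke EMR text. Guided by this scheme and these guidelines, several rounds of manual annotation and correction were carried out, completing entity and entity-relation annotation of more than 1.58 million characters of stroke EMR text. The result is the Stroke Electronic Medical Record entity and entity related Corpus (SEMRC). The corpus contains 10,594 named entities and 14,457 entity relations. Annotation agreement reaches 85.16% for entity names and 94.16% for entity relations.
- Anthology ID: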
- 2021.ccl-1.57
- Volume:
- Proceedings of the 20th Chinese National Conference on Computational Linguistics
- Month:
- August
- Year:
- 2021
- Address:
- Huhhot, China
- Editors:
- Sheng Li (李生), Maosong Sun (孙茂松), Yang Liu (刘洋), Hua Wu (吴华), Kang Liu (刘康), Wanxiang Che (车万翔), Shizhu He (何世柱), Gaoqi Rao (饶高琦)
- Venue:
- CCL
- Publisher:
- Chinese Information Processing Society of China
- Pages:
- 633–642
- Language:
- Chinese
- URL:
- https://aclanthology.org/2021.ccl-1.57
- Cite (ACL):
- Hongyang Chang, Hongying Zan, Yutuan Ma, and Kunli Zhang. 2021. 脑卒中疾病电子病历实体及实体关系标注语料库构建(Corpus Construction for Named-Entity and Entity Relations for Electronic Medical Records of Stroke Disease). In Proceedings of the 20th Chinese National Conference on Computational Linguistics, pages 633–642, Huhhot, China. Chinese Information Processing Society of China.
- Cite (Informal):
- 脑卒中疾病电子病历实体及实体关系标注语料库构建(Corpus Construction for Named-Entity and Entity Relations for Electronic Medical Records of Stroke Disease) (Chang et al., CCL 2021)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/2021.ccl-1.57.pdf