A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial News

Ali Jabbari, Olivier Sauvage, Hamada Zeine, Hamza Chergui


Abstract
In financial services industry, compliance involves a series of practices and controls in order to meet key regulatory standards which aim to reduce financial risk and crime, e.g. money laundering and financing of terrorism. Faced with the growing risks, it is imperative for financial institutions to seek automated information extraction techniques for monitoring financial activities of their customers. This work describes an ontology of compliance-related concepts and relationships along with a corpus annotated according to it. The presented corpus consists of financial news articles in French and allows for training and evaluating domain-specific named entity recognition and relation extraction algorithms. We present some of our experimental results on named entity recognition and relation extraction using our annotated corpus. We aim to furthermore use the the proposed ontology towards construction of a knowledge base of financial relations.
Anthology ID:
2020.lrec-1.279
Volume:
Proceedings of the Twelfth Language Resources and Evaluation Conference
Month:
May
Year:
2020
Address:
Marseille, France
Venue:
LREC
SIG:
Publisher:
European Language Resources Association
Note:
Pages:
2293–2299
Language:
English
URL:
https://aclanthology.org/2020.lrec-1.279
DOI:
Bibkey:
Cite (ACL):
Ali Jabbari, Olivier Sauvage, Hamada Zeine, and Hamza Chergui. 2020. A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial News. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 2293–2299, Marseille, France. European Language Resources Association.
Cite (Informal):
A French Corpus and Annotation Schema for Named Entity Recognition and Relation Extraction of Financial News (Jabbari et al., LREC 2020)
Copy Citation:
PDF:
https://preview.aclanthology.org/auto-file-uploads/2020.lrec-1.279.pdf