Emmanuel Giguet


2021

pdf bib
Daniel@FinTOC-2021: Taking Advantage of Images and Vectorial Shapes in Native PDF Document Analysis
Emmanuel Giguet | Gaël Lejeune
Proceedings of the 3rd Financial Narrative Processing Workshop

2020

pdf bib
Daniel@FinTOC’2 Shared Task: Title Detection and Structure Extraction
Emmanuel Giguet | Gaël Lejeune | Jean-Baptiste Tanguy
Proceedings of the 1st Joint Workshop on Financial Narrative Processing and MultiLing Financial Summarisation

We present our contributions for the 2020 FinTOC Shared Tasks: Title Detection and Table of Contents Extraction. For the Structure Extraction task, we propose an approach that combines information from multiple sources: the table of contents, the wording of the document, and lexical domain knowledge. For the title detection task, we compare surface features to character-based features on various training configurations. We show that title detection results are very sensitive to the kind of training dataset used.

pdf bib
Daniel at the FinSBD-2 Task: Extracting List and Sentence Boundaries from PDF Documents, a model-driven approach to PDF document analysis
Emmanuel Giguet | Gaël Lejeune
Proceedings of the Second Workshop on Financial Technology and Natural Language Processing

2019

pdf bib
Daniel@FinTOC-2019 Shared Task : TOC Extraction and Title Detection
Emmanuel Giguet | Gaël Lejeune
Proceedings of the Second Financial Narrative Processing Workshop (FNP 2019)

2013

pdf bib
Parallel areas detection in multi-documents for multilingual alignment (Détection de zones parallèles à l’intérieur de multi-documents pour l’alignement multilingue) [in French]
Charlotte Lecluze | Romain Brixtel | Loïs Rigouste | Emmanuel Giguet | Régis Clouard | Gaël Lejeune | Patrick Constant
Proceedings of TALN 2013 (Volume 1: Long Papers)

2006

pdf bib
Multilingual Lexical Database Generation from Parallel Texts in 20 European Languages with Endogenous Resources
Emmanuel Giguet | Pierre-Sylvain Luquet
Proceedings of the COLING/ACL 2006 Main Conference Poster Sessions

1997

pdf bib
Syntactic Structures of Sentences from Large Corpora
Emmanuel Giguet | Jacques Vergne
Fifth Conference on Applied Natural Language Processing: Descriptions of System Demonstrations and Videos

pdf bib
From Part of Speech Tagging to Memory-based Deep Syntactic Analysis
Emmanuel Giguet | Jacques Vergne
Proceedings of the Fifth International Workshop on Parsing Technologies

This paper presents a robust system for deep syntactic parsing of unrestricted French. This system uses techniques from Part-of-Speech tagging in order to build a constituent structure and uses other techniques from dependency grammar in an original framework of memories in order to build a functional structure. The two structures are build simultaneously by two interacting processes. The processes share the same aim, that is, to recover efficiently and reliably syntactic information with no explicit expectation on text structure.