Caroline Barriere

Also published as: Caroline Barrière

2018

pdf abs
CRIM at SemEval-2018 Task 9: A Hybrid Approach to Hypernym Discovery
Gabriel Bernier-Colborne | Caroline Barrière
Proceedings of the 12th International Workshop on Semantic Evaluation

This report describes the system developed by the CRIM team for the hypernym discovery task at SemEval 2018. This system exploits a combination of supervised projection learning and unsupervised pattern-based hypernym discovery. It was ranked first on the 3 sub-tasks for which we submitted results.

2017

pdf
Fine-grained domain classification of text using TERMIUM Plus
Gabriel Bernier-Colborne | Caroline Barrière | Pierre André Ménard
Proceedings of Language, Ontology, Terminology and Knowledge Structures Workshop (LOTKS 2017)

pdf
PACTE: A colloaborative platform for textual annotation
Pierre André Ménard | Caroline Barrière
Proceedings of the 13th Joint ISO-ACL Workshop on Interoperable Semantic Annotation (ISA-13)

2016

pdf
Classification of comment helpfulness to improve knowledge sharing among medical practitioners.
Pierre André Ménard | Caroline Barrière
Proceedings of the 7th Workshop on Computational Approaches to Subjectivity, Sentiment and Social Media Analysis

pdf abs
Contextual term equivalent search using domain-driven disambiguation
Caroline Barrière | Pierre André Ménard | Daphnée Azoulay
Proceedings of the 5th International Workshop on Computational Terminology (Computerm2016)

This article presents a domain-driven algorithm for the task of term sense disambiguation (TSD). TSD aims at automatically choosing which term record from a term bank best represents the meaning of a term occurring in a particular context. In a translation environment, finding the contextually appropriate term record is necessary to access the proper equivalent to be used in the target language text. The term bank TERMIUM Plus, recently published as an open access repository, is chosen as a domain-rich resource for testing our TSD algorithm, using English and French as source and target languages. We devise an experiment using over 1300 English terms found in scientific articles, and show that our domain-driven TSD algorithm is able to bring the best term record, and therefore the best French equivalent, at the average rank of 1.69 compared to a baseline random rank of 3.51.

2014

pdf abs
Linked Open Data and Web Corpus Data for noun compound bracketing
Pierre André Ménard | Caroline Barrière
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)

This research provides a comparison of a linked open data resource (DBpedia) and web corpus data resources (Google Web Ngrams and Google Books Ngrams) for noun compound bracketing. Large corpus statistical analysis has often been used for noun compound bracketing, and our goal is to introduce a linked open data (LOD) resource for such task. We show its particularities and its performance on the task. Results obtained on resources tested individually are promising, showing a potential for DBpedia to be included in future hybrid systems.

pdf
Multiword noun compound bracketing using Wikipedia
Caroline Barrière | Pierre André Ménard
Proceedings of the First Workshop on Computational Approaches to Compound Analysis (ComAComA 2014)

2011

pdf
Searching Parallel Corpora for Contextually Equivalent Terms
Caroline Barrière | Pierre Isabelle
Proceedings of the 15th Annual Conference of the European Association for Machine Translation

2010

pdf abs
Recherche contextuelle d’équivalents en banque de terminologie
Caroline Barrière
Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Articles longs

Notre recherche démontre que l’utilisation du contenu d’un texte à traduire permet de mieux cibler dans une banque de terminologie les équivalents terminologiques pertinents à ce texte. Une banque de terminologie a comme particularité qu’elle catégorise ses entrées (fiches) en leur assignant un ou des domaines provenant d’une liste de domaines préétablie. La stratégie ici présentée repose sur l’utilisation de cette information sur les domaines. Un algorithme a été développé pour l’assignation automatique d’un profil de domaines à un texte. Celui-ci est combiné à un algorithme d’appariement entre les domaines d’un terme présent dans la banque de terminologie et le profil de domaines du texte. Pour notre expérimentation, des résumés bilingues (français et anglais) provenant de huit revues scientifiques nous fournissent un ensemble de 1130 paires d’équivalents terminologiques et le Grand Dictionnaire Terminologique (Office Québécois de la Langue Française) nous sert de ressource terminologique. Sur notre ensemble, nous démontrons une réduction de 75% du rang moyen de l’équivalent correct en comparaison avec un choix au hasard.

pdf bib abs
TerminoWeb : recherche et analyse d’information thématique
Caroline Barrière
Actes de la 17e conférence sur le Traitement Automatique des Langues Naturelles. Démonstrations

Notre démonstration porte sur le prototype TerminoWeb, une plateforme Web qui permet (1) la construction automatique d’un corpus thématique à partir d’une recherche de documents sur le Web, (2) l’extraction de termes du corpus, et (3) la recherche d’information définitionnelle sur ces termes en corpus. La plateforme intégrant les trois modules, elle aidera un langagier (terminologue, traducteur, rédacteur) à découvrir un nouveau domaine (thème) en facilitant la recherche et l’analyse de documents informatifs pertinents à ce domaine.

2009

pdf bib
Finding Domain Specific Collocations and Concordances on the Web
Caroline Barrière
Proceedings of the Workshop on Natural Language Processing Methods and Corpora in Translation, Lexicography, and Language Learning

pdf bib
The Web as a Source of Informative Background Knowledge
Caroline Barrière
Beyond Translation Memories: New Tools for Translators Workshop

2006