2018
pdf
Retrieving Information from the French Lexical Network in RDF/OWL Format
Alexsandro Fonseca
|
Fatiha Sadat
|
François Lareau
Proceedings of the Eleventh International Conference on Language Resources and Evaluation (LREC 2018)
2016
pdf
abs
Named Entity Recognition and Hashtag Decomposition to Improve the Classification of Tweets
Billal Belainine
|
Alexsandro Fonseca
|
Fatiha Sadat
Proceedings of the 2nd Workshop on Noisy User-generated Text (WNUT)
In social networks services like Twitter, users are overwhelmed with huge amount of social data, most of which are short, unstructured and highly noisy. Identifying accurate information from this huge amount of data is indeed a hard task. Classification of tweets into organized form will help the user to easily access these required information. Our first contribution relates to filtering parts of speech and preprocessing this kind of highly noisy and short data. Our second contribution concerns the named entity recognition (NER) in tweets. Thus, the adaptation of existing language tools for natural languages, noisy and not accurate language tweets, is necessary. Our third contribution involves segmentation of hashtags and a semantic enrichment using a combination of relations from WordNet, which helps the performance of our classification system, including disambiguation of named entities, abbreviations and acronyms. Graph theory is used to cluster the words extracted from WordNet and tweets, based on the idea of connected components. We test our automatic classification system with four categories: politics, economy, sports and the medical field. We evaluate and compare several automatic classification systems using part or all of the items described in our contributions and found that filtering by part of speech and named entity recognition dramatically increase the classification precision to 77.3 %. Moreover, a classification system incorporating segmentation of hashtags and semantic enrichment by two relations from WordNet, synonymy and hyperonymy, increase classification precision up to 83.4 %.
pdf
abs
Lexfom: a lexical functions ontology model
Alexsandro Fonseca
|
Fatiha Sadat
|
François Lareau
Proceedings of the 5th Workshop on Cognitive Aspects of the Lexicon (CogALex - V)
A lexical function represents a type of relation that exists between lexical units (words or expressions) in any language. For example, the antonymy is a type of relation that is represented by the lexical function Anti: Anti(big) = small. Those relations include both paradigmatic relations, i.e. vertical relations, such as synonymy, antonymy and meronymy and syntagmatic relations, i.e. horizontal relations, such as objective qualification (legitimate demand), subjective qualification (fruitful analysis), positive evaluation (good review) and support verbs (pay a visit, subject to an interrogation). In this paper, we present the Lexical Functions Ontology Model (lexfom) to represent lexical functions and the relation among lexical units. Lexfom is divided in four modules: lexical function representation (lfrep), lexical function family (lffam), lexical function semantic perspective (lfsem) and lexical function relations (lfrel). Moreover, we show how it combines to Lexical Model for Ontologies (lemon), for the transformation of lexical networks into the semantic web formats. So far, we have implemented 100 simple and 500 complex lexical functions, and encoded about 8,000 syntagmatic and 46,000 paradigmatic relations, for the French language.
2014
pdf
Identifying Portuguese Multiword Expressions using Different Classification Algorithms - A Comparative Analysis
Alexsandro Fonseca
|
Fatiha Sadat
|
Alexandre Blondin Massé
Proceedings of the 4th International Workshop on Computational Terminology (Computerm)
pdf
A Comparative Study of Different Classification Methods for the Identification of Brazilian Portuguese Multiword Expressions
Alexsandro Fonseca
|
Fatiha Sadat
Proceedings of the First Workshop on Computational Approaches to Compound Analysis (ComAComA 2014)