Rahma Sellami


2014

pdf
Collaboratively Constructed Linguistic Resources for Language Variants and their Exploitation in NLP Application – the case of Tunisian Arabic and the Social Media
Fatiha Sadat | Fatma Mallek | Mohamed Boudabous | Rahma Sellami | Atefeh Farzindar
Proceedings of Workshop on Lexical and Grammatical Resources for Language Processing

2013

pdf
Exploiting multiple resources for Japanese to English patent translation
Rahma Sellami | Fatiha Sadat | Lamia Hadrich Belguith
Proceedings of the 5th Workshop on Patent Translation

2012

pdf
Extraction de lexiques bilingues à partir de Wikipédia (Bilingual lexicon extraction from Wikipedia) [in French]
Rahma Sellami | Fatiha Sadat | Lamia Hadrich Belguith
JEP-TALN-RECITAL 2012, Workshop TALAf 2012: Traitement Automatique des Langues Africaines (TALAf 2012: African Language Processing)

pdf
Exploiting Wikipedia as a Knowledge Base for the Extraction of Linguistic Resources: Application on Arabic-French Comparable Corpora and Bilingual Lexicons
Rahma Sellami | Fatiha Sadat | Lamia Hadrich Belguith
Fourth Workshop on Computational Approaches to Arabic-Script-based Languages

We present simple and effective methods for extracting comparable corpora and bilingual lexicons from Wikipedia. We shall exploit the large scale and the structure of Wikipedia articles to extract two resources that will be very useful for natural language applications. We build a comparable corpus from Wikipedia using categories as topic restrictions and we extract bilingual lexicons from inter-language links aligned with statistical method or a combined statistical and linguistic method.