Adrián Rabadán


2017

pdf
Exploring the Behavior of Classic REG Algorithms in the Description of Characters in 3D Images
Gonzalo Méndez | Raquel Hervás | Susana Bautista | Adrián Rabadán | Teresa Rodríguez
Proceedings of the 10th International Conference on Natural Language Generation

Describing people and characters can be very useful in different contexts, such as computational narrative or image description for the visually impaired. However, a review of the existing literature shows that the automatic generation of people descriptions has not received much attention. Our work focuses on the description of people in snapshots from a 3D environment. First, we have conducted a survey to identify the way in which people describe other people under different conditions. We have used the information extracted from this survey to design several Referring Expression Generation algorithms which produce similar results. We have evaluated these algorithms with users in order to identify which ones generate the best description for specific characters in different situations. The evaluation has shown that, in order to generate good descriptions, a combination of different algorithms has to be used depending on the features and situation of the person to be described.

2016

pdf
Improving Information Extraction from Wikipedia Texts using Basic English
Teresa Rodríguez-Ferreira | Adrián Rabadán | Raquel Hervás | Alberto Díaz
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

The aim of this paper is to study the effect that the use of Basic English versus common English has on information extraction from online resources. The amount of online information available to the public grows exponentially, and is potentially an excellent resource for information extraction. The problem is that this information often comes in an unstructured format, such as plain text. In order to retrieve knowledge from this type of text, it must first be analysed to find the relevant details, and the nature of the language used can greatly impact the quality of the extracted information. In this paper, we compare triplets that represent definitions or properties of concepts obtained from three online collaborative resources (English Wikipedia, Simple English Wikipedia and Simple English Wiktionary) and study the differences in the results when Basic English is used instead of common English. The results show that resources written in Basic English produce less quantity of triplets, but with higher quality.