2023
pdf
bib
Evaluating a Multilingual Pre-trained Model for the Automatic Standard German captioning of Swiss German TV
Johanna Gerlach
|
Pierrette Bouillon
|
Silvia Rodríguez Vázquez
|
Jonathan Mutal
|
Marianne Starlander
Proceedings of the 8th edition of the Swiss Text Analytics Conference
2014
pdf
abs
The IULA Spanish LSP Treebank
Montserrat Marimon
|
Núria Bel
|
Beatriz Fisas
|
Blanca Arias
|
Silvia Vázquez
|
Jorge Vivaldi
|
Carlos Morell
|
Mercè Lorente
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
This paper presents the IULA Spanish LSP Treebank, a dependency treebank of over 41,000 sentences of different domains (Law, Economy, Computing Science, Environment, and Medicine), developed in the framework of the European project METANET4U. Dependency annotations in the treebank were automatically derived from manually selected parses produced by an HPSG-grammar by a deterministic conversion algorithm that used the identifiers of grammar rules to identify the heads, the dependents, and some dependency types that were directly transferred onto the dependency structure (e.g., subject, specifier, and modifier), and the identifiers of the lexical entries to identify the argument-related dependency functions (e.g. direct object, indirect object, and oblique complement). The treebank is accessible with a browser that provides concordance-based search functions and delivers the results in two formats: (i) a column-based format, in the style of CoNLL-2006 shared task, and (ii) a dependency graph, where dependency relations are noted by an oriented arrow which goes from the dependent node to the head node. The IULA Spanish LSP Treebank is the first technical corpus of Spanish annotated at surface syntactic level following the dependency grammar theory. The treebank has been made publicly and freely available from the META-SHARE platform with a Creative Commons CC-by licence.
pdf
abs
Applying Accessibility-Oriented Controlled Language (CL) Rules to Improve Appropriateness of Text Alternatives for Images: an Exploratory Study
Silvia Rodríguez Vázquez
|
Pierrette Bouillon
|
Anton Bolfing
Proceedings of the Ninth International Conference on Language Resources and Evaluation (LREC'14)
At present, inappropriate text alternatives for images in the Web continue to pose web accessibility barriers for people with special needs. Although research efforts have been devoted to define how to write text equivalents for visual content in websites, existing guidelines often lack direct linguistic-oriented recommendations. Similarly, most web accessibility evaluation tools just provide users with an automated functionality to check the presence of text alternatives within the element, rather than a platform to verify their content. This paper presents an overview of the findings from an exploratory study carried out to investigate if the appropriateness level of text alternatives for images in French can be improved when applying controlled language (CL) rules. Results gathered suggest that using accessibility-oriented alt style rules can have a significant impact on text alternatives appropriateness. Although more data would be needed to draw further conclusions about our proposal, this preliminary study already offers an interest insight into the potential use of CL checkers such as Acrolinx for language-based web accessibility evaluation.
2012
pdf
Automatic Extraction of Polar Adjectives for the Creation of Polarity Lexicons
Silvia Vázquez
|
Muntsa Padró
|
Núria Bel
|
Julio Gonzalo
Proceedings of COLING 2012: Posters
pdf
abs
A Classification of Adjectives for Polarity Lexicons Enhancement
Silvia Vázquez
|
Núria Bel
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
Subjective language detection is one of the most important challenges in Sentiment Analysis. Because of the weight and frequency in opinionated texts, adjectives are considered a key piece in the opinion extraction process. These subjective units are more and more frequently collected in polarity lexicons in which they appear annotated with their prior polarity. However, at the moment, any polarity lexicon takes into account prior polarity variations across domains. This paper proves that a majority of adjectives change their prior polarity value depending on the domain. We propose a distinction between domain dependent and domain independent adjectives. Moreover, our analysis led us to propose a further classification related to subjectivity degree: constant, mixed and highly subjective adjectives. Following this classification, polarity values will be a better support for Sentiment Analysis.
pdf
abs
The IULA Treebank
Montserrat Marimon
|
Beatriz Fisas
|
Núria Bel
|
Marta Villegas
|
Jorge Vivaldi
|
Sergi Torner
|
Mercè Lorente
|
Silvia Vázquez
|
Marta Villegas
Proceedings of the Eighth International Conference on Language Resources and Evaluation (LREC'12)
This paper describes on-going work for the construction of a new treebank for Spanish, The IULA Treebank. This new resource will contain about 60,000 richly annotated sentences as an extension of the already existing IULA Technical Corpus which is only PoS tagged. In this paper we have focused on describing the work done for defining the annotation process and the treebank design principles. We report on how the used framework, the DELPH-IN processing framework, has been crucial in the design principles and in the bootstrapping strategy followed, especially in what refers to the use of stochastic modules for reducing parsing overgeneration. We also report on the different evaluation experiments carried out to guarantee the quality of the already available results.
2011
pdf
Creating Sentiment Dictionaries via Triangulation
Josef Steinberger
|
Polina Lenkova
|
Mohamed Ebrahim
|
Maud Ehrmann
|
Ali Hurriyetoglu
|
Mijail Kabadjov
|
Ralf Steinberger
|
Hristo Tanev
|
Vanni Zavarella
|
Silvia Vázquez
Proceedings of the 2nd Workshop on Computational Approaches to Subjectivity and Sentiment Analysis (WASSA 2.011)