Borja Navarro

Also published as: B. Navarro, Borja Navarro-Colorado


2024

The Simplification of the Language of Public Administration: The Case of Ombudsman Institutions
Gabriel Gonzalez-Delgado | Borja Navarro-Colorado
Proceedings of the Workshop on DeTermIt! Evaluating Text Difficulty in a Multilingual Context @ LREC-COLING 2024

Language produced by Public Administrations has crucial implications for citizens’ lives. However, its syntactic complexity and the use of legal jargon, among other factors, make it difficult for laypeople and certain target audiences to understand. The NLP task of Automatic Text Simplification (ATS) can help with the necessary simplification of this technical language. For that purpose, specialized parallel datasets of complex-simple pairs need to be developed for training these ATS systems. In this position paper, an ongoing project is presented, whose main objectives are (a) to extensively analyze the syntactic, lexical, and discursive features of the language of English-speaking ombudsmen, as samples of public administrative language, with special attention to those characteristics that pose a threat to comprehension, and (b) to develop the OmbudsCorpus, a parallel corpus of complex-simple supra-sentential fragments from ombudsmen’s case reports that have been manually simplified by professionals and annotated with standardized simplification operations. This research endeavor aims to provide a deeper understanding of the simplification process and to enhance the training of ATS systems specialized in administrative texts.

2016

Metrical Annotation of a Large Corpus of Spanish Sonnets: Representation, Scansion and Evaluation
Borja Navarro | María Ribes Lafoz | Noelia Sánchez
Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16)

In order to analyze metrical and semantic aspects of poetry in Spanish with computational techniques, we have developed a large corpus annotated with metrical information. In this paper we present and discuss the development of this corpus: the formal representation of metrical patterns, the semi-automatic annotation process based on a new automatic scansion system, the main annotation problems, and the evaluation, in which an inter-annotator agreement of 96% was obtained. The corpus is open and available.

2015

GPLSIUA: Combining Temporal Information and Topic Modeling for Cross-Document Event Ordering
Borja Navarro | Estela Saquete
Proceedings of the 9th International Workshop on Semantic Evaluation (SemEval 2015)

A computational linguistic approach to Spanish Golden Age Sonnets: metrical and semantic aspects
Borja Navarro
Proceedings of the Fourth Workshop on Computational Linguistics for Literature

2011

Data-Driven Approach Using Semantics for Recognizing and Classifying TimeML Events in Italian
Tommaso Caselli | Hector Llorens | Borja Navarro-Colorado | Estela Saquete
Proceedings of the International Conference Recent Advances in Natural Language Processing 2011

2010

TIPSem (English and Spanish): Evaluating CRFs and Semantic Roles in TempEval-2
Hector Llorens | Estela Saquete | Borja Navarro
Proceedings of the 5th International Workshop on Semantic Evaluation

TimeML Events Recognition and Classification: Learning CRF Models with Semantic Roles
Hector Llorens | Estela Saquete | Borja Navarro-Colorado
Proceedings of the 23rd International Conference on Computational Linguistics (Coling 2010)

2009

Using Semantic Networks to Identify Temporal Expressions from Semantic Roles
Hector Llorens | Borja Navarro | Estela Saquete
Proceedings of the International Conference RANLP-2009

2007

UA-ZBSA: A Headline Emotion Classification through Web Information
Zornitsa Kozareva | Borja Navarro | Sonia Vázquez | Andrés Montoyo
Proceedings of the Fourth International Workshop on Semantic Evaluations (SemEval-2007)

2004

Exploiting Semantic Information for Manual Anaphoric Annotation in Cast3LB Corpus
Borja Navarro | Ruben Izquierdo | Maximiliano Saiz-Noeda
Proceedings of the Workshop on Discourse Annotation

MiniCors and Cast3LB: Two Semantically Tagged Spanish Corpora
M. Taulé | M. Civit | N. Artigas | M. García | L. Màrquez | M.A. Martí | B. Navarro
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

Automatic Extraction of Syntactic Semantic Patterns for Multilingual Resources
Borja Navarro | Manuel Palomar | Patricio Martínez-Barco
Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04)

2003

Issues in the Syntactic Annotation of Cast3LB
Montserrat Civit | Ma. Antònia Martí | Borja Navarro | Núria Bufí | Belén Fernández | Raquel Marcos
Proceedings of the 4th International Workshop on Linguistically Interpreted Corpora (LINC-03) at EACL 2003