Joseba Abaitua


2011

pdf
Experiments on Term Extraction using Noun Phrase Subclassifications
Merley da Silva Conrado | Walter Koza | Josuka Díaz-Labrador | Joseba Abaitua | Solange Oliveira Rezende | Thiago Pardo | Zulema Solana
Proceedings of the International Conference Recent Advances in Natural Language Processing 2011

2003

pdf
Metadata for multilingual content management - A practical experience with the SARE-Bi system
JosuKa Díaz-Labrador | Joseba Abaitua | Inés Jacob | Fernando Quintana
Proceedings of Translating and the Computer 25

2002

pdf
Cascading XSL Filters for Content Selection in Multilingual Document Generation
Guillermo Barrutieta | Joseba Abaitua | JosuKa Díaz
COLING-02: The 2nd Workshop on NLP and XML (NLPXML-2002)

2001

pdf
Gross-grained RST through XML metadata for multilingual document generation
Guillermo Barrutieta | Joseba Abaitua | Josuha Díaz
Proceedings of Machine Translation Summit VIII

We present an RST-based discourse annotation proposal used in the construction of a trial multilingual XML-tagged corpus of teaching material in Basque, English and Spanish. The corpus feeds an experimental multilingual document generation system for the web. The main contributions of this paper are an implementation of RST through XML metadata and the adoption of gross-grained RST to avoid non-isomorphism in multilingual corpora.

2000

pdf
DTD-driven bilingual document generation
Arantza Casillas | Joseba Abaitua | Raquel Martínez
INLG’2000 Proceedings of the First International Conference on Natural Language Generation

pdf
Recycling annotated parallel corpora for bilingual document composition
Arantza Casillas | Joseba Abaitua | Raquel Martínez
Proceedings of the Fourth Conference of the Association for Machine Translation in the Americas: Technical Papers

Parallel corpora enriched with descriptive annotations facilitate multilingual authoring development. Departing from an annotated bitext we show how SGML markup can be recycled to produce complementary language resources. On the one hand, several translation memory databases together with glossaries of proper nouns have been produced. On the other, DTDs for source and target documents have been derived and put into correspondence. This paper discusses how these resources have been automatically generated and applied to an interactive bilingual authoring system. This tool is capable of handling a substantial proportion of text both in the composition and translation of structured documents.

1998

pdf
Bitext Correspondences through Rich Mark-up
Raquel Martinez | Joseba Abaitua | Arantza Casillas
COLING 1998 Volume 2: The 17th International Conference on Computational Linguistics

pdf
Aligning tagged bitexts
Raquel Martinez | Joseba Abaitua | Arantza Casillas
Sixth Workshop on Very Large Corpora

pdf
Bitext Correspondences through Rich Mark-up
Raquel Martinez | Joseba Abaitua | Arantza Casillas
36th Annual Meeting of the Association for Computational Linguistics and 17th International Conference on Computational Linguistics, Volume 2