Donia Scott

Also published as: D Scott, Donia R. Scott

Annotation studies in CL are generally unscientific: they are mostly not reproducible, make use of too few (and often non-independent) annotators and use guidelines that are often something of a moving target. Additionally, the notion of ‘expert annotators’ invariably means only that the annotators have linguistic training. While this can be acceptable in some special contexts, it is often far from ideal. This is particularly the case when subtle judgements are required or when, as increasingly, one is making use of corpora originating from technical texts that have been produced by, and intended to be consumed by, an audience of technical experts in the field. We outline a more rigorous approach to collecting human annotations, using as our example a study designed to capture judgements on the meaning of hedge words in medical records.

KBGen – Text Generation from Knowledge Bases as a New Shared Task
Eva Banik | Claire Gardent | Donia Scott | Nikhil Dinesh | Fennie Liang
INLG 2012 Proceedings of the Seventh International Natural Language Generation Conference

2011

Unlocking Medical Ontologies for Non-Ontology Experts
Shao Fen Liang | Donia Scott | Robert Stevens | Alan Rector
Proceedings of BioNLP 2011 Workshop

2008

Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)
Donia Scott | Hans Uszkoreit
Proceedings of the 22nd International Conference on Computational Linguistics (Coling 2008)

Coling 2008: Companion volume: Posters
Donia Scott | Hans Uszkoreit
Coling 2008: Companion volume: Posters

Can we Evaluate the Quality of Generated Text?
David Hardcastle | Donia Scott
Proceedings of the Sixth International Conference on Language Resources and Evaluation (LREC'08)

Evaluating the output of NLG systems is notoriously difficult, and performing assessments of text quality even more so. A range of automated and subject-based approaches to the evaluation of text quality have been taken, including comparison with a putative gold standard text, analysis of specific linguistic features of the output, expert review and task-based evaluation. In this paper we present the results of a variety of such approaches in the context of a case study application. We discuss the problems encountered in the implementation of each approach in the context of the literature, and propose that a test based on the Turing test for machine intelligence offers a way forward in the evaluation of the subjective notion of text quality.

2007

Composing Questions through Conceptual Authoring
Catalina Hallett | Donia Scott | Richard Power
Computational Linguistics, Volume 33, Number 1, March 2007

Visualising Discourse Structure in Interactive Documents
Clara Mancini | Christian Pietsch | Donia Scott
Proceedings of the Eleventh European Workshop on Natural Language Generation (ENLG 07)

2006

Computational Approaches to Discourse and Document Processing
Marie-Paule Péry-Woodley | Donia Scott
Traitement Automatique des Langues, Volume 47, Numéro 2 : Discours et document : traitements automatiques [Computational Approaches to Discourse and Document Processing]

Visualising discourse coherence in nonlinear documents
Clara Mancini | Donia Scott | Simon Buckingham Shum
Traitement Automatique des Langues, Volume 47, Numéro 2 : Discours et document : traitements automatiques [Computational Approaches to Discourse and Document Processing]

2005

Structural variation in generated health reports
Catalina Hallett | Donia Scott
Proceedings of the Third International Workshop on Paraphrasing (IWP2005)

Automatic generation of large-scale paraphrases
Richard Power | Donia Scott
Proceedings of the Third International Workshop on Paraphrasing (IWP2005)

2003

Multilingual generation of controlled languages
Richard Power | Donia Scott | Anthony Hartley
EAMT Workshop: Improving MT through other language technology tools: resources and tools for building MT

Document Structure
Richard Power | Donia Scott | Nadjet Bouayad-Agha
Computational Linguistics, Volume 29, Number 2, June 2003

2002

PILLS: Multilingual generation of medical information documents with overlapping content
Nadjet Bouayad-Agha | Richard Power | Donia Scott | Anja Belz
Proceedings of the Third International Conference on Language Resources and Evaluation (LREC’02)

2001

AGILE - a system for multilingual generation of technical instructions
Anthony Hartley | Donia Scott | John Bateman | Danail Dochev
Proceedings of Machine Translation Summit VIII

This paper presents a multilingual Natural Language Generation system that produces technical instruction texts in Bulgarian, Czech and Russian. It generates several types of texts, common for software manuals, in two styles. We illustrate the system’s functionality with examples of its input and output behaviour. We discuss the criteria and procedures adopted for evaluating the system and summarise their results. The system embodies novel approaches to providing multilingual documentation, ranging from the re-use of a large-scale, broad coverage grammar of English in order to develop the lexico-grammatical resources necessary for the generation in the three target languages, through to the adoption of a ‘knowledge editing’ approach to specifying the desired content of the texts to be generated independently of the target languages in which those texts finally appear.
