Diego Molla

Also published as: Diego Molla-Aliod, Diego Mollá, Diego Mollá Aliod, Diego Mollá-Aliod


2021

pdf bib
Overview of the 2021 ALTA Shared Task: Automatic Grading of Evidence, 10 years later
Diego Mollá
Proceedings of the The 19th Annual Workshop of the Australasian Language Technology Association

The 2021 ALTA shared task is the 12th instance of a series of shared tasks organised by ALTA since 2010. Motivated by the advances in machine learning in the last 10 years, this year�s task is a re-visit of the 2011 ALTA shared task. Set within the framework of Evidence Based Medicine (EBM), the goal is to predict the qual-ity of the clinical evidence present in a set of documents. This year�s participant results didnot improve over those of participants from 2011.

pdf bib
Demonstrating the Reliability of Self-Annotated Emotion Data
Anton Malko | Cecile Paris | Andreas Duenser | Maria Kangas | Diego Molla | Ross Sparks | Stephen Wan
Proceedings of the Seventh Workshop on Computational Linguistics and Clinical Psychology: Improving Access

Vent is a specialised iOS/Android social media platform with the stated goal to encourage people to post about their feelings and explicitly label them. In this paper, we study a snapshot of more than 100 million messages obtained from the developers of Vent, together with the labels assigned by the authors of the messages. We establish the quality of the self-annotated data by conducting a qualitative analysis, a vocabulary based analysis, and by training and testing an emotion classifier. We conclude that the self-annotated labels of our corpus are indeed indicative of the emotional contents expressed in the text and thus can support more detailed analyses of emotion expression on social media, such as emotion trajectories and factors influencing them.

2020

pdf bib
Overview of the 2020 ALTA Shared Task: Assess Human Behaviour
Diego Mollá
Proceedings of the The 18th Annual Workshop of the Australasian Language Technology Association

The 2020 ALTA shared task is the 11th in stance of a series of shared tasks organised by ALTA since 2010. The task is to classify texts posted in social media according to human judgements expressed in them. The data used for this task is a subset of SemEval 2018 AIT DISC, which has been annotated by domain experts for this task. In this paper we introduce the task, describe the data and present the results of participating systems.

2019

pdf bib
Overview of the 2019 ALTA Shared Task: Sarcasm Target Identification
Diego Molla | Aditya Joshi
Proceedings of the The 17th Annual Workshop of the Australasian Language Technology Association

We present an overview of the 2019 ALTA shared task. This is the 10th of the series of shared tasks organised by ALTA since 2010. The task was to detect the target of sarcastic comments posted on social media. We intro- duce the task, describe the data and present the results of baselines and participants. This year’s shared task was particularly challenging and no participating systems improved the re- sults of our baseline.

2018

pdf bib
Macquarie University at BioASQ 6b: Deep learning and deep reinforcement learning for query-based summarisation
Diego Mollá
Proceedings of the 6th BioASQ Workshop A challenge on large-scale biomedical semantic indexing and question answering

This paper describes Macquarie University’s contribution to the BioASQ Challenge (BioASQ 6b, Phase B). We focused on the extraction of the ideal answers, and the task was approached as an instance of query-based multi-document summarisation. In particular, this paper focuses on the experiments related to the deep learning and reinforcement learning approaches used in the submitted runs. The best run used a deep learning model under a regression-based framework. The deep learning architecture used features derived from the output of LSTM chains on word embeddings, plus features based on similarity with the query, and sentence position. The reinforcement learning approach was a proof-of-concept prototype that trained a global policy using REINFORCE. The global policy was implemented as a neural network that used tf.idf features encoding the candidate sentence, question, and context.

pdf bib
Supervised Machine Learning for Extractive Query Based Summarisation of Biomedical Data
Mandeep Kaur | Diego Mollá
Proceedings of the Ninth International Workshop on Health Text Mining and Information Analysis

The automation of text summarisation of biomedical publications is a pressing need due to the plethora of information available online. This paper explores the impact of several supervised machine learning approaches for extracting multi-document summaries for given queries. In particular, we compare classification and regression approaches for query-based extractive summarisation using data provided by the BioASQ Challenge. We tackled the problem of annotating sentences for training classification systems and show that a simple annotation approach outperforms regression-based summarisation.

pdf bib
Overview of the 2018 ALTA Shared Task: Classifying Patent Applications
Diego Mollá | Dilesha Seneviratne
Proceedings of the Australasian Language Technology Association Workshop 2018

We present an overview of the 2018 ALTA shared task. This is the 9th of the series of shared tasks organised by ALTA since 2010. The task was to classify Australian patent classifications following the sections defined by the International Patient Classification (IPC), using data made available by IP Australia. We introduce the task, describe the data and present the results of the participating teams. Some of the participating teams outperformed state of the art.

2017

pdf bib
On Extending Neural Networks with Loss Ensembles for Text Classification
Hamideh Hajiabadi | Diego Molla-Aliod | Reza Monsefi
Proceedings of the Australasian Language Technology Association Workshop 2017

pdf bib
Towards the Use of Deep Reinforcement Learning with Global Policy for Query-based Extractive Summarisation
Diego Mollá-Aliod
Proceedings of the Australasian Language Technology Association Workshop 2017

pdf bib
Overview of the 2017 ALTA Shared Task: Correcting OCR Errors
Diego Mollá-Aliod | Steve Cassidy
Proceedings of the Australasian Language Technology Association Workshop 2017

pdf bib
Macquarie University at BioASQ 5b – Query-based Summarisation Techniques for Selecting the Ideal Answers
Diego Mollá
BioNLP 2017

Macquarie University’s contribution to the BioASQ challenge (Task 5b Phase B) focused on the use of query-based extractive summarisation techniques for the generation of the ideal answers. Four runs were submitted, with approaches ranging from a trivial system that selected the first n snippets, to the use of deep learning approaches under a regression framework. Our experiments and the ROUGE results of the five test batches of BioASQ indicate surprisingly good results for the trivial approach. Overall, most of our runs on the first three test batches achieved the best ROUGE-SU4 results in the challenge.

2016

pdf bib
Semi-supervised Clustering of Medical Text
Pracheta Sahoo | Asif Ekbal | Sriparna Saha | Diego Mollá | Kaushik Nandan
Proceedings of the Clinical Natural Language Processing Workshop (ClinicalNLP)

Semi-supervised clustering is an attractive alternative for traditional (unsupervised) clustering in targeted applications. By using the information of a small annotated dataset, semi-supervised clustering can produce clusters that are customized to the application domain. In this paper, we present a semi-supervised clustering technique based on a multi-objective evolutionary algorithm (NSGA-II-clus). We apply this technique to the task of clustering medical publications for Evidence Based Medicine (EBM) and observe an improvement of the results against unsupervised and other semi-supervised clustering techniques.

pdf bib
Overview of the 2016 ALTA Shared Task: Cross-KB Coreference
Andrew Chisholm | Ben Hachey | Diego Mollá
Proceedings of the Australasian Language Technology Association Workshop 2016

2015

pdf bib
Query-Based Single Document Summarization Using an Ensemble Noisy Auto-Encoder
Mahmood Yousefi Azar | Kairit Sirts | Diego Mollá Aliod | Len Hamey
Proceedings of the Australasian Language Technology Association Workshop 2015

pdf bib
Similarity Metrics for Clustering PubMed Abstracts for Evidence Based Medicine
Hamed Hassanzadeh | Diego Mollá | Tudor Groza | Anthony Nguyen | Jane Hunter
Proceedings of the Australasian Language Technology Association Workshop 2015

pdf bib
Overview of the 2015 ALTA Shared Task: Identifying French Cognates in English Text
Laurianne Sitbon | Diego Molla | Haoxing Wang
Proceedings of the Australasian Language Technology Association Workshop 2015

2014

pdf bib
Impact of Citing Papers for Summarisation of Clinical Documents
Diego Mollá | Christopher Jones | Abeed Sarker
Proceedings of the Australasian Language Technology Association Workshop 2014

pdf bib
Overview of the 2014 ALTA Shared Task: Identifying Expressions of Locations in Tweets
Diego Molla | Sarvnaz Karimi
Proceedings of the Australasian Language Technology Association Workshop 2014

2013

pdf bib
Learning from OzCLO, the Australian Computational and Linguistics Olympiad
Dominique Estival | John Henderson | Mary Laughren | Diego Mollá | Cathy Bow | Rachel Nordlinger | Verna Rieschild | Andrea C. Schalley | Alexander W. Stanley | Colette Mrowa-Hopkins
Proceedings of the Fourth Workshop on Teaching NLP and CL

pdf bib
Automatic Prediction of Evidence-based Recommendations via Sentence-level Polarity Classification
Abeed Sarker | Diego Mollá-Aliod | Cécile Paris
Proceedings of the Sixth International Joint Conference on Natural Language Processing

pdf bib
Multi-Objective Optimization for Clustering of Medical Publications
Asif Ekbal | Sriparna Saha | Diego Mollá | K Ravikumar
Proceedings of the Australasian Language Technology Association Workshop 2013 (ALTA 2013)

pdf bib
Overview of the 2013 ALTA Shared Task
Diego Molla
Proceedings of the Australasian Language Technology Association Workshop 2013 (ALTA 2013)

2012

pdf bib
Proceedings of the First International Workshop on Optimization Techniques for Human Language Technology
Pushpak Bhattacharyya | Asif Ekbal | Sriparna Saha | Mark Johnson | Diego Molla-Aliod | Mark Dras
Proceedings of the First International Workshop on Optimization Techniques for Human Language Technology

pdf bib
Towards Two-step Multi-document Summarisation for Evidence Based Medicine: A Quantitative Analysis
Abeed Sarker | Diego Mollá-Aliod | Cécile Paris
Proceedings of the Australasian Language Technology Association Workshop 2012

pdf bib
Overview of the ALTA 2012 Shared Task
Iman Amini | David Martinez | Diego Molla
Proceedings of the Australasian Language Technology Association Workshop 2012

pdf bib
Experiments with Clustering-based Features for Sentence Classification in Medical Publications: Macquarie Test’s participation in the ALTA 2012 shared task.
Diego Mollá
Proceedings of the Australasian Language Technology Association Workshop 2012

2011

pdf bib
Proceedings of the Australasian Language Technology Association Workshop 2011
Diego Molla | David Martinez
Proceedings of the Australasian Language Technology Association Workshop 2011

pdf bib
Automatic Grading of Evidence: the 2011 ALTA Shared Task
Diego Molla | Abeed Sarker
Proceedings of the Australasian Language Technology Association Workshop 2011

pdf bib
Development of a Corpus for Evidence Based Medicine Summarisation
Diego Molla | Maria Elena Santiago-Martinez
Proceedings of the Australasian Language Technology Association Workshop 2011

pdf bib
Outcome Polarity Identification of Medical Papers
Abeed Sarker | Diego Molla | Cécile Paris
Proceedings of the Australasian Language Technology Association Workshop 2011

2010

pdf bib
A Corpus for Evidence Based Medicine Summarisation
Diego Molla
Proceedings of the Australasian Language Technology Association Workshop 2010

2008

pdf bib
Indexing on Semantic Roles for Question Answering
Luiz Augusto Pizzato | Diego Mollá
Coling 2008: Proceedings of the 2nd workshop on Information Retrieval for Question Answering

2007

pdf bib
Question Answering in Restricted Domains: An Overview
Diego Mollá | José Luis Vicedo
Computational Linguistics, Volume 33, Number 1, March 2007

pdf bib
Named Entity Recognition in Question Answering of Speech Data
Diego Mollá | Menno van Zaanen | Steve Cassidy
Proceedings of the Australasian Language Technology Workshop 2007

pdf bib
Question Prediction Language Model
Luiz Augusto Pizzato | Diego Mollá
Proceedings of the Australasian Language Technology Workshop 2007

2006

pdf bib
Learning of Graph-based Question Answering Rules
Diego Mollá
Proceedings of TextGraphs: the First Workshop on Graph Based Methods for Natural Language Processing

pdf bib
Named Entity Recognition for Question Answering
Diego Mollá | Menno van Zaanen | Daniel Smith
Proceedings of the Australasian Language Technology Workshop 2006

pdf bib
Pseudo Relevance Feedback Using Named Entities for Question Answering
Luiz Augusto Pizzato | Diego Mollá | Cécile Paris
Proceedings of the Australasian Language Technology Workshop 2006

2005

pdf bib
Learning of Graph Rules for Question Answering
Diego Molla | Menno van Zaanen
Proceedings of the Australasian Language Technology Workshop 2005

pdf bib
Extracting Exact Answers using a Meta Question Answering System
Luiz Augusto Pizzato | Diego Molla
Proceedings of the Australasian Language Technology Workshop 2005

2004

pdf bib
Answerfinder: Question Answering by Combining Lexical, Syntactic and Semantic Information
Diego Molla | Mary Gardiner
Proceedings of the Australasian Language Technology Workshop 2004

2003

pdf bib
Towards semantic-based overlap measures for question-answering
Diego Mollá
Proceedings of the Australasian Language Technology Workshop 2003

pdf bib
Exploiting Paraphrases in a Question Answering System
Fabio Rinaldi | James Dowdall | Kaarel Kaljurand | Michael Hess | Diego Mollá
Proceedings of the Second International Workshop on Paraphrasing

pdf bib
Intrinsic versus Extrinsic Evaluations of Parsing Systems
Diego Mollá | Ben Hutchinson
Proceedings of the EACL 2003 Workshop on Evaluation Initiatives in Natural Language Processing: are evaluation methods, metrics and resources reusable?

2002

pdf bib
Evangelising Language Technology: A Practically-Focussed Undergraduate Program
Robert Dale | Diego Mollá Aliod | Rolf Schwitter
Proceedings of the ACL-02 Workshop on Effective Tools and Methodologies for Teaching Natural Language Processing and Computational Linguistics

2000

pdf bib
Answer Extraction Towards better Evaluations of NLP Systems
Rolf Schwitter | Diego Molla | Rachel Fournier | Michael Hess
ANLP-NAACL 2000 Workshop: Reading Comprehension Tests as Evaluation for Computer-Based Language Understanding Systems