Terry Lyons
2024
Sig-Networks Toolkit: Signature Networks for Longitudinal Language Modelling
Talia Tseriotou | Ryan Chan | Adam Tsakalidis | Iman Munire Bilal | Elena Kochkina | Terry Lyons | Maria Liakata
Proceedings of the 18th Conference of the European Chapter of the Association for Computational Linguistics: System Demonstrations
We present Sig-Networks, an open-source, pip-installable toolkit, the first of its kind for longitudinal language modelling. A central focus is the incorporation of Signature-based Neural Network models, which have recently shown success in temporal tasks. We apply and extend published research to provide a full suite of signature-based models, whose components can be used as PyTorch building blocks in future architectures. Sig-Networks enables task-agnostic dataset plug-in, seamless preprocessing for sequential data, parameter flexibility, and automated tuning across a range of models. We examine signature networks on three NLP tasks of varying temporal granularity: counselling conversations, rumour stance switch, and mood changes in social media threads, showing SOTA performance in all three, and we provide guidance for future tasks. We release the toolkit as a PyTorch package with an introductory video, together with Git repositories for preprocessing and modelling, including sample notebooks on the modelled NLP tasks.
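To make the "signature as a PyTorch building block" idea concrete, here is a minimal sketch in plain PyTorch of the depth-2 path signature that such components compute. The function name `signature_depth2` is hypothetical, not the toolkit's actual API; the released package should be consulted for real usage.

```python
import torch

def signature_depth2(path: torch.Tensor) -> torch.Tensor:
    """Depth-2 path signature of a batch of piecewise-linear paths.

    path: (batch, length, channels) tensor.
    Returns a (batch, channels + channels**2) tensor holding the
    level-1 term and the flattened level-2 term.
    """
    # Increments of the path between consecutive time steps.
    inc = path[:, 1:, :] - path[:, :-1, :]          # (B, L-1, C)
    # Level 1: total increment, i.e. path end minus path start.
    level1 = inc.sum(dim=1)                          # (B, C)
    # Exclusive prefix sums: everything strictly before each step.
    prefix = torch.cumsum(inc, dim=1) - inc          # (B, L-1, C)
    # Level 2 for piecewise-linear paths:
    # sum_{k<l} inc_k (x) inc_l + 0.5 * sum_k inc_k (x) inc_k
    cross = torch.einsum('bli,blj->bij', prefix, inc)
    diag = 0.5 * torch.einsum('bli,blj->bij', inc, inc)
    level2 = (cross + diag).flatten(1)               # (B, C*C)
    return torch.cat([level1, level2], dim=1)
```

The output is a fixed-size, order-sensitive summary of a variable-length sequence, which is what makes signatures convenient as drop-in feature extractors for longitudinal models.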
2020
Information Extraction from Swedish Medical Prescriptions with Sig-Transformer Encoder
John Pougué Biyong | Bo Wang | Terry Lyons | Alejo Nevado-Holgado
Proceedings of the 3rd Clinical Natural Language Processing Workshop
Relying on large pretrained language models such as Bidirectional Encoder Representations from Transformers (BERT) for encoding, and adding a simple prediction layer, has led to impressive performance in many clinical natural language processing (NLP) tasks. In this work, we present a novel extension to the Transformer architecture that incorporates the signature transform with the self-attention model. This architecture is added between the embedding and prediction layers. Experiments on new Swedish prescription data show the proposed architecture to be superior to baseline models in two of the three information extraction tasks. Finally, we evaluate two different embedding approaches: applying Multilingual BERT, and translating the Swedish text to English and then encoding it with a BERT model pretrained on clinical notes.
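As an illustrative sketch of the described placement, assuming hypothetical names rather than the paper's released code, the block below applies self-attention to token embeddings and then pools the attended sequence with a depth-2 signature before a prediction head:

```python
import torch
import torch.nn as nn

class SigTransformerBlock(nn.Module):
    """Sketch (hypothetical, not the paper's exact module): self-attention
    over token embeddings, depth-2 signature pooling of the attended
    sequence, then a linear prediction head."""

    def __init__(self, dim: int, n_heads: int, n_classes: int):
        super().__init__()
        self.attn = nn.MultiheadAttention(dim, n_heads, batch_first=True)
        # Depth-2 signature of a D-channel path has D + D*D features.
        self.head = nn.Linear(dim + dim * dim, n_classes)

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        attended, _ = self.attn(x, x, x)                 # (B, L, D)
        # Treat the attended sequence as a path and take its
        # depth-2 signature as a fixed-size representation.
        inc = attended[:, 1:, :] - attended[:, :-1, :]   # path increments
        level1 = inc.sum(dim=1)                          # (B, D)
        prefix = torch.cumsum(inc, dim=1) - inc          # exclusive prefix sums
        level2 = (torch.einsum('bli,blj->bij', prefix, inc)
                  + 0.5 * torch.einsum('bli,blj->bij', inc, inc)).flatten(1)
        return self.head(torch.cat([level1, level2], dim=1))
```

For example, `SigTransformerBlock(dim=64, n_heads=4, n_classes=3)` maps a `(batch, length, 64)` tensor of embeddings to `(batch, 3)` logits, sitting between the embedding layer and the prediction layer as the abstract describes.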