Annalu Waller
Also published as: A. Waller
2026
TamilMayangoliSpell: An Open-Source Neural Framework for Context-Sensitive Mayangoli Error Correction in Tamil
Yazhmozhi V M | Annalu Waller | Jacky Visser
Proceedings of the Sixth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages
Yazhmozhi V M | Annalu Waller | Jacky Visser
Proceedings of the Sixth Workshop on Speech, Vision, and Language Technologies for Dravidian Languages
Mayangoli errors are context-sensitive errors in Tamil that arise from confusion among phonetically similar graphemes (e.g., ல/ள/ழ, ர/ற, ந/ன/ண). These errors are challenging for conventional spell checkers because both incorrect and correct forms are valid dictionary words, making dictionary lookup insufficient and requiring contextual modelling. We present TamilMayangoliSpell, a reproducible framework for Mayangoli error correction that combines (i) Tamil-specific preprocessing for sentence segmentation and normalisation, (ii) linguistically grounded error induction for generating training data constrained by dictionary validity, and (iii) fine-tuning of multilingual sequence-to-sequence models. Using 30,000 sentence pairs derived from TamilCorp, a massive multi-genre Tamil corpus and split 80/10/10 into train/validation/test, we fine-tune mBART, mT5, and NLLB under a small hyperparameter grid using greedy decoding with a maximum sequence length of 128. mT5 achieves the best performance (BLEU 99.28; Exact Match Accuracy 93.50%) and remains strong in a cross-genre evaluation on short stories. The preprocessing scripts, generated parallel datasets, and trained models are publicly available in a GitHub repository.
2014
Proceedings of the 5th Workshop on Speech and Language Processing for Assistive Technologies
Jan Alexandersson | Dimitra Anastasiou | Cui Jian | Ani Nenkova | Rupal Patel | Frank Rudzicz | Annalu Waller | Desislava Zhekova
Proceedings of the 5th Workshop on Speech and Language Processing for Assistive Technologies
Jan Alexandersson | Dimitra Anastasiou | Cui Jian | Ani Nenkova | Rupal Patel | Frank Rudzicz | Annalu Waller | Desislava Zhekova
Proceedings of the 5th Workshop on Speech and Language Processing for Assistive Technologies
2012
Applying Prediction Techniques to Phoneme-based AAC Systems
Ha Trinh | Annalu Waller | Keith Vertanen | Per Ola Kristensson | Vicki L. Hanson
Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
Ha Trinh | Annalu Waller | Keith Vertanen | Per Ola Kristensson | Vicki L. Hanson
Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
Jan Alexandersson | Peter Ljunglöf | Kathleen F. McCoy | Brian Roark | Annalu Waller
Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
Jan Alexandersson | Peter Ljunglöf | Kathleen F. McCoy | Brian Roark | Annalu Waller
Proceedings of the Third Workshop on Speech and Language Processing for Assistive Technologies
2011
SLPAT Demo Session
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies
Proceedings of the Second Workshop on Speech and Language Processing for Assistive Technologies
2010
Using NLG and Sensors to Support Personal Narrative for Children with Complex Communication Needs
Rolf Black | Joseph Reddington | Ehud Reiter | Nava Tintarev | Annalu Waller
Proceedings of the NAACL HLT 2010 Workshop on Speech and Language Processing for Assistive Technologies
Rolf Black | Joseph Reddington | Ehud Reiter | Nava Tintarev | Annalu Waller
Proceedings of the NAACL HLT 2010 Workshop on Speech and Language Processing for Assistive Technologies
2009
Using NLG to Help Language-Impaired Users Tell Stories and Participate in Social Dialogues
Ehud Reiter | Ross Turner | Norman Alm | Rolf Black | Martin Dempster | Annalu Waller
Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009)
Ehud Reiter | Ross Turner | Norman Alm | Rolf Black | Martin Dempster | Annalu Waller
Proceedings of the 12th European Workshop on Natural Language Generation (ENLG 2009)
2006
Building a Lexical Database for an Interactive Joke-Generator
R. Manurung | D. O’Mara | H. Pain | G. Ritchie | A. Waller
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
R. Manurung | D. O’Mara | H. Pain | G. Ritchie | A. Waller
Proceedings of the Fifth International Conference on Language Resources and Evaluation (LREC’06)
As part of a project to construct an interactive program which will encourage children to play with language by building jokes, we have developed a large lexical database, closely based on WordNet. As well as the standard WordNet information about part of speech, synonymy, hyponymy, etc, we have added phonetic representations and symbolic links allowing attachment of pictures. All information is represented in a relational database, allowing powerful searches using SQL via a Java API. The lexicon has a facility to label subsets of the lexicon with symbolic names, and we are working to incorporate some educationally relevant word lists as sublexicons. This should also allow us to improve the familiarity ratings which the lexicon assigns to words.
2004
Search
Fix author
Co-authors
- Jan Alexandersson 2
- Rolf Black 2
- Ehud Reiter 2
- Norman Alm 1
- Dimitra Anastasiou 1
- Martin Dempster 1
- Vicki L. Hanson 1
- Kris Jack 1
- Cui Jian 1
- Per Ola Kristensson 1
- Peter Ljunglöf 1
- Yazhmozhi V M 1
- Ruli Manurung 1
- Kathleen F. McCoy 1
- Ani Nenkova 1
- Dave O’mara 1
- Helen Pain 1
- Rupal Patel 1
- Joseph Reddington 1
- Chris Reed 1
- Graeme Ritchie 1
- Brian Roark 1
- Frank Rudzicz 1
- Nava Tintarev 1
- Ha Trinh 1
- Ross Turner 1
- Keith Vertanen 1
- Jacky Visser 1
- Desislava Zhekova 1