@inproceedings{gao-etal-2021-hierarchical,
    title = "Hierarchical Character Tagger for Short Text Spelling Error Correction",
    author = "Gao, Mengyi  and
      Xu, Canran  and
      Shi, Peng",
    editor = "Xu, Wei  and
      Ritter, Alan  and
      Baldwin, Tim  and
      Rahimi, Afshin",
    booktitle = "Proceedings of the Seventh Workshop on Noisy User-generated Text (W-NUT 2021)",
    month = nov,
    year = "2021",
    address = "Online",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2021.wnut-1.13/",
    doi = "10.18653/v1/2021.wnut-1.13",
    pages = "106--113",
    abstract = "State-of-the-art approaches to spelling error correction problem include Transformer-based Seq2Seq models, which require large training sets and suffer from slow inference time; and sequence labeling models based on Transformer encoders like BERT, which involve token-level label space and therefore a large pre-defined vocabulary dictionary. In this paper we present a Hierarchical Character Tagger model, or HCTagger, for short text spelling error correction. We use a pre-trained language model at the character level as a text encoder, and then predict character-level edits to transform the original text into its error-free form with a much smaller label space. For decoding, we propose a hierarchical multi-task approach to alleviate the issue of long-tail label distribution without introducing extra model parameters. Experiments on two public misspelling correction datasets demonstrate that HCTagger is an accurate and much faster approach than many existing models."
}Markdown (Informal)
[Hierarchical Character Tagger for Short Text Spelling Error Correction](https://preview.aclanthology.org/ingest-emnlp/2021.wnut-1.13/) (Gao et al., WNUT 2021)
ACL