@inproceedings{riordan-etal-2019-account,
    title = "How to account for mispellings: Quantifying the benefit of character representations in neural content scoring models",
    author = "Riordan, Brian  and
      Flor, Michael  and
      Pugh, Robert",
    editor = "Yannakoudakis, Helen  and
      Kochmar, Ekaterina  and
      Leacock, Claudia  and
      Madnani, Nitin  and
      Pil{\'a}n, Ildik{\'o}  and
      Zesch, Torsten",
    booktitle = "Proceedings of the Fourteenth Workshop on Innovative Use of NLP for Building Educational Applications",
    month = aug,
    year = "2019",
    address = "Florence, Italy",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/W19-4411/",
    doi = "10.18653/v1/W19-4411",
    pages = "116--126",
    abstract = "Character-based representations in neural models have been claimed to be a tool to overcome spelling variation in in word token-based input. We examine this claim in neural models for content scoring. We formulate precise hypotheses about the possible effects of adding character representations to word-based models and test these hypotheses on large-scale real world content scoring datasets. We find that, while character representations may provide small performance gains in general, their effectiveness in accounting for spelling variation may be limited. We show that spelling correction can provide larger gains than character representations, and that spelling correction improves the performance of models with character representations. With these insights, we report a new state of the art on the ASAP-SAS content scoring dataset."
}Markdown (Informal)
[How to account for mispellings: Quantifying the benefit of character representations in neural content scoring models](https://preview.aclanthology.org/ingest-emnlp/W19-4411/) (Riordan et al., BEA 2019)
ACL