David A. Evans

Also published as: David Andreoff Evans

A Statistical Approach to Automatic OCR Error Correction in Context
Xiang Tong | David A. Evans
Fourth Workshop on Very Large Corpora

1993

pdf bib abs

Frequency Estimation of Verb Subcategorization Frames Based on Syntactic and Multidimensional Statistical Analysis
Akira Ushioda | David A. Evans | Ted Gibson | Alex Waibel
Proceedings of the Third International Workshop on Parsing Technologies

We describe a mechanism for automatically estimating frequencies of verb subcategorization frames in a large corpus. A tagged corpus is first partially parsed to identify noun phrases and then a regular grammar is used to estimate the appropriate subcategorization frame for each verb token in the corpus. In an experiment involving the identification of six fixed subcategorization frames, our current system showed more than 80% accuracy. In addition, a new statistical method enables the system to learn patterns of errors based on a set of training samples and substantially improves the accuracy of the frequency estimation.

pdf bib

The Automatic Acquisition of Frequencies of Verb Subcategorization Frames from Tagged Corpora
Akira Ushioda | David A. Evans | Ted Gibson | Alex Waibel
Acquisition of Lexical Knowledge from Text