Abstract
There are a number of collocational constraints in natural languages that ought to play a more important role in natural language parsers. Thus, for example, it is hard for most parsers to take advantage of the fact that wine is typically drunk, produced, and sold, but (probably) not pruned. So too, it is hard for a parser to know which verbs go with which prepositions (e.g., set up) and which nouns fit together to form compound noun phrases (e.g., computer programmer). This paper will attempt to show that many of these types of concerns can be addressed with syntactic methods (symbol pushing), and need not require explicit semantic interpretation. We have found that it is possible to identify many of these interesting co-occurrence relations by computing simple summary statistics over millions of words of text. This paper will summarize a number of experiments carried out by various subsets of the authors over the last few years. The term collocation will be used quite broadly to include constraints on SVO (subject verb object) triples, phrasal verbs, compound noun phrases, and psychoiinguistic notions of word association (e.g., doctor/nurse).- Anthology ID:
- W89-0240
- Volume:
- Proceedings of the First International Workshop on Parsing Technologies
- Month:
- August
- Year:
- 1989
- Address:
- Pittsburgh, Pennsylvania, USA
- Editor:
- Masaru Tomita
- Venue:
- IWPT
- SIG:
- SIGPARSE
- Publisher:
- Carnegy Mellon University
- Note:
- Pages:
- 389–398
- Language:
- URL:
- https://aclanthology.org/W89-0240
- DOI:
- Cite (ACL):
- Kenneth Church, William Gale, Patrick Hanks, and Donald Hindle. 1989. Parsing, Word Associations and Typical Predicate-Argument Relations. In Proceedings of the First International Workshop on Parsing Technologies, pages 389–398, Pittsburgh, Pennsylvania, USA. Carnegy Mellon University.
- Cite (Informal):
- Parsing, Word Associations and Typical Predicate-Argument Relations (Church et al., IWPT 1989)
- PDF:
- https://preview.aclanthology.org/nschneid-patch-2/W89-0240.pdf