Low-resource Cross-lingual Event Type Detection via Distant Supervision with Minimal Effort
Aldrian Obaja Muis, Naoki Otani, Nidhi Vyas, Ruochen Xu, Yiming Yang, Teruko Mitamura, Eduard Hovy
Abstract
The use of machine learning for NLP generally requires resources for training. Tasks performed in a low-resource language usually rely on labeled data in another, typically resource-rich, language. However, there might not be enough labeled data even in a resource-rich language such as English. In such cases, one approach is to use a hand-crafted approach that utilizes only a small bilingual dictionary with minimal manual verification to create distantly supervised data. Another is to explore typical machine learning techniques, for example adversarial training of bilingual word representations. We find that in event-type detection task—the task to classify [parts of] documents into a fixed set of labels—they give about the same performance. We explore ways in which the two methods can be complementary and also see how to best utilize a limited budget for manual annotation to maximize performance gain.- Anthology ID:
- C18-1007
- Original:
- C18-1007v1
- Version 2:
- C18-1007v2
- Volume:
- Proceedings of the 27th International Conference on Computational Linguistics
- Month:
- August
- Year:
- 2018
- Address:
- Santa Fe, New Mexico, USA
- Venue:
- COLING
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 70–82
- Language:
- URL:
- https://aclanthology.org/C18-1007
- DOI:
- Cite (ACL):
- Aldrian Obaja Muis, Naoki Otani, Nidhi Vyas, Ruochen Xu, Yiming Yang, Teruko Mitamura, and Eduard Hovy. 2018. Low-resource Cross-lingual Event Type Detection via Distant Supervision with Minimal Effort. In Proceedings of the 27th International Conference on Computational Linguistics, pages 70–82, Santa Fe, New Mexico, USA. Association for Computational Linguistics.
- Cite (Informal):
- Low-resource Cross-lingual Event Type Detection via Distant Supervision with Minimal Effort (Muis et al., COLING 2018)
- PDF:
- https://preview.aclanthology.org/ingestion-script-update/C18-1007.pdf