Licon: A Diverse, Controllable and Challenging Linguistic Concept Learning Benchmark
Shenglong Yu, Ying Zhang, Wenya Guo, Zhengkun Zhang, Ru Zhou, Xiaojie Yuan
Abstract
Concept Learning requires learning the definition of a general category from given training examples. Most of the existing methods focus on learning concepts from images. However, the visual information cannot present abstract concepts exactly, which struggles the introduction of novel concepts related to known concepts (e.g., ‘Plant’→‘Asteroids’). In this paper, inspired by the fact that humans learn most concepts through linguistic description, we introduce Linguistic Concept Learning benchmark (Licon), where concepts in diverse forms (e.g., plain attributes, images, and text) are defined by linguistic descriptions. The difficulty to learn novel concepts can be controlled by the number of attributes or the hierarchical relationships between concepts. The diverse and controllable concepts are used to support challenging evaluation tasks, including concept classification, attribute prediction, and concept relationship recognition. In addition, we design an entailment-based concept learning method (EnC) to model the relationship among concepts. Extensive experiments demonstrate the effectiveness of EnC. The benchmark will be released to the public soon.- Anthology ID:
- 2023.findings-emnlp.358
- Volume:
- Findings of the Association for Computational Linguistics: EMNLP 2023
- Month:
- December
- Year:
- 2023
- Address:
- Singapore
- Editors:
- Houda Bouamor, Juan Pino, Kalika Bali
- Venue:
- Findings
- SIG:
- Publisher:
- Association for Computational Linguistics
- Note:
- Pages:
- 5383–5398
- Language:
- URL:
- https://aclanthology.org/2023.findings-emnlp.358
- DOI:
- 10.18653/v1/2023.findings-emnlp.358
- Cite (ACL):
- Shenglong Yu, Ying Zhang, Wenya Guo, Zhengkun Zhang, Ru Zhou, and Xiaojie Yuan. 2023. Licon: A Diverse, Controllable and Challenging Linguistic Concept Learning Benchmark. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 5383–5398, Singapore. Association for Computational Linguistics.
- Cite (Informal):
- Licon: A Diverse, Controllable and Challenging Linguistic Concept Learning Benchmark (Yu et al., Findings 2023)
- PDF:
- https://preview.aclanthology.org/ingest-acl-2023-videos/2023.findings-emnlp.358.pdf