Ciyang Qing
2026
MultiCoS: A Multilingual Dataset of Connective Semantics with Context–Sentence Compatibility
Anne Mucha | Ciyang Qing | Wataru Uegaki
Proceedings of the Fifteenth Language Resources and Evaluation Conference
Anne Mucha | Ciyang Qing | Wataru Uegaki
Proceedings of the Fifteenth Language Resources and Evaluation Conference
We present a multilingual dataset of connective semantics. The dataset contains the semantic annotations of clausal connectives (e.g. and and or in English) from 24 languages, based on our original native-speaker elicitation data. Unlike existing lexica on connectives, the dataset includes systematic evidence for the annotations in the form of context-sentence compatibility judgments, including negative evidence. The paper describes the methodology of data collection and the format of the dataset. We also discuss its potential use cases for the validation of cross-linguistic generalizations, examinations of their potential counterexamples, and for benchmarking felicity judgments by NLU systems.
2025
Automatic Extraction of Clausal Embedding Based on Large-Scale English Text Data
Iona Carslaw | Sivan Milton | Nicolas Navarre | Ciyang Qing | Wataru Uegaki
Proceedings of the Society for Computation in Linguistics 2025
Iona Carslaw | Sivan Milton | Nicolas Navarre | Ciyang Qing | Wataru Uegaki
Proceedings of the Society for Computation in Linguistics 2025
2023
A Crosslinguistic Database for Combinatorial and Semantic Properties of Attitude Predicates
Deniz Özyıldız | Ciyang Qing | Floris Roelofsen | Maribel Romero | Wataru Uegaki
Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
Deniz Özyıldız | Ciyang Qing | Floris Roelofsen | Maribel Romero | Wataru Uegaki
Proceedings of the 5th Workshop on Research in Computational Linguistic Typology and Multilingual NLP
We introduce a cross-linguistic database for attitude predicates, which references their combinatorial (syntactic) and semantic properties. Our data allows assessment of cross-linguistic generalizations about attitude predicates as well as discovery of new typological/cross-linguistic patterns. This paper motivates empirical and theoretical issues that our database will help to address, the sample predicates and the properties that it references, as well as our design and methodological choices. Two case studies illustrate how the database can be used to assess validity of cross-linguistic generalizations.