Language Models as Causal Effect Generators

Lucius E.j. Bynum; Kyunghyun Cho

Language Models as Causal Effect Generators

Abstract

In this work, we present sequence-driven structural causal models (SD-SCMs), a framework for specifying causal models with user-defined structure and language-model-defined mechanisms. We characterize how an SD-SCM enables sampling from observational, interventional, and counterfactual distributions according to the desired causal structure. We then leverage this procedure to propose a new type of benchmark for causal inference methods, generating individual-level counterfactual data to test treatment effect estimation. We create an example benchmark consisting of thousands of datasets, and test a suite of popular estimation methods for average, conditional average, and individual treatment effect estimation. We find under this benchmark that (1) causal methods outperform non-causal methods and that (2) even state-of-the-art methods struggle with individualized effect estimation, suggesting this benchmark captures some inherent difficulties in causal estimation. Apart from generating data, this same technique can underpin the auditing of language models for (un)desirable causal effects, such as misinformation or discrimination. We believe SD-SCMs can serve as a useful tool in any application that would benefit from sequential data with controllable causal structure.

Anthology ID:: 2025.emnlp-main.107
Volume:: Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing
Month:: November
Year:: 2025
Address:: Suzhou, China
Editors:: Christos Christodoulopoulos, Tanmoy Chakraborty, Carolyn Rose, Violet Peng
Venue:: EMNLP
SIG:
Publisher:: Association for Computational Linguistics
Note:
Pages:: 2096–2115
Language:
URL:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.107/
DOI:
Bibkey:
Cite (ACL):: Lucius E.j. Bynum and Kyunghyun Cho. 2025. Language Models as Causal Effect Generators. In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing, pages 2096–2115, Suzhou, China. Association for Computational Linguistics.
Cite (Informal):: Language Models as Causal Effect Generators (Bynum & Cho, EMNLP 2025)
Copy Citation:
PDF:: https://preview.aclanthology.org/ingest-emnlp/2025.emnlp-main.107.pdf
Checklist:: 2025.emnlp-main.107.checklist.pdf

PDF Cite Search Checklist Fix data