Tianchun Li
2026
LegalDrill: Diagnosis-Driven Synthesis for Legal Reasoning in Small Language Models
Tianchun Li | Haochen Liu | Vishwa Pardeshi | Xingchen Wang | Tianci Liu | Huijun Zhao | Wei Fan | Jing Gao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Tianchun Li | Haochen Liu | Vishwa Pardeshi | Xingchen Wang | Tianci Liu | Huijun Zhao | Wei Fan | Jing Gao
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Small language models (SLMs) are promising for real-world deployment due to their efficiency and low operational cost. However, their limited capacity struggles with high-stakes legal reasoning tasks that require coherent statute interpretation and logically consistent deduction. Furthermore, training SLMs for such tasks demands high-quality, concise reasoning trajectories, which are prohibitively expensive to manually collect and difficult to curate via standard rejection sampling, which lacks granularity beyond final verdicts. To address these challenges, we propose LegalDrill, a diagnosis-driven synthesis framework that extracts and iteratively refines reasoning trajectories from a capable teacher via fine-grained prompting, then a self-reflective verification is employed to adaptively select the most effective data for the SLM student. The resulting data empower SLM training through supervised fine-tuning and direct preference optimization. Extensive experiments on several legal benchmarks demonstrate that LegalDrill significantly bolsters the legal reasoning capabilities of representative SLMs while bypassing the need for scarce expert annotations, paving a scalable path toward practical legal reasoning systems.
2025
Towards Universal Debiasing for Language Models-based Tabular Data Generation
Tianchun Li | Tianci Liu | Xingchen Wang | Rongzhe Wei | Pan Li | Lu Su | Jing Gao
Findings of the Association for Computational Linguistics: EMNLP 2025
Tianchun Li | Tianci Liu | Xingchen Wang | Rongzhe Wei | Pan Li | Lu Su | Jing Gao
Findings of the Association for Computational Linguistics: EMNLP 2025
Large language models (LLMs) have achieved promising results in tabular data generation. However, inherent historical biases in tabular datasets often cause LLMs to exacerbate fairness issues, particularly when multiple advantaged and protected features are involved. In this work, we introduce a universal debiasing framework that minimizes group-level dependencies by simultaneously reducing the mutual information between advantaged and protected attributes. By leveraging the autoregressive structure and analytic sampling distributions of LLM-based tabular data generators, our approach efficiently computes mutual information, reducing the need for cumbersome numerical estimations. Building on this foundation, we propose two complementary methods: a direct preference optimization (DPO)-based strategy, namely UDF-DPO, that integrates seamlessly with existing models, and a targeted debiasing technique, namely UDF-MIX, that achieves debiasing without tuning the parameters of LLMs. Extensive experiments demonstrate that our framework effectively balances fairness and utility, offering a scalable and practical solution for debiasing in high-stakes applications.