@inproceedings{ngong-etal-2025-differentially,
    title = "Differentially Private Learning Needs Better Model Initialization and Self-Distillation",
    author = "Ngong, Ivoline C.  and
      Near, Joseph  and
      Mireshghallah, Niloofar",
    editor = "Chiruzzo, Luis  and
      Ritter, Alan  and
      Wang, Lu",
    booktitle = "Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers)",
    month = apr,
    year = "2025",
    address = "Albuquerque, New Mexico",
    publisher = "Association for Computational Linguistics",
    url = "https://preview.aclanthology.org/ingest-emnlp/2025.naacl-long.455/",
    doi = "10.18653/v1/2025.naacl-long.455",
    pages = "9009--9027",
    ISBN = "979-8-89176-189-6",
    abstract = "Differentially private SGD (DPSGD) enables privacy-preserving training of language models, but often reduces utility, diversity, and linguistic quality. We introduce DPRefine, a three-phase method that initializes a model using data synthesis from a small pre-trained LM with rigorous filtering, applies DP finetuning on private data, and performs self-distillation to refine outputs. This approach significantly outperforms vanilla DPSGD, with AlpacaEval preferring DPRefine{'}s generations in 78.38{\%} of cases across all datasets and metrics, while also demonstrating substantial improvements in lexical diversity, achieving 85.31{\%} in MSTTR and 86.82{\%} in Jaccard similarity. Our fine-grained analysis reveals that DPRefine reduces linguistic errors in generated text by 84{\%}, mitigating grammar errors, spelling mistakes, and missing punctuation commonly associated with DPSGD. It also reduces inconsistencies present in non-private models, such as fabricated details and misattributed quotes. We find that small models like GPT-2 and T5 are effective for initialization and distillation, highlighting their potential in enabling scalable and efficient deployment of high-performing, privacy-preserving language models with improved linguistic quality and consistency."
}
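The abstract above refers to DP-SGD, i.e. training with per-example gradient clipping plus calibrated Gaussian noise, as the baseline that DPRefine improves on. The sketch below illustrates that generic DP-SGD update in plain PyTorch; the toy linear model, synthetic data, and hyperparameters (CLIP_NORM, NOISE_MULTIPLIER, LR, BATCH_SIZE) are illustrative assumptions only, not the paper's DPRefine pipeline or code.

```python
# Minimal DP-SGD sketch: clip each example's gradient, sum, add Gaussian
# noise scaled to the clipping bound, then average and step.
# Everything here (model, data, constants) is a placeholder for illustration.
import torch
import torch.nn as nn

torch.manual_seed(0)

# Toy "private" dataset: 256 examples, 20 features, binary labels.
X = torch.randn(256, 20)
y = (X.sum(dim=1) > 0).long()

model = nn.Linear(20, 2)          # stand-in for a language model
loss_fn = nn.CrossEntropyLoss()

CLIP_NORM = 1.0                   # per-example gradient clipping bound C
NOISE_MULTIPLIER = 1.1            # sigma; noise std is sigma * C
LR = 0.1
BATCH_SIZE = 32

for start in range(0, len(X), BATCH_SIZE):
    xb, yb = X[start:start + BATCH_SIZE], y[start:start + BATCH_SIZE]

    # Accumulate clipped per-example gradients.
    summed_grads = [torch.zeros_like(p) for p in model.parameters()]
    for xi, yi in zip(xb, yb):
        model.zero_grad()
        loss = loss_fn(model(xi.unsqueeze(0)), yi.unsqueeze(0))
        loss.backward()
        # Clip the full per-example gradient to norm CLIP_NORM.
        total_norm = torch.sqrt(sum(p.grad.norm() ** 2 for p in model.parameters()))
        scale = min(1.0, CLIP_NORM / (float(total_norm) + 1e-6))
        for g, p in zip(summed_grads, model.parameters()):
            g += p.grad * scale

    # Add noise calibrated to the clipping bound, average, and take a step.
    with torch.no_grad():
        for p, g in zip(model.parameters(), summed_grads):
            noise = torch.normal(0.0, NOISE_MULTIPLIER * CLIP_NORM, size=g.shape)
            p -= LR * (g + noise) / len(xb)
```

In practice this update is usually obtained via a DP training library rather than written by hand; the sketch only makes explicit the clipping-and-noise mechanism that the abstract's comparison against "vanilla DPSGD" presupposes.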