SecureLLM: Using Inference-time Compositionality to Build Secure Language Models

Abdulrahman Alabdulkareem, Christian Michael Arnold, Yerim Lee, Pieter M Feenstra, Conner Arnold, Boris Katz, Andrei Barbu, Brian Cheung


Abstract
As Large Language Models (LLMs) increasingly support critical sectors such as healthcare, finance, and public governance, ensuring data confidentiality and robust access control is a pressing societal challenge. Traditional security mechanisms isolate sensitive resources from unauthorized users, yet existing LLM safety approaches often fail to enforce strict segregation of confidential data. In this work, we introduce SecureLLM, a novel compositional framework for building secure LLMs that integrates fine-tuning with traditional access security measures to protect private information. By fine-tuning LLMs on segregated, “siloed” training data and composing their outputs at inference time based solely on a user’s verified credentials, SecureLLM not only prevents unauthorized data leakage but also enables accurate responses for complex queries spanning multiple data silos. Our method is demonstrated on a challenging natural-language-to-SQL translation task and is designed with real-world applications in mind, where protecting sensitive information is critical.
Anthology ID:
2026.privatenlp-main.3
Volume:
Proceedings of the Seventh Workshop on Privacy in Natural Language Processing
Month:
July
Year:
2026
Address:
San Diego, California
Editors:
Ivan Habernal, Sepideh Ghanavati, Sara Haghighi, Krithika Ramesh, Timour Igamberdiev, Shomir Wilson
Venues:
PrivateNLP | WS
SIG:
Publisher:
Association for Computational Linguistics
Note:
Pages:
26–52
Language:
URL:
https://preview.aclanthology.org/ingest-acl-workshops/2026.privatenlp-main.3/
DOI:
Bibkey:
Cite (ACL):
Abdulrahman Alabdulkareem, Christian Michael Arnold, Yerim Lee, Pieter M Feenstra, Conner Arnold, Boris Katz, Andrei Barbu, and Brian Cheung. 2026. SecureLLM: Using Inference-time Compositionality to Build Secure Language Models. In Proceedings of the Seventh Workshop on Privacy in Natural Language Processing, pages 26–52, San Diego, California. Association for Computational Linguistics.
Cite (Informal):
SecureLLM: Using Inference-time Compositionality to Build Secure Language Models (Alabdulkareem et al., PrivateNLP 2026)
Copy Citation:
PDF:
https://preview.aclanthology.org/ingest-acl-workshops/2026.privatenlp-main.3.pdf