Jan Bronec
2026
Thesis Proposal: Targeted and Unified Cross-Lingual Unlearning from Multilingual Language Models
Jan Bronec | Jindřich Helcl
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
Jan Bronec | Jindřich Helcl
Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026)
As large language models (LLM) trained on massive corpora scraped from the web exhibit the capability to reproduce sensitive and copyright-protected data, the field of machine unlearning has emerged to address the arising ethical and legal concerns.While previous research has provided a unified evaluation of LLM unlearning methods, this unification remains constrained to English-only models and datasets.We aim to address the prevailing fragmentation in recent cross-lingual unlearning research by extending existing unified benchmarks with multilingual data.To that end, we plan to compile a dataset of parallel translations of question-answer pairs consisting of real-world facts and synthetic personally identifiable information.Moreover, we will focus on mitigating model degradation during unlearning by selectively editing only those layers that contain the given knowledge.
2025
Atyaephyra at SemEval-2025 Task 4: Low-Rank Negative Preference Optimization
Jan Bronec | Jindřich Helcl
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
Jan Bronec | Jindřich Helcl
Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval-2025)
We present a submission to the SemEval 2025 shared task on unlearning sensitive content from LLMs. Our approach employs negative preference optimization using low-rank adaptation. We show that we can utilize this combination to cheaply compute additional regularization terms, which help with unlearning stabilization. The results of our approach significantly exceed the shared task baselines.