Rotation Control Unlearning: Quantifying and Controlling Continuous Unlearning for LLM with The Cognitive Rotation Space
Xiang Zhang, Kun Wei, Xu Yang, Jiahua Li, Su Yan, Cheng Deng
TL;DR
This work tackles continuous unlearning in LLMs without retained data by reframing LoRA updates as rotations in a cognitive rotation space: the update $W\gets (I+BA)W$ is interpreted as a rotation $R\in SO(n)$ with angle $\theta$. The authors introduce a skew-symmetric loss $\mathcal{L}_{Sk}$ to enforce antisymmetric updates, an orthogonal rotation axes loss $\mathcal{L}_{o}$ to keep the rotation axes of successive unlearning steps perpendicular, and an unlearning-alignment loss $\mathcal{L}_{Ua}$ to align out-of-distribution (OOD) representations with the RCU paradigm. An OOD-based rotational salience weight $\beta$, computed via a distributional shift compensator, controls the unlearning degree continuously, which gives monotonic control over unlearning progress without retained data. Experiments on ScienceQA and TOFU show state-of-the-art unlearning performance with fewer trainable parameters than prior methods that rely on retained data, and ablations confirm the key role of the rotational losses and the alignment component. The approach offers a principled, scalable path toward safe continual unlearning in large language models.
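The link between a skew-symmetric update and a rotation can be checked numerically. The following is a minimal sketch, not the authors' implementation: it assumes the skew-symmetric loss can be written as the squared Frobenius norm of $BA + (BA)^\top$ (the summary does not give the exact form of $\mathcal{L}_{Sk}$), and the helper names `skew_penalty`, `expm_taylor`, and `make_rotation_factors` are hypothetical. It builds rank-2 factors $B$, $A$ whose product is skew-symmetric, then verifies that $\exp(BA)$ lies in $SO(n)$, that its rotation angle equals $\theta$, and that $I+BA$ matches it to first order.

```python
import numpy as np

def skew_penalty(B, A):
    """Assumed form of a skew-symmetry penalty: ||BA + (BA)^T||_F^2 (illustrative only)."""
    M = B @ A
    return float(np.sum((M + M.T) ** 2))

def expm_taylor(S, terms=30):
    """Matrix exponential by truncated Taylor series; adequate for small-norm S."""
    result = np.eye(S.shape[0])
    term = np.eye(S.shape[0])
    for k in range(1, terms):
        term = term @ S / k
        result = result + term
    return result

def make_rotation_factors(n, theta, rng):
    """Rank-2 factors with BA = theta * (u v^T - v u^T) for orthonormal u, v."""
    Q, _ = np.linalg.qr(rng.standard_normal((n, 2)))
    u, v = Q[:, 0], Q[:, 1]
    B = theta * np.stack([u, -v], axis=1)  # shape (n, 2)
    A = np.stack([v, u], axis=0)           # shape (2, n)
    return B, A

rng = np.random.default_rng(0)
n, theta = 6, 0.1
B, A = make_rotation_factors(n, theta, rng)
S = B @ A                                  # skew-symmetric by construction
R = expm_taylor(S)                         # exact rotation generated by S

penalty = skew_penalty(B, A)               # ~0 for a skew-symmetric product
orth_err = np.linalg.norm(R.T @ R - np.eye(n))
det_R = np.linalg.det(R)
# A single-plane rotation in n dimensions has trace (n - 2) + 2 cos(angle).
angle = np.arccos(np.clip((np.trace(R) - (n - 2)) / 2.0, -1.0, 1.0))
# I + BA is the first-order truncation of exp(BA); the gap is O(theta^2).
first_order_gap = np.linalg.norm(np.eye(n) + S - R)

print(penalty, orth_err, det_R, angle, first_order_gap)
```

Under these assumptions, the magnitude of the skew-symmetric product sets the rotation angle, which is one way to read the claim that the angle $\theta$ tracks the unlearning degree.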
Abstract
As Large Language Models (LLMs) become increasingly prevalent, their security vulnerabilities have drawn growing attention. Machine unlearning seeks to mitigate these risks by removing the influence of undesirable data. However, existing methods not only rely on a retained dataset to preserve model utility but also suffer cumulative catastrophic utility loss under continuous unlearning requests. To resolve this dilemma, we propose a novel method, Rotation Control Unlearning (RCU), which uses a rotational salience weight to quantify and control the unlearning degree throughout the continuous unlearning process. A skew-symmetric loss is designed to establish the cognitive rotation space, in which changes in the rotation angle model the continuous unlearning process. Furthermore, we design an orthogonal rotation axes regularization that enforces mutually perpendicular rotation directions across continuous unlearning requests, minimizing interference between them and addressing cumulative catastrophic utility loss. Experiments on multiple datasets confirm that our method achieves SOTA performance without a retained dataset.
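The intuition behind perpendicular rotation directions can be illustrated with a small numerical sketch. It is an assumption-laden illustration rather than the paper's regularizer: the abstract does not define the orthogonal rotation axes loss, so `overlap_penalty` below is a hypothetical stand-in (the squared Frobenius norm of the product of two rotation-plane generators), and `plane_generator` and `plane_rotation` are likewise illustrative names. The sketch shows that rotations in mutually orthogonal planes commute, so applying one does not disturb the other, whereas rotations in overlapping planes do not.

```python
import numpy as np

def plane_generator(u, v):
    """Unit skew-symmetric generator of the plane spanned by orthonormal u, v."""
    return np.outer(v, u) - np.outer(u, v)

def plane_rotation(K, theta):
    """Exact exp(theta * K) for a unit plane generator K (Rodrigues-style formula)."""
    n = K.shape[0]
    return np.eye(n) + np.sin(theta) * K + (1.0 - np.cos(theta)) * (K @ K)

def overlap_penalty(K_prev, K_new):
    """Hypothetical regularizer: ||K_prev K_new||_F^2, zero when the planes are orthogonal."""
    return float(np.sum((K_prev @ K_new) ** 2))

rng = np.random.default_rng(1)
n = 8
Q, _ = np.linalg.qr(rng.standard_normal((n, 4)))  # four orthonormal directions
q0, q1, q2, q3 = Q.T

K1 = plane_generator(q0, q1)   # first unlearning request
K2 = plane_generator(q2, q3)   # second request, plane orthogonal to the first
K3 = plane_generator(q1, q2)   # alternative second request, shares q1 with the first

R1 = plane_rotation(K1, 0.3)
R2 = plane_rotation(K2, 0.5)
R3 = plane_rotation(K3, 0.5)

pen_orth = overlap_penalty(K1, K2)                 # ~0: no shared directions
pen_overlap = overlap_penalty(K1, K3)              # > 0: planes share q1
comm_orth = np.linalg.norm(R1 @ R2 - R2 @ R1)      # ~0: rotations commute
comm_overlap = np.linalg.norm(R1 @ R3 - R3 @ R1)   # > 0: order matters

print(pen_orth, pen_overlap, comm_orth, comm_overlap)
```

In this toy setting, driving the overlap penalty to zero makes the composed update independent of request order, which matches the stated goal of minimizing interference between successive unlearning requests.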
