Dual asymmetric momentum improves federated class unlearning in edge systems - Nature

Introduction

Federated learning (FL) enables multiple clients to collaboratively train a shared model while keeping raw data local, making it attractive in privacy-sensitive domains. This paradigm is particularly relevant when data are generated on-device and transferring raw data to a centralized server is impractical because of latency, bandwidth, or resource constraints. However, FL does not eliminate privacy risk: exchanged parameters and updates can still encode information about local samples and may be exploited by inference attacks such as membership inference under realistic access settings^1,2. At the same time, regulatory and governance frameworks, including the EU General Data Protection Regulation (GDPR)³ and the California Consumer Privacy Act (CCPA)⁴, have increased interest in machine unlearning, that is, procedures that reduce or remove the influence of designated training data, such as samples, users, clients, or classes, from a trained model.

Post-hoc unlearning is especially challenging in FL because the most direct reference procedure, retrain-on-retain, is often operationally infeasible. It requires repeated client coordination, substantial communication, and considerable compute. These constraints motivate budgeted post-hoc federated unlearning methods that aim to improve the forgetting–utility trade-off without re-running end-to-end training^5,6,7. In this work, we focus on post-hoc federated class unlearning, where a trained federated system receives a request to reduce the influence of a designated class while preserving utility on the remaining classes under a strict update and communication budget. The primary method and analysis target global class requests, and additional experiments on sample-level and client-level removal are included to examine robustness when forget sets are sparse and uneven across clients.

A central difficulty in this setting is the forget–retain conflict, which becomes more pronounced under non-IID client heterogeneity. Forgetting pushes parameters away from representations that support the forget set, whereas retention requires stability to preserve performance on retained data. Beyond the trade-off at the loss level, the optimizer state can itself act as a coupling mechanism: a single momentum or adaptive state can mix retain- and forget-driven signals into a shared direction estimate, potentially amplifying interference when local objectives conflict across heterogeneous clients^8,9. FedDAM addresses this issue through two design choices. First, it adopts an aux-only unlearning protocol that freezes the backbone and main classifier and updates only a lightweight auxiliary head, thereby reducing communicated payload and client-side computation. Second, it decouples retain and forget optimization through separate momentum buffers combined with an asymmetric update rule. In addition to downstream utility measures, the paper also provides mechanistic evidence through retain–forget gradient-alignment analysis during unlearning.

The main contributions of this work are as follows. First, we introduce a parameter-efficient post-hoc federated unlearning protocol that freezes the backbone and main classifier and communicates only auxiliary parameters during unlearning, with the resulting communication and runtime characteristics quantified in Tables 4 and 5. Second, we propose a dual-asymmetric momentum mechanism that separates retain and forget dynamics during local optimization. Direct gradient-conflict analysis provides mechanistic support for this decoupling (Table 12), the effect of asymmetry is isolated through a symmetric-versus-asymmetric ablation (Table 15), and Section 3.4 now combines Proposition 3.4 on improved forget-direction alignment under gradient conflict with a bounded-scope convergence perspective for the frozen auxiliary-head subproblem. Third, we report matched-budget experiments on CIFAR-10, CIFAR-100, and ImageNet-100, together with additional sample-level and client-level removal studies, to characterize retain–forget trade-offs under non-IID heterogeneity and sparse forget support (Tables 1, 2, 8, and 10, 11). Fourth, we compare FedDAM with projection- and scrubbing-style auxiliary-head adaptations under the same budget, as well as with compressed full-model retraining, in order to clarify the utility–communication trade-off associated with restricting the update scope (Tables 13 and 5). Finally, we report black-box membership-inference indicators under an explicitly stated protocol as privacy-oriented diagnostics, while emphasizing that these measures do not constitute formal privacy guarantees (Sec. 4.4 and Table 6).

The remainder of the paper reviews related work, formalizes the setup and evaluation metrics, presents FedDAM, and reports the experimental results and analysis.

Related work

Federated machine unlearning studies how to reduce or remove the influence of designated training data, such as samples, clients, or classes, from a deployed federated model without performing full retraining. Compared with centralized settings, federated deployments introduce communication limits, partial participation, and non-IID client heterogeneity, all of which can make post-hoc updates both operationally constrained and algorithmically unstable^10,11. In cross-device settings, these challenges are compounded by limited uplink bandwidth and restricted on-device compute, motivating approaches that are explicitly budgeted in both communication and optimization.

Federated unlearning methods

Existing federated unlearning approaches can be grouped into four broad families according to their assumptions and resource requirements. Retraining or replay approximations aim to approximate retrain-on-retain by selectively replaying retained data or reconstructing training dynamics; these methods can improve fidelity, but often remain costly when coordination or recomputation is substantial^12,13. Checkpoint- or trajectory-based removal uses stored checkpoints, update logs, or training traces to roll back and recompose training effects; such strategies can be efficient when rich traces are available, but they rely on storage and logging assumptions that may not hold in practical deployments¹⁴. Adapter or auxiliary-module methods restrict unlearning to a small subset of parameters, such as adapters or auxiliary heads, thereby reducing communication and limiting representational drift, although this restriction may reduce capacity for more complex forget requests²². Finally, gradient-based post-hoc optimization directly optimizes objectives intended to suppress the forget set while preserving retained utility, with outcomes depending strongly on objective design and stability under heterogeneity^7,9,23.

FedDAM is most closely related to auxiliary-module approaches. It performs post-hoc class unlearning by updating only lightweight auxiliary parameters while keeping the backbone fixed, thereby targeting low communication overhead and reduced unintended drift on retained decision functions.

Recent federated unlearning research has also explored geometry-aware update rules that explicitly mitigate gradient conflict, such as projection- and scrubbing-style procedures, as well as certification- and privacy-aware frameworks that aim to provide bounded distance-to-retrain or differential-privacy-calibrated removal guarantees^6,15,23. These directions are complementary to the setting considered here. The focus of FedDAM is budgeted post-hoc class unlearning under strict cross-device constraints, where the update scope is restricted to a lightweight auxiliary head and retain/forget optimizer states are explicitly separated. Within this setting, comparisons to projection- and scrubbing-style mechanisms are implemented under the same auxiliary-head communication budget so that differences reflect the conflict-handling mechanism rather than discrepancies in update scope or communicated parameter count.

Direct head-to-head reproduction of all recent federated unlearning methods is not always comparable within our setting because several recent approaches target full-model updates, different access assumptions, or vertical federated learning rather than cross-device post-hoc class unlearning. We therefore focus our empirical comparisons on controlled same-budget auxiliary-head baselines, namely FedAU, Aux Proj, and Aux Scrub, while positioning certification- and privacy-oriented methods as complementary directions in the broader federated unlearning landscape. This design allows us to isolate conflict-handling differences under the same communication budget and update scope without conflating them with differences in model-update extent or access assumptions.

Verification and privacy diagnostics

Unlearning is often summarized through utility-based measures, such as reduced performance on the forget set and preserved utility on retained data, but such outcomes do not necessarily imply reduced privacy leakage. For this reason, membership inference attacks (MIAs) are frequently used as privacy-oriented diagnostics in both centralized and federated settings^1,2, and prior work has shown that post-hoc procedures can alter attack separability depending on the evaluation protocol¹⁶. Because these outcomes depend on attacker access assumptions, such as whether the attacker queries a single model or compares pre- and post-unlearning behavior, it is important to state the threat model clearly and interpret diagnostic signals conservatively.

Across these lines of work, comparatively less attention has been paid to the role of the optimizer state itself in coupling retain- and forget-driven objectives during post-hoc updates. Under non-IID heterogeneity, gradient directions and magnitudes can vary substantially across clients, and a unified momentum or adaptive state can implicitly mix retain- and forget-driven signals into a single update direction, potentially amplifying interference when the two objectives conflict^8,9. This observation motivates an approach that is parameter-efficient at the system level while explicitly separating retain and forget optimization dynamics at the algorithmic level.

Proposed method: FedDAM

FedDAM performs post-hoc federated class unlearning by freezing the pre-trained backbone and main classifier and updating only a lightweight auxiliary head. Unlearning is executed in standard FL rounds, but the server communicates and aggregates only auxiliary parameters, limiting communicated payload and client-side computation under a fixed unlearning budget. The central mechanism is optimizer-state decoupling: separate momentum buffers track retain- and forget-driven gradients and are combined asymmetrically during local unlearning.

Overview

Let the pre-trained model produce main-head logits $\textbf{z}^{\text {main}}(\textbf{x})$ and auxiliary-head logits $\textbf{z}^{\text {aux}}(\textbf{x})$. During unlearning, the backbone and main classifier are frozen and only the auxiliary linear head $(\textbf{W}^{\text {aux}},\textbf{b}^{\text {aux}})$ is trained. Predictions use blended logits

$$\begin{aligned} \textbf{z}(\textbf{x})=(1-\alpha _b)\textbf{z}^{\text {main}}(\textbf{x})+\alpha _b\textbf{z}^{\text {aux}}(\textbf{x}), \end{aligned}$$

(1)

where $\alpha _b\in [0,1]$ controls the auxiliary correction strength. Because the backbone provides a fixed representation, the auxiliary head acts as a lightweight logit-space correction that can shift decision boundaries for the target class without modifying deep features. This restriction improves efficiency but limits capacity when the required decision change depends on altering backbone representations (discussed in Sec. 6).

Objectives

At each client k, local data are partitioned into retain samples $\mathcal {D}_{r,k}$ and forget samples $\mathcal {D}_{u,k}$. Retain updates minimize cross-entropy on retain minibatches:

$$\begin{aligned} \mathcal {L}_{\text {CE}}(\mathcal {B}_r) = \mathbb {E}_{(\textbf{x},y)\sim \mathcal {B}_r}\left[ -\log p_y(\textbf{x})\right] , \end{aligned}$$

(2)

where $p(\textbf{x})=\textrm{softmax}(\textbf{z}(\textbf{x}))$ and $\textbf{z}(\textbf{x})$ is given by (1).

Forget updates use a knowledge-overwriting objective that suppresses the true class and redistributes probability mass uniformly over non-true classes:

$$\begin{aligned} \mathcal {L}_{\text {KO}}(\mathcal {B}_u) = \mathbb {E}_{(\textbf{x},y)\sim \mathcal {B}_u}\!\left[ -\sum _{c\ne y}\frac{1}{C-1}\log p_c(\textbf{x}) \right] . \end{aligned}$$

(3)

In practice, we estimate (3) by sampling $\tilde{y}\sim \textrm{Unif}(\{1,\dots ,C\}\setminus \{y\})$ per sample and minimizing $-\log p_{\tilde{y}}(\textbf{x})$, yielding an unbiased Monte Carlo estimator of (3).

Dual-asymmetric optimization

Each local unlearning step draws one retain minibatch $\mathcal {B}_r\subset \mathcal {D}_{r,k}$ and one forget minibatch $\mathcal {B}_u\subset \mathcal {D}_{u,k}$ (sampling with replacement when needed). We index local steps by $e\in \{1,\dots ,S_u\}$ (corresponding to the inner-loop index in Algorithm 1). Define retain and forget gradients on auxiliary parameters:

$$\begin{aligned} \textbf{g}_r^{(e)}&= \nabla _{\textbf{W}^{\text {aux}}}\mathcal {L}_{\text {CE}}(\mathcal {B}_r), \end{aligned}$$

(4)

$$\begin{aligned} \textbf{g}_f^{(e)}&= \nabla _{\textbf{W}^{\text {aux}}}\Big (\lambda _{\text {ow}}\mathcal {L}_{\text {KO}}(\mathcal {B}_u)\Big ), \end{aligned}$$

(5)

where $\lambda _{\text {ow}}>0$ scales overwrite strength.

Clients with no forget samples. If $\mathcal {D}_{u,k}=\emptyset$, we set $\mathcal {B}_u=\emptyset$ and $\textbf{g}_f^{(e)}=\textbf{0}$ for all local steps. Because momentum buffers are reset at the start of each unlearning round (Algorithm 1), $\textbf{m}_f^{(0)}=\textbf{0}$ implies $\textbf{m}_f^{(e)}=\textbf{0}$ for all e in that round; the client therefore contributes retain-driven auxiliary updates only. Global forgetting is driven by participating clients with non-empty forget sets and is synchronized through server aggregation of auxiliary parameters. Forget-set availability heterogeneity is summarized in Table 3.

FedDAM maintains separate momentum buffers for the two objectives:

$$\begin{aligned} \textbf{m}_r^{(e+1)}&= \beta _r \textbf{m}_r^{(e)} + (1-\beta _r)\textbf{g}_r^{(e)}, \end{aligned}$$

(6)

$$\begin{aligned} \textbf{m}_f^{(e+1)}&= \beta _f \textbf{m}_f^{(e)} + (1-\beta _f)\textbf{g}_f^{(e)}. \end{aligned}$$

(7)

Auxiliary weights are updated by an asymmetric combination of the two momenta, with decoupled weight decay applied once:

$$\begin{aligned} \textbf{W}^{\text {aux},(e+1)} = (1-\eta _{\text {aux}}\lambda _{\text {aux}})\textbf{W}^{\text {aux},(e)} - \eta _{\text {aux}} \left( \gamma _f \textbf{m}_f^{(e+1)} + \gamma _r \textbf{m}_r^{(e+1)} \right) , \end{aligned}$$

(8)

where $\eta _{\text {aux}}$ is the auxiliary learning rate, $\lambda _{\text {aux}}$ is the auxiliary weight-decay coefficient, and $\gamma _f,\gamma _r>0$ scale forget and retain contributions.

Weight decay is applied to $\textbf{W}^{\text {aux}}$ but not to $\textbf{b}^{\text {aux}}$, following the standard practice of excluding bias terms from $\ell _2$ regularization.

The asymmetric settings are chosen so that $\beta _f<\beta _r$, giving the forget buffer a shorter memory and therefore a faster response to overwrite gradients, while $\gamma _f>\gamma _r$ prioritizes overwrite progress under a fixed step budget. The dual-buffer design avoids mixing retain- and forget-driven directions into a single momentum state; mechanistic support is provided by the gradient-alignment statistics reported in Table 12.

The auxiliary bias is updated analogously (without weight decay):

$$\begin{aligned} \textbf{m}_{r,b}^{(e+1)}&= \beta _r \textbf{m}_{r,b}^{(e)} + (1-\beta _r)\nabla _{\textbf{b}^{\text {aux}}}\mathcal {L}_{\text {CE}}(\mathcal {B}_r), \end{aligned}$$

(9)

$$\begin{aligned} \textbf{m}_{f,b}^{(e+1)}&= \beta _f \textbf{m}_{f,b}^{(e)} + (1-\beta _f)\nabla _{\textbf{b}^{\text {aux}}}\Big (\lambda _{\text {ow}}\mathcal {L}_{\text {KO}}(\mathcal {B}_u)\Big ),\end{aligned}$$

(10)

$$\begin{aligned} \textbf{b}^{\text {aux},(e+1)}&= \textbf{b}^{\text {aux},(e)} -\eta _{\text {aux}} \left( \gamma _f \textbf{m}_{f,b}^{(e+1)} + \gamma _r \textbf{m}_{r,b}^{(e+1)} \right) . \end{aligned}$$

(11)

Theoretical intuition for dual-asymmetric momentum

The asymmetric settings in FedDAM are motivated by the goal of improving forget-direction responsiveness under conflicting retain and forget objectives. The following proposition formalizes this intuition at the level of update alignment.

Proposition 1 (Expected alignment under gradient conflict). Let $\textbf{g}_r$ and $\textbf{g}_f$ denote retain and forget gradients with bounded norms under a Lipschitz-smooth objective. If $\cos (\textbf{g}_r,\textbf{g}_f)<0$, then under dual-asymmetric momentum with $\beta _f<\beta _r$ and $\gamma _f>\gamma _r$, the expected alignment of the applied update direction $\Delta$ with the forget gradient satisfies

$$\begin{aligned} \mathbb {E}\!\left[ \cos (\Delta ,\textbf{g}_f)\right] \ge \mathbb {E}\!\left[ \cos (\Delta _{\textrm{unified}},\textbf{g}_f)\right] , \end{aligned}$$

(12)

where $\Delta _{\textrm{unified}}$ is the update obtained from a single shared momentum buffer.

Proof sketch. Unified momentum accumulates retain and forget gradients in the same buffer, which mixes conflicting signals and can reduce alignment with either objective. Dual buffers preserve objective-specific temporal filtering. Setting $\beta _f<\beta _r$ shortens the forget-gradient memory and improves responsiveness to overwrite signals, while $\gamma _f>\gamma _r$ increases the projection of the applied step onto the forget direction under a fixed budget. Empirical gradient statistics in Table 12 support this intuition: FedDAM reduces the fraction of steps with negative retain–forget alignment from approximately $41\%$ to $19\%$.

This proposition is intended to clarify the role of asymmetric dual momentum under conflicting objectives. A general convergence analysis for the non-convex federated setting remains beyond the scope of the present study.

Bounded-scope convergence perspective. With the backbone and main classifier frozen, FedDAM optimizes only the auxiliary head, thereby defining a lower-dimensional federated subproblem. Under standard assumptions used in smooth non-convex stochastic optimization—namely an L-smooth auxiliary-head objective, bounded stochastic gradient variance, bounded client heterogeneity, and a sufficiently small auxiliary learning rate—the expected averaged gradient norm of this subproblem is expected to decrease with the number of unlearning rounds up to variance- and heterogeneity-dependent residual terms. In addition, under conflicting retain and forget gradients with $\cos (\textbf{g}_r,\textbf{g}_f)<0$, the asymmetric setting $\beta _f<\beta _r$ and $\gamma _f>\gamma _r$ is expected to improve alignment of the applied update with the forget gradient relative to a unified-momentum update.

Limitation. This convergence perspective applies only to the frozen auxiliary-head subproblem under smoothness and bounded-heterogeneity assumptions. It does not constitute a full convergence proof for general non-convex federated unlearning.

Algorithm and communication

FedDAM runs for $T_u$ unlearning rounds. In each round, the server broadcasts the current auxiliary head, clients perform $S_u$ local unlearning steps, and the server aggregates only auxiliary parameters using a FedAvg-style weighted average. Momentum buffers are client-local and are reset at the start of each round; no optimizer state is communicated.

Results

We evaluate FedDAM on CIFAR-10 and CIFAR-100 for global class unlearning^17,18, and additionally report a mid-scale ImageNet-100 evaluation together with sparse-removal experiments for sample-level and client-level requests. Experimental settings, model-selection protocol, and reproducibility controls are described in the Methods section.

Main comparison

Unlearning exposes multiple operating points (e.g., via $\lambda _{\textrm{ow}}$) under a fixed budget. We therefore report two complementary summaries under the same matched-budget sweep: (i) FA/RA at an unconstrained validation-selected operating point $h_{\textrm{val}}$ that maximizes retained validation accuracy under the matched budget, and (ii) the matched-forgetting summary $\mathrm {RA@FA\le \tau }$ computed using the constrained selection rule in Eq. (16). This separation avoids conflating an unconstrained utility-maximizing choice with a threshold-constrained unlearning operating point.

Table 1 reports the main CIFAR comparison. On CIFAR-100, FedDAM increases retained utility at the matched-forgetting threshold, improving $\mathrm {RA@FA\le \tau }$ relative to the unified-state auxiliary baseline (FedAU) within the fixed unlearning budget. The gain is modest on CIFAR-10, where the class structure is simpler and the forget target is easier to suppress, but becomes substantial on CIFAR-100, where richer inter-class structure increases forget–retain interference. This difference is consistent with the proposed motivation for optimizer-state decoupling: the benefit of separating retain- and forget-driven dynamics grows as class overlap and heterogeneity increase. Figure 1 visualizes the operating-point geometry on CIFAR-100 by showing the RA–FA frontier obtained from sweeping the overwrite strength under the fixed unlearning budget.

Table 1 Main comparison under matched unlearning budgets (mean ± std over $S=5$ seeds and multiple forget classes). We report FA/RA at the unconstrained validation-selected operating point $h_{\textrm{val}}$ (columns “FA (val)” and “RA (val)”) and the matched-forgetting summary $\mathrm {RA@FA\le \tau }$ computed using the constrained selection rule in Eq. (16). FA$^\star$ denotes the test-time forget accuracy at the selected constrained operating point.

Dual asymmetric momentum improves federated class unlearning in edge systems - Nature

Introduction

Related work

Federated unlearning methods

Verification and privacy diagnostics

Proposed method: FedDAM

Overview

Objectives

Dual-asymmetric optimization

Theoretical intuition for dual-asymmetric momentum

Algorithm and communication

Results

Main comparison

Robustness under non-IID heterogeneity

System overhead

Verification and privacy diagnostics

Generalization to a stronger backbone

Mid-scale evaluation on ImageNet-100

Sparse unlearning regimes: sample-level and client-level removal

Mechanistic evidence: retain–forget gradient conflict

Comparison with conflict-mitigation adaptations

Sensitivity to aggregation rule

Ablation: asymmetric versus symmetric optimization

Hard forget-class stress test

Methods

Datasets and preprocessing

Federated setup and client heterogeneity

Models and training protocol

Unlearning request and budget

Metrics and operating-point selection

Baselines

System overhead accounting

Reproducibility settings

Compute resources

AI-assisted language editing

Discussion

Conclusion