Deep Unlearn: Benchmarking Machine Unlearning

October 2, 2024

Xavier F. Cadet, Anastasia Borovykh, Mohammad Malekzadeh, Sara Ahmadi-Abhari, Hamed Haddadi

Computer Science

Machine Learning

Artificial Intelligence

Machine unlearning (MU) aims to remove the influence of particular data points from the learnable parameters of a trained machine learning model. This is a crucial capability in light of data privacy requirements, trustworthiness, and safety in deployed models. MU is particularly challenging for deep neural networks (DNNs), such as convolutional nets or vision transformers, as such DNNs tend to memorize a notable portion of their training dataset. Nevertheless, the community lacks a rigorous and multifaceted study that looks into the success of MU methods for DNNs. In this paper, we investigate 18 state-of-the-art MU methods across various benchmark datasets and models, with each evaluation conducted over 10 different initializations, a comprehensive evaluation involving MU over 100K models. We show that, with the proper hyperparameters, Masked Small Gradients (MSG) and Convolution Transpose (CT), consistently perform better in terms of model accuracy and run-time efficiency across different models, datasets, and initializations, assessed by population-based membership inference attacks (MIA) and per-sample unlearning likelihood ratio attacks (U-LiRA). Furthermore, our benchmark highlights the fact that comparing a MU method only with commonly used baselines, such as Gradient Ascent (GA) or Successive Random Relabeling (SRL), is inadequate, and we need better baselines like Negative Gradient Plus (NG+) with proper hyperparameter selection.

Gone but Not Forgotten: Improved Benchmarks for Machine Unlearning

May 29, 2024

94% Match

Keltin Grimes, Collin Abidi, ... , Gallagher Shannon

Machine Learning

Machine learning models are vulnerable to adversarial attacks, including attacks that leak information about the model's training data. There has recently been an increase in interest about how to best address privacy concerns, especially in the presence of data-removal requests. Machine unlearning algorithms aim to efficiently update trained models to comply with data deletion requests while maintaining performance and without having to resort to retraining the model from sc...

Find SimilarView on arXiv

Machine Unlearning Fails to Remove Data Poisoning Attacks

June 25, 2024

93% Match

Martin Pawelczyk, Jimmy Z. Di, Yiwei Lu, Gautam Kamath, ... , Neel Seth

Machine Learning

Artificial Intelligence

Cryptography and Security

Computers and Society

We revisit the efficacy of several practical methods for approximate machine unlearning developed for large-scale deep learning. In addition to complying with data deletion requests, one often-cited potential application for unlearning methods is to remove the effects of training on poisoned data. We experimentally demonstrate that, while existing unlearning methods have been demonstrated to be effective in a number of evaluation settings (e.g., alleviating membership inferen...

Find SimilarView on arXiv

Fast Yet Effective Machine Unlearning

November 17, 2021

93% Match

Ayush K Tarun, Vikram S Chundawat, ... , Kankanhalli Mohan

Machine Learning

Unlearning the data observed during the training of a machine learning (ML) model is an important task that can play a pivotal role in fortifying the privacy and security of ML-based applications. This paper raises the following questions: (i) can we unlearn a single or multiple class(es) of data from a ML model without looking at the full training data even once? (ii) can we make the process of unlearning fast and scalable to large datasets, and generalize it to different de...

Find SimilarView on arXiv

Attack and Reset for Unlearning: Exploiting Adversarial Noise toward Machine Unlearning through Parameter Re-initialization

January 17, 2024

93% Match

Yoonhwa Jung, Ikhyun Cho, ... , Hockenmaier Julia

Machine Learning

Cryptography and Security

Computer Vision and Pattern ...

With growing concerns surrounding privacy and regulatory compliance, the concept of machine unlearning has gained prominence, aiming to selectively forget or erase specific learned information from a trained model. In response to this critical need, we introduce a novel approach called Attack-and-Reset for Unlearning (ARU). This algorithm leverages meticulously crafted adversarial noise to generate a parameter mask, effectively resetting certain parameters and rendering them ...

Find SimilarView on arXiv

Silver Linings in the Shadows: Harnessing Membership Inference for Machine Unlearning

July 1, 2024

92% Match

Nexhi Sula, Abhinav Kumar, Jie Hou, ... , Tourani Reza

Machine Learning

With the continued advancement and widespread adoption of machine learning (ML) models across various domains, ensuring user privacy and data security has become a paramount concern. In compliance with data privacy regulations, such as GDPR, a secure machine learning framework should not only grant users the right to request the removal of their contributed data used for model training but also facilitates the elimination of sensitive data fingerprints within machine learning...

Find SimilarView on arXiv

Deep Unlearning: Fast and Efficient Training-free Approach to Controlled Forgetting

December 1, 2023

92% Match

Sangamesh Kodge, Gobinda Saha, Kaushik Roy

Machine Learning

Artificial Intelligence

Computer Vision and Pattern ...

Machine Learning

Machine unlearning has emerged as a prominent and challenging area of interest, driven in large part by the rising regulatory demands for industries to delete user data upon request and the heightened awareness of privacy. Existing approaches either retrain models from scratch or use several finetuning steps for every deletion request, often constrained by computational resource limitations and restricted access to the original training data. In this work, we introduce a nove...

Find SimilarView on arXiv

Towards Unbounded Machine Unlearning

February 20, 2023

92% Match

Meghdad Kurmanji, Peter Triantafillou, ... , Triantafillou Eleni

Machine Learning

Cryptography and Security

Deep machine unlearning is the problem of `removing' from a trained neural network a subset of its training set. This problem is very timely and has many applications, including the key tasks of removing biases (RB), resolving confusion (RC) (caused by mislabelled data in trained models), as well as allowing users to exercise their `right to be forgotten' to protect User Privacy (UP). This paper is the first, to our knowledge, to study unlearning for different applications (R...

Find SimilarView on arXiv

Inexact Unlearning Needs More Careful Evaluations to Avoid a False Sense of Privacy

March 2, 2024

92% Match

Jamie Hayes, Ilia Shumailov, Eleni Triantafillou, ... , Papernot Nicolas

Machine Learning

Cryptography and Security

The high cost of model training makes it increasingly desirable to develop techniques for unlearning. These techniques seek to remove the influence of a training example without having to retrain the model from scratch. Intuitively, once a model has unlearned, an adversary that interacts with the model should no longer be able to tell whether the unlearned example was included in the model's training set or not. In the privacy literature, this is known as membership inference...

Find SimilarView on arXiv

AMUN: Adversarial Machine UNlearning

March 2, 2025

92% Match

Ali Ebrahimpour-Boroojeny, Hari Sundaram, Varun Chandrasekaran

Machine Learning

Cryptography and Security

Machine unlearning, where users can request the deletion of a forget dataset, is becoming increasingly important because of numerous privacy regulations. Initial works on ``exact'' unlearning (e.g., retraining) incur large computational overheads. However, while computationally inexpensive, ``approximate'' methods have fallen short of reaching the effectiveness of exact unlearning: models produced fail to obtain comparable accuracy and prediction confidence on both the forget...

Find SimilarView on arXiv

Machine Unlearning with Minimal Gradient Dependence for High Unlearning Ratios

June 24, 2024

92% Match

Tao Huang, Ziyang Chen, Jiayang Meng, Qingyu Huang, Xu Yang, ... , Khalil Ibrahim

Machine Learning

Artificial Intelligence

Cryptography and Security

In the context of machine unlearning, the primary challenge lies in effectively removing traces of private data from trained models while maintaining model performance and security against privacy attacks like membership inference attacks. Traditional gradient-based unlearning methods often rely on extensive historical gradients, which becomes impractical with high unlearning ratios and may reduce the effectiveness of unlearning. Addressing these limitations, we introduce Min...

Find SimilarView on arXiv