unlearning-challenge / starting-kit Goto Github PK

View Code? Open in Web Editor NEW

373.0 21.0 134.0 2.5 MB

Starting kit for the NeurIPS 2023 unlearning challenge

Home Page: https://unlearning-challenge.github.io/

License: Apache License 2.0

Jupyter Notebook 100.00%

starting-kit's People

Stargazers

Watchers

Forkers

amrkhalifa christophermichael-stokes ysy9997 bdemo innat monssegcompyu 1ceopresmenzu denisovagap snotresmanon 3multpovcaena 9confliasimpru 7cassoevbo inibtrosi manogna7 quipunini 3compdeunsa culgiofsuyu rema0tata pilswitthendo poestatzdextpu adposasbo itigqblasga conmisiro 1complisgramge putcuftemphi tincconcontse 9cinmozadchi abrahammathews2000 llanfeng709 0carranogisto 8compdefasgi 0tempnulkachi 1confcultranmu 8mendesounpo gaybehagto 8tincnapulza laecapcuncshi delloerne kojokesse hartbacooco branoutaffes hieperporni enlabopaa 7perfmexesko incomanwo 8cripedxigpa 7perpanasa iervolino enadipat subuttfloren 1clemodforge 0paubuexraebo 0aptayferni tiliriashi suppsifracze sashayenchik stupfectpobo maritaparedes sentaconhost yogeshmsharma-architect brunotech aryantiwariofficial emiliaod kgourgou perceptronv eventures-io kishanshukla-2307 ravikothari510 corazju kai-ten moseti1 shreyasinha14468 elvinagam neilshah13 pengningyi mayo66 joshmccarter markrussinovich explcre strukturen sushmaakoju susiesyli126 ed-fish dasika-vaishnavi softvision17 sohamtalukdar ashaza13 devanshi-crypto nitanshjain threemmm curious-x pxu23 vmalgi jai2dev wangjunxiao mohitburkule vishal-2000 arnav10goel ecebgenc amr8ta

starting-kit's Issues

Consideration of network architecture and learning algorithms for unlearning effectiveness

The unlearning challenge could benefit from accounting for the impact of network architecture and training methods on unlearning performance.

Some neural network architectures and training methods, like recursive cortical networks and gated linear networks/supermasks, could have a huge impact on the way the competition is run and how models are evaluated.

It would be nice if future iterations of the challenge could consider:

Incorporating architectures/learning algorithms like recursive cortical networks and gated linear networks/supermasks in the starter kit
Evaluating submissions based on both unlearning performance and the network architecture/learning algorithm used

This could help identify approaches that balance performance and adaptability - crucial for building AI systems that can responsibly adjust to new requirements over time. Studying how architecture and learning algorithms impact unlearnability could drive progress.

Please let me know if you would like me to expand on any part of this feedback or provide more suggestions. I'm happy to discuss ways to improve future iterations of this valuable challenge.

What is the reason not to compare with MIA with "Retrained" model?

Hello I notice the baseline simple MI diagram looks like a "decrease of MIA from pre-trained model to fine-tuned model", which is different from what we consider privacy baseline, where the goal is -- or I think should be -- the MIA of retrained model that does not include the offending data.

Image below:

Am I confused, or is this a developmental area?

Thanks!

Suggestion: Avoid Repeated Download of Weights

I suggest changing In [41] to

# download pre-trained weights
import os
path="weights_resnet18_cifar10.pth"
if not os.path.exists(path):
    response = requests.get(
        "https://unlearning-challenge.s3.eu-west-1.amazonaws.com/weights_resnet18_cifar10.pth"
    )
    open(path, "wb").write(response.content)

weights_pretrained = torch.load("weights_resnet18_cifar10.pth", map_location=DEVICE)

# load model with pre-trained weights
model = resnet18(weights=None, num_classes=10)
model.load_state_dict(weights_pretrained)
model.to(DEVICE)
model.eval();

Reasoning: Researchers will likely be running this code repeatedly, and the above just checks if the model is already downloaded before downloading it.

Question Regarding Optimal MIA and Overall Desired Objective

What is considered optimal for the MIA score?

Obviously, it should be lower than the initial model. But just wanted to clarify, are we aiming to have the last chart with as much overlap as possible between the Test and Forget set and a high overall score on the test set, or would an MIA score of less than 0.5 be ideal(and, yes, I can get this significantly lower than 0.5)?

Just trying to clarify the metrics which will be considered an "improved" result.

Face Synthetics labels

The challenge mentions that we will be using the Face Synthetics dataset on an age prediction task. However, Face Synthetics does not have any labels for age. I am aware that we will get a notebook with this dataset and a trained model sometime later this month but I want to understand how the input model is trained.
Side note: any updates on the next notebook?
Thanks

How are the unlearning evaluation metrics computed?

Is the evaluation metric public?

Please share how the evaluation metric is computed

suggested fix to comment on losses histogram of the pre-trained model.

The comment after the plot of histogram of losses on train vs test set of the pre-trained model (first plot)
says:
"As per the above plot, the distributions of losses are quite different between the forget and retain set. This suggests that the simple MIA that we're considering should be reasonably effective."

I believe this should be changed to:
"As per the above plot, the distributions of losses are quite different between the train and test set. This suggests that the simple MIA that we're considering should be reasonably effective."

unlearning-challenge / starting-kit Goto Github PK

starting-kit's People

Stargazers

Watchers

Forkers

starting-kit's Issues

Consideration of network architecture and learning algorithms for unlearning effectiveness

What is the reason not to compare with MIA with "Retrained" model?

Suggestion: Avoid Repeated Download of Weights

Question Regarding Optimal MIA and Overall Desired Objective

Face Synthetics labels

How are the unlearning evaluation metrics computed?

suggested fix to comment on losses histogram of the pre-trained model.

Hi Mimee,

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent