Name		Name	Last commit message	Last commit date
parent directory ..
TOFU_data		TOFU_data
config		config
data		data
evals		evals
README.md		README.md
data_module.py		data_module.py
dataloader.py		dataloader.py
environment.yml		environment.yml
evaluate_util.py		evaluate_util.py
finetune.py		finetune.py
forget.py		forget.py
forget_wmdp.py		forget_wmdp.py
requirements.txt		requirements.txt
retrain.py		retrain.py
utils.py		utils.py

README.md

TOFU and WMDP

Installation

conda create -n tofu python=3.10
conda activate tofu
conda install pytorch pytorch-cuda=11.8 -c pytorch -c nvidia
conda install -c "nvidia/label/cuda-11.8.0" cuda-toolkit
pip install -r requirements.txt
pip install flash-attn --no-build-isolation

Get the origin model

For TOFU, we use 8 GPUs and fine-tune the model for 5 epochs with a learning rate of 1e-5 to obtain the origin model. The origin model can be downloaded directly from here.
For WMDP, you can obtained it from here.

Get and evaluate the unlearned model

First, you need to update the model_path in the forget.yaml file to the path corresponding to the origin model.
You can also modify the save_dir to change the path where the unlearned model will be saved.

To unlearn a model on a forget set, use the following command:

# forget05
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nproc_per_node=8 --master_port=$master_port forget.py --config-name=forget.yaml split=forget05 npo_coeff=0.1375 beta=2.5

# forget10
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nproc_per_node=8 --master_port=$master_port forget.py --config-name=forget.yaml split=forget10 npo_coeff=0.125 beta=4.5

# wmdp
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 torchrun --nproc_per_node=8 --master_port=$master_port forget_wmdp.py --config-name=forget_wmdp.yaml

Once the unlearning process is complete, the results will be saved in ${save_dir}/checkpoint/aggregate_stat.txt.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

TOFU

TOFU

README.md

TOFU and WMDP

Installation

Get the origin model

Get and evaluate the unlearned model

Download the model directly

Files

TOFU

Directory actions

More options

Directory actions

More options

Latest commit

History

TOFU

Folders and files

parent directory

README.md

TOFU and WMDP

Installation

Get the origin model

Get and evaluate the unlearned model

Download the model directly