DiffGuard

This repository contains the implementation of the ICCV2023 paper:

DiffGuard: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models
Ruiyuan Gao^$\dagger$, Chenchen Zhao^$\dagger$, Lanqing Hong^$\ddagger$, Qiang Xu^$\dagger$
The Chinese University of Hong Kong^$\dagger$, Huawei Noah’s Ark Lab^$\ddagger$

Given a classifier, the inherent property of semantic Out-of-Distribution (OOD) samples is that their contents differ from all legal classes in terms of semantics, namely semantic mismatch. In this work, we propose to directly use pre-trained diffusion models for semantic mismatch-guided OOD detection, named DiffGuard. Specifically, given an OOD input image and the predicted label from the classifier, we try to enlarge the semantic difference between the reconstructed OOD image under these conditions and the original input image. We also present several test-time techniques to further strengthen such differences.

Getting Started

Environment Setup

The code is tested with Pytorch==1.10.2 and torchvision==0.11.3. You should have these packages before starting. To install additional packages, follow:

git clone --recursive https://github.com/cure-lab/DiffGuard.git

cd ${ROOT}
pip install -r requirements.txt

# install external
cd ${ROOT}/external
pip install -e taming-transformers
ln -s pytorch-grad-cam/pytorch_grad_cam

# install current project (i.e., guided-diffusion package)
cd ${ROOT}
pip install -e .

Pretrained Weights

Since we use the pretrained diffusion models, all the pretrained weights can be downloaded from the corresponding open-sourced projects:

GDM: 256x256 diffusion (not class conditional), 256x256_diffusion_uncond.pt
LDM: LDM-VQ-8 on ImageNet, cin.zip

NOTICE: As pointed by #1, LDM used the OpenImage dataset for pre-training. This weight may not fit in OOD benchmarks.
ResNet50 classifier: either from torchvision (ImageNet V1) or OpenOOD

We assume you put them at ${ROOT}/../pretrained/ as follows:

${ROOT}/../pretrained/
├── guided-diffusion
│   └── 256x256_diffusion_uncond.pt
├── ldm
│   └── cin256-v2
├── openood
│   └── imagenet_res50_acc76.10.pth
└── torch_cache (we change the default cache dir of torchvision)
    └── checkpoints

Note that, we change torch cache directory with

torch.hub.set_dir("../pretrained/torch_cache")

in testOOD/test_openood.py. You can comment this line to use the default place.

Datasets

This project use ImageNet benchmark following OpenOOD. We assume the data directory locates at ${ROOT}/../data as follow

${ROOT}/../data/
../data/
├── imagenet_1k
│   └── val
└── images_largescale
    ├── imagenet_o
    │   ├── n01443537
    │   └── ...
    ├── inaturalist
    │   └── images
    ├── openimage_o
    │   └── images
    └── species_sub

together with the image list files from OpenOOD:

${ROOT}/useful_data/
└── benchmark_imglist
    └── imagenet
        └── *.txt

Run the Experiments

Our default log directory is ${ROOT}/../DiffGuard-log/, with is outside of ${ROOT}. Please be prepared.

All experiments can be started with testOOD/start_job.py with parse the override params from both .yaml files and command line. Check the example as follows to reproduce the results in Table.2 of our paper (only with DiffGuard):

GDM with 8 GPUs (V100 32Gb, default config takes about 23Gb)

python testOOD/start_job.py --overrides_file testOOD/exp/exp6.1.yaml \
    --world_size 8 [--init_method ... --task_index 6.1]

LDM with 8 GPUs (V100 32Gb, default config takes about 20Gb)

python testOOD/start_job.py --overrides_file testOOD/exp/ldm-exp2.25.yaml \
    --world_size 8 [--init_method ... --task_index 2.25]

We use Hydra to manage the configs. Using testOOD/start_job.py is similar to testOOD/test_openood.py. However, you need to parse the override parameters through command line to use the latter one.

Besides, we provides config to reproduce the results with oracle classifier, check configs in testOOD/exp/oracle/*.yaml

Other Scripts & configs

testOOD/show_openood.py: used for more visualizations
scripts_cls/*: analyze the classifier-under-protect, as in Fig.2
testOOD/exp/oracle/*: configs assuming an oracle classifier
testOOD/exp/speed/*: used to test inference speed
testOOD/exp/ablation/*: for ablation study on GDM

Cite Us

@inproceedings{gao2023diffguard,
  title={{DiffGuard}: Semantic Mismatch-Guided Out-of-Distribution Detection using Pre-trained Diffusion Models},
  author={Gao, Ruiyuan and Zhao, Chenchen and Hong, Lanqing and Xu, Qiang},
  booktitle={Proceedings of the IEEE/CVF International Conference on Computer Vision},
  year={2023}
}

Credit

We thank following open-sourced projects:

Our code use tools: pytorch-grad-cam and PyTorch Image Quality (PIQ)
Diffusion models from: latent-diffusion (with taming-transformers) and guided-diffusion
CIFAR-10 and ImageNet Benchmark for OOD detection: OpenOOD

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
assets		assets
external		external
guided_diffusion		guided_diffusion
ldm		ldm
network		network
ood_tester		ood_tester
scripts_cls		scripts_cls
testOOD		testOOD
useful_data		useful_data
.gitignore		.gitignore
.gitmodules		.gitmodules
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

DiffGuard

Getting Started

Environment Setup

Pretrained Weights

Datasets

Run the Experiments

Other Scripts & configs

Cite Us

Credit

About

Releases

Packages

Languages

cure-lab/DiffGuard

Folders and files

Latest commit

History

Repository files navigation

DiffGuard

Getting Started

Environment Setup

Pretrained Weights

Datasets

Run the Experiments

Other Scripts & configs

Cite Us

Credit

About

Topics

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages