
RDST

Official implementation of RDST, a residual dense vision transformer for medical image super-resolution with a novel general-purpose perceptual loss.

Introduction

This paper proposes a vision transformer with residual dense connections and local feature fusion for efficient single-image super-resolution (SISR) of medical modalities. In addition, we implement a general-purpose perceptual loss with manual control, which improves desired aspects of image quality by incorporating prior knowledge of medical image segmentation. Compared with state-of-the-art methods on four public medical image datasets, the proposed method achieves the best PSNR scores on six of seven modalities, with an average improvement of +0.09 dB PSNR while using only 38% of SwinIR's parameters. Moreover, the segmentation-based perceptual loss improves PSNR by +0.14 dB on average for state-of-the-art methods, including both CNNs and vision transformers. Finally, we conduct comprehensive ablation studies to discuss potential factors behind the superior performance of vision transformers over CNNs and the impact of the network and loss-function components.
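
To make the segmentation-based perceptual loss concrete, the minimal sketch below compares SR and HR images in the feature space of a frozen, pretrained segmentation network, with per-layer weights acting as the manual control mentioned above. This is an illustration only: the class name, the layer_weights argument, and the extract_features helper are assumptions for the example, not the repository's API.

# Minimal sketch of a segmentation-based perceptual loss (illustration only).
# `seg_net` is assumed to be a pretrained, frozen medical-image segmentation
# network exposing a hypothetical `extract_features` helper that returns a
# list of intermediate feature maps.
import torch
import torch.nn as nn


class SegPerceptualLoss(nn.Module):
    def __init__(self, seg_net, layer_weights=(1.0, 1.0, 1.0)):
        super().__init__()
        self.seg_net = seg_net.eval()
        for p in self.seg_net.parameters():
            p.requires_grad = False            # keep the segmentation prior fixed
        self.layer_weights = layer_weights     # manual control over which features to emphasise
        self.l1 = nn.L1Loss()

    def forward(self, sr, hr):
        # Compare SR and HR images in the segmentation network's feature space.
        feats_sr = self.seg_net.extract_features(sr)       # hypothetical helper
        with torch.no_grad():
            feats_hr = self.seg_net.extract_features(hr)
        loss = 0.0
        for w, f_sr, f_hr in zip(self.layer_weights, feats_sr, feats_hr):
            loss = loss + w * self.l1(f_sr, f_hr)
        return loss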

Framework of the proposed RDST network.
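
The building block behind this framework connects layers densely: each layer receives the concatenation of the block input and the outputs of all previous layers, a 1x1 convolution fuses the collected features locally, and a residual connection wraps the whole block. A minimal sketch of this connectivity pattern follows; the class name is hypothetical and a plain 3x3 convolution stands in for the Swin transformer layers, so it illustrates the structure rather than reproducing the repository's actual block.

# Sketch of a residual dense block with local feature fusion (illustration only).
import torch
import torch.nn as nn


class ResidualDenseBlock(nn.Module):
    def __init__(self, channels, num_layers=4):
        super().__init__()
        # Dense connections: layer i sees the concatenation of the block input
        # and the outputs of all previous layers.
        self.layers = nn.ModuleList([
            nn.Conv2d(channels * (i + 1), channels, kernel_size=3, padding=1)
            for i in range(num_layers)
        ])
        # Local feature fusion: a 1x1 convolution over all collected features.
        self.fusion = nn.Conv2d(channels * (num_layers + 1), channels, kernel_size=1)

    def forward(self, x):
        feats = [x]
        for layer in self.layers:
            feats.append(layer(torch.cat(feats, dim=1)))
        # Residual connection around the fused local features.
        return x + self.fusion(torch.cat(feats, dim=1))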

Results

Broad applicability on medical images

Example SR results on the OASIS, BraTS, ACDC, and COVID-CT datasets.

Comparing with SOTA methods (PSNR + Segmentation)

On OASIS

Segmentation-based perceptual loss

Train & Test

To set up:

git clone https://github.com/GinZhu/RDST.git
cd RDST
pip install -r requirements.txt

To train:

python -W ignore train.py --config-file config_files/RDST_E1_OASIS_example_SRx4.ini

To test:

python -W ignore test.py --config-file config_files/RDST_E1_OASIS_example_SRx4_testing.ini
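
For reference, configuration-file-driven scripts of this kind are usually wired up roughly as in the sketch below. This is not the repository's actual train.py; it only shows the general pattern, and the example .ini files under config_files/ remain the authoritative source for the real options.

# Illustrative sketch of a config-file-driven entry point (not the repo's train.py).
import argparse
import configparser


def main():
    parser = argparse.ArgumentParser()
    parser.add_argument("--config-file", required=True, help="path to an .ini config file")
    args = parser.parse_args()

    cfg = configparser.ConfigParser()
    cfg.read(args.config_file)

    # Dump the parsed settings; a real script would build the model, data
    # loaders, and training loop from these values instead.
    for section in cfg.sections():
        print(f"[{section}]")
        for key, value in cfg[section].items():
            print(f"  {key} = {value}")


if __name__ == "__main__":
    main()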

Pre-trained models

We provide the following pre-trained models (trained on the OASIS dataset) for download:

  • RDST-E1: +0.16 dB PSNR over SwinIR with only 38% of its parameters;
  • RDST-HRL: Dice coefficient gains of [+0.0016, +0.0051, +0.0005, +0.0005] over SwinIR;
  • RDST-E: +0.02 dB PSNR over SwinIR with only 20% of its parameters and 46% faster inference.

Publications & citations

This work is available on arXiv; please cite it as:

@article{zhu2023rdst,
  title={A residual dense vision transformer for medical image super-resolution with segmentation-based perceptual loss fine-tuning},
  author={Zhu, Jin and Yang, Guang and Lio, Pietro},
  journal={arXiv preprint arXiv:2302.11184},
  year={2023}
}

For a better understanding of this project, we refer to the following previous works:
