fast-neural-style

This is the code for the paper

Perceptual Losses for Real-Time Style Transfer and Super-Resolution
Justin Johnson, Alexandre Alahi, Li Fei-Fei
Presented at ECCV 2016

The paper builds on A Neural Algorithm of Artistic Style by Leon A. Gatys, Alexander S. Ecker, and Matthias Bethge by training feedforward neural networks that apply artistic styles to images. After training, our feedforward networks can stylize images hundreds of times faster than the optimization-based method presented by Gatys et al.

This repository also includes an implementation of instance normalization as described in the paper Instance Normalization: The Missing Ingredient for Fast Stylization by Dmitry Ulyanov, Andrea Vedaldi, and Victor Lempitsky. This simple trick significantly improves the quality of feedforward style transfer models.

Stylizing this image of the Stanford campus at a resolution of 1200x630 takes 50 milliseconds on a Pascal Titan X:

In this repository we provide:

The style transfer models used in the paper
Additional models using instance normalization
Code for running models on new images
A demo that runs models in real-time off a webcam
Code for training new feedforward style transfer models
An implementation of optimization-based style transfer method described by Gatys et al.

If you find this code useful for your research, please cite

@inproceedings{Johnson2016Perceptual,
  title={Perceptual losses for real-time style transfer and super-resolution},
  author={Johnson, Justin and Alahi, Alexandre and Fei-Fei, Li},
  booktitle={European Conference on Computer Vision},
  year={2016}
}

Setup

All code is implemented in Torch.

First install Torch, then update / install the following packages:

luarocks install torch
luarocks install nn
luarocks install image
luarocks install lua-cjson

(Optional) GPU Acceleration

If you have an NVIDIA GPU, you can accelerate all operations with CUDA.

First install CUDA, then update / install the following packages:

luarocks install cutorch
luarocks install cunn

(Optional) cuDNN

When using CUDA, you can use cuDNN to accelerate convolutions.

First download cuDNN and copy the libraries to /usr/local/cuda/lib64/. Then install the Torch bindings for cuDNN:

luarocks install cudnn

Pretrained Models

Download all pretrained style transfer models by running the script

bash models/download_style_transfer_models.sh

This will download ten model files (~200MB) to the folder models/.

Models from the paper

The style transfer models we used in the paper will be located in the folder models/eccv16. Here are some example results where we use these models to stylize this image of the Chicago skyline with at an image size of 512:

Models with instance normalization

As discussed in the paper Instance Normalization: The Missing Ingredient for Fast Stylization by Dmitry Ulyanov, Andrea Vedaldi, and Victor Lempitsky, replacing batch normalization with instance normalization significantly improves the quality of feedforward style transfer models.

We have trained several models with instance normalization; after downloading pretrained models they will be in the folder models/instance_norm.

These models use the same architecture as those used in our paper, except with half the number of filters per layer and with instance normalization instead of batch normalization. Using narrower layers makes the models smaller and faster without sacrificing model quality.

Here are some example outputs from these models, with an image size of 1024:

Running on new images

The script fast_neural_style.lua lets you use a trained model to stylize new images:

th fast_neural_style.lua \
  -model models/eccv16/starry_night.t7 \
  -input_image images/content/chicago.jpg \
  -output_image out.png

You can run the same model on an entire directory of images like this:

th fast_neural_style.lua \
  -model models/eccv16/starry_night.t7 \
  -input_dir images/content/ \
  -output_dir out/

You can control the size of the output images using the -image_size flag.

By default this script runs on CPU; to run on GPU, add the flag -gpu specifying the GPU on which to run.

The full set of options for this script is described here.

Webcam demo

You can use the script webcam_demo.lua to run one or more models in real-time off a webcam stream. To run this demo you need to use qlua instead of th:

qlua webcam_demo.lua -models models/instance_norm/candy.t7 -gpu 0

You can run multiple models at the same time by passing a comma-separated list to the -models flag:

qlua webcam_demo.lua \
  -models models/instance_norm/candy.t7,models/instance_norm/udnie.t7 \
  -gpu 0

With a Pascal Titan X you can easily run four models in realtime at 640x480:

The webcam demo depends on a few extra Lua packages:

You can install / update these packages by running:

luarocks install camera
luarocks install qtlua

The full set of options for this script is described here.

Training new models

You can find instructions for training new models here.

Optimization-based Style Transfer

The script slow_neural_style.lua is similar to the original neural-style, and uses the optimization-based style-transfer method described by Gatys et al.

This script uses the same code for computing losses as the feedforward training script, allowing for fair comparisons between feedforward style transfer networks and optimization-based style transfer.

Compared to the original neural-style, this script has the following improvements:

Remove dependency on protobuf and loadcaffe
Support for many more CNN architectures, including ResNets

The full set of options for this script is described here.

License

Free for personal or research use; for commercial use please contact me.

Name		Name	Last commit message	Last commit date
Latest commit History 156 Commits
doc		doc
fast_neural_style		fast_neural_style
images		images
models		models
scripts		scripts
test		test
.gitignore		.gitignore
README.md		README.md
autoencoder.json		autoencoder.json
autoencoder.t7		autoencoder.t7
autoencoder_chicago_v10.png		autoencoder_chicago_v10.png
batch_my_out_chiago_mosaic.png		batch_my_out_chiago_mosaic.png
checkpoint.json		checkpoint.json
checkpoint.t7		checkpoint.t7
composition_batch.json		composition_batch.json
composition_batch.t7		composition_batch.t7
composition_instance.json		composition_instance.json
composition_instance.t7		composition_instance.t7
fast_neural_style.lua		fast_neural_style.lua
femme-nue-resize.json		femme-nue-resize.json
femme-nue-resize.t7		femme-nue-resize.t7
femme-nue_batch.json		femme-nue_batch.json
femme-nue_batch.t7		femme-nue_batch.t7
femme-nue_instance.json		femme-nue_instance.json
femme-nue_instance.t7		femme-nue_instance.t7
la_muse_instance_out.png		la_muse_instance_out.png
la_muse_vgg_out.png		la_muse_vgg_out.png
mosaic.json		mosaic.json
mosaic.t7		mosaic.t7
mosaic_style.json		mosaic_style.json
mosaic_style.t7		mosaic_style.t7
my_composition_batch_v10.png		my_composition_batch_v10.png
my_composition_batch_v15.png		my_composition_batch_v15.png
my_composition_batch_v21.png		my_composition_batch_v21.png
my_composition_batch_v3.png		my_composition_batch_v3.png
my_composition_batch_v33.png		my_composition_batch_v33.png
my_composition_batch_v5.png		my_composition_batch_v5.png
my_composition_batch_v7.png		my_composition_batch_v7.png
my_femme-nue_batch_v1.png		my_femme-nue_batch_v1.png
my_femme-nue_batch_v11.png		my_femme-nue_batch_v11.png
my_femme-nue_batch_v15.png		my_femme-nue_batch_v15.png
my_femme-nue_batch_v20.png		my_femme-nue_batch_v20.png
my_femme-nue_batch_v25.png		my_femme-nue_batch_v25.png
my_femme-nue_batch_v3.png		my_femme-nue_batch_v3.png
my_femme-nue_batch_v30.png		my_femme-nue_batch_v30.png
my_femme-nue_batch_v33.png		my_femme-nue_batch_v33.png
my_femme-nue_batch_v4.png		my_femme-nue_batch_v4.png
my_femme-nue_batch_v5.png		my_femme-nue_batch_v5.png
my_femme-nue_batch_v6.png		my_femme-nue_batch_v6.png
my_femme-nue_batch_v7.png		my_femme-nue_batch_v7.png
my_femme-nue_batch_v8.png		my_femme-nue_batch_v8.png
my_femme-nue_instance_v1.png		my_femme-nue_instance_v1.png
my_femme-nue_instance_v10.png		my_femme-nue_instance_v10.png
my_femme-nue_instance_v15.png		my_femme-nue_instance_v15.png
my_femme-nue_instance_v2.png		my_femme-nue_instance_v2.png
my_femme-nue_instance_v20.png		my_femme-nue_instance_v20.png
my_femme-nue_instance_v3.png		my_femme-nue_instance_v3.png
my_femme-nue_instance_v30.png		my_femme-nue_instance_v30.png
my_femme-nue_instance_v5.png		my_femme-nue_instance_v5.png
my_femme-nue_resize_v10.png		my_femme-nue_resize_v10.png
my_mosaic_instance_out.png		my_mosaic_instance_out.png
my_mosaic_instance_out_v10.png		my_mosaic_instance_out_v10.png
my_mosaic_instance_out_v2.png		my_mosaic_instance_out_v2.png
my_mosaic_instance_out_v3.png		my_mosaic_instance_out_v3.png
my_mosaic_instance_out_v7.png		my_mosaic_instance_out_v7.png
my_scream_batch_out_10.png		my_scream_batch_out_10.png
my_scream_batch_out_5.png		my_scream_batch_out_5.png
my_scream_batch_out_6.png		my_scream_batch_out_6.png
my_scream_batch_out_v1.png		my_scream_batch_out_v1.png
my_scream_batch_out_v10.png		my_scream_batch_out_v10.png
my_scream_batch_out_v11.png		my_scream_batch_out_v11.png
my_scream_batch_out_v15.png		my_scream_batch_out_v15.png
my_scream_batch_out_v2.png		my_scream_batch_out_v2.png
my_scream_batch_out_v20.png		my_scream_batch_out_v20.png
my_scream_batch_out_v25.png		my_scream_batch_out_v25.png
my_scream_batch_out_v3.png		my_scream_batch_out_v3.png
my_scream_batch_out_v37.png		my_scream_batch_out_v37.png
my_scream_batch_out_v5.png		my_scream_batch_out_v5.png
my_scream_instance_out_v10.png		my_scream_instance_out_v10.png
my_scream_instance_out_v15.png		my_scream_instance_out_v15.png
my_scream_instance_out_v40_resumed.png		my_scream_instance_out_v40_resumed.png
my_scream_instance_out_v5.png		my_scream_instance_out_v5.png
my_starry_night.png		my_starry_night.png
my_starry_night_0_content_v1.png		my_starry_night_0_content_v1.png
my_starry_night_0_content_v12.png		my_starry_night_0_content_v12.png
my_starry_night_0_content_v15.png		my_starry_night_0_content_v15.png
my_starry_night_0_content_v2.png		my_starry_night_0_content_v2.png
my_starry_night_0_content_v20.png		my_starry_night_0_content_v20.png
my_starry_night_0_content_v3.png		my_starry_night_0_content_v3.png
my_starry_night_0_content_v30.png		my_starry_night_0_content_v30.png
my_starry_night_0_content_v4.png		my_starry_night_0_content_v4.png
my_starry_night_0_content_v5.png		my_starry_night_0_content_v5.png
my_starry_night_1content_10style_v1.png		my_starry_night_1content_10style_v1.png
my_starry_night_1content_10style_v10.png		my_starry_night_1content_10style_v10.png
my_starry_night_1content_10style_v2.png		my_starry_night_1content_10style_v2.png
my_starry_night_1content_10style_v20.png		my_starry_night_1content_10style_v20.png
my_starry_night_1content_10style_v3.png		my_starry_night_1content_10style_v3.png
my_starry_night_1content_10style_v30.png		my_starry_night_1content_10style_v30.png
my_starry_night_1content_10style_v5.png		my_starry_night_1content_10style_v5.png
my_starry_night_1content_1style_v1.png		my_starry_night_1content_1style_v1.png

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

fast-neural-style

Setup

(Optional) GPU Acceleration

(Optional) cuDNN

Pretrained Models

Models from the paper

Models with instance normalization

Running on new images

Webcam demo

Training new models

Optimization-based Style Transfer

License

About

Releases

Packages

Languages

gutihernandez/fast-neural-style

Folders and files

Latest commit

History

Repository files navigation

fast-neural-style

Setup

(Optional) GPU Acceleration

(Optional) cuDNN

Pretrained Models

Models from the paper

Models with instance normalization

Running on new images

Webcam demo

Training new models

Optimization-based Style Transfer

License

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages