Multimodal Style Transfer via Graph Cuts

Pytorch(1.0+) implementation(unofficial) of ICCV 2019 paper "Multimodal Style Transfer via Graph Cuts".

Original tensorflow implementations from the author can be found here.

This repository provides a pre-trained model for you to generate your own image given content image and style image. Also, you can download the training dataset or prepare your own dataset to train the model from scratch.

Requirements

Python 3.6+
PyTorch 1.0+
PyMaxflow
Pillow
TorchVision

Optional but recommended for training

GPU environment

I have provided a jupyter notebook with instructions to run the training and testing python files using the Google Colab's free GPU.

If you wish to run them on your local machine you can follow the instructions below. Note: Testing does not require GPU.

train

Download COCO for content dataset and Wikiart for style dataset and unzip them, rename them as content and style respectively (recommended).
Modify the argument in the train.py such as the path of directory, epoch, learning_rate or you can add your own training code.
Train the model using gpu.

python train.py

 usage: train.py [-h] [--batch_size BATCH_SIZE] [--epoch EPOCH] [--gpu GPU]
                 [--learning_rate LEARNING_RATE]
                 [--snapshot_interval SNAPSHOT_INTERVAL]
                 [--n_cluster N_CLUSTER] [--alpha ALPHA] [--lam LAM]
                 [--max_cycles MAX_CYCLES] [--gamma GAMMA]
                 [--train_content_dir TRAIN_CONTENT_DIR]
                 [--train_style_dir TRAIN_STYLE_DIR]
                 [--test_content_dir TEST_CONTENT_DIR]
                 [--test_style_dir TEST_STYLE_DIR] 
                 [--save_dir SAVE_DIR] [--reuse REUSE]

test

Clone this repository

git clone https://github.com/Rakshit-Shetty/Multimodal-Style-Transfer-via-Graph-Cuts.git
cd Multimodal-Style-Transfer-via-Graph-Cuts

Prepare your content image and style image. I provide some in the content and style and you can try to use them easily.
Download the pretrained model here

Generate the output image. 3 outputs in total. A transferred output image with and without style image and a nested_demo_like image like those before and after image will be generated.

python test.py -c content_image_path -s style_image_path

 usage: test.py [-h] [--content CONTENT] [--style STYLE]
             [--output_name OUTPUT_NAME] [--n_cluster N_CLUSTER]
             [--alpha ALPHA] [--lam LAM] [--max_cycles MAX_CYCLES]
             [--gpu GPU] [--model_state_path MODEL_STATE_PATH]

If output_name is not given, it will use the combination of content image name and style image name.

Result

Some results of content image will be shown here.

rakshit-shetty / multimodal-style-transfer-via-graph-cuts Goto Github PK