BicycleGAN

[Project Page] [Paper] [Demo Video]

PyTorch implementation of multimodal image-to-image translation. For example, given the same night image, our model can synthesize several plausible day images with different types of lighting, sky, and clouds. Training requires paired data.

Note: The current software works well with PyTorch 0.4. Check out the older branch that supports PyTorch 0.1-0.3.

Toward Multimodal Image-to-Image Translation.
Jun-Yan Zhu, Richard Zhang, Deepak Pathak, Trevor Darrell, Alexei A. Efros, Oliver Wang, Eli Shechtman.
UC Berkeley and Adobe Research
In NIPS, 2017.

Example results

Other Implementations

[Tensorflow] by Youngwoon Lee (USC CLVR Lab).
[Tensorflow] by Kv Manohar.

Prerequisites

Linux or macOS
Python 2 or 3
CPU or NVIDIA GPU + CUDA CuDNN

Getting Started

Installation

Clone this repo:

git clone -b master --single-branch https://github.com/junyanz/BicycleGAN.git
cd BicycleGAN

Install PyTorch and dependencies from http://pytorch.org
Install the Python libraries visdom, dominate, and moviepy.

For pip users:

bash ./scripts/install_pip.sh

For conda users:

bash ./scripts/install_conda.sh

Use a Pre-trained Model

Download some test photos (e.g., edges2shoes):

bash ./datasets/download_testset.sh edges2shoes

Download a pre-trained model (e.g., edges2shoes):

bash ./pretrained_models/download_model.sh edges2shoes

Generate results with the model

bash ./scripts/test_edges2shoes.sh

The test results will be saved to an HTML file: ./results/edges2shoes/val/index.html.

Generate results with synchronized latent vectors

bash ./scripts/test_edges2shoes.sh --sync

Results can be found at ./results/edges2shoes/val_sync/index.html.
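The difference between the two test modes can be sketched in a few lines. This is a minimal illustration, not the repository's code: it assumes that latent codes are drawn from a standard normal distribution and that --sync means the same set of codes is reused for every input image. The function name sample_latents and the latent size nz=8 are hypothetical choices made for the example.

```python
import random


def sample_latents(num_inputs, num_samples, nz=8, sync=False, seed=0):
    """Return one list of latent codes per input image.

    Assumptions (not taken from the repository): codes are N(0, 1) samples,
    and nz=8 is a hypothetical latent size chosen for illustration.
    """
    rng = random.Random(seed)

    def draw():
        # One latent code: nz independent standard normal samples.
        return [rng.gauss(0.0, 1.0) for _ in range(nz)]

    if sync:
        # Draw the codes once and reuse them for every input image.
        shared = [draw() for _ in range(num_samples)]
        return [shared for _ in range(num_inputs)]

    # Otherwise each input image gets its own fresh set of codes.
    return [[draw() for _ in range(num_samples)] for _ in range(num_inputs)]
```

With synchronized codes, the k-th output for every input shares the same latent code, which makes it easier to compare how one code affects different inputs.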

Generate Morphing Videos

We can also produce a morphing video similar to this GIF and YouTube video.

bash ./scripts/video_edges2shoes.sh

Results can be found at ./videos/edges2shoes/.
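One common way to build such a morph is to interpolate between two latent codes and render one frame per intermediate code. The sketch below assumes plain linear interpolation; the video script may use a different scheme. The names lerp and morph_path are hypothetical and exist only for this example.

```python
def lerp(z0, z1, t):
    """Linearly interpolate between two latent codes for t in [0, 1]."""
    return [(1.0 - t) * a + t * b for a, b in zip(z0, z1)]


def morph_path(z0, z1, num_frames):
    """Return num_frames latent codes moving evenly from z0 to z1."""
    if num_frames < 2:
        raise ValueError("need at least two frames to interpolate")
    # Frame i uses t = i / (num_frames - 1), so the endpoints are exact.
    return [lerp(z0, z1, i / (num_frames - 1)) for i in range(num_frames)]
```

Feeding each code in the returned path to the generator with a fixed input image would yield the frames of one morphing sequence.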

Model Training

To train a model, download the training images (e.g., edges2shoes).

bash ./datasets/download_dataset.sh edges2shoes

Train a model:

bash ./scripts/train_edges2shoes.sh

To view training results and loss plots, run python -m visdom.server and open the URL http://localhost:8097. To see more intermediate results, check out ./checkpoints/edges2shoes_bicycle_gan/web/index.html.
See more training details for other datasets in ./scripts/train.sh.
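If the plots do not appear, it can help to confirm that something is listening on the visdom port before starting training. The helper below is a small sketch using only the Python standard library; the function name server_is_up is hypothetical, and the default port 8097 is taken from the URL above.

```python
import socket


def server_is_up(host="localhost", port=8097, timeout=1.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        # create_connection raises OSError when nothing accepts the connection.
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print("visdom reachable:", server_is_up())
```

This only checks that the port accepts connections; it does not verify that the process behind it is actually visdom.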

Datasets (from pix2pix)

Download the datasets using the following scripts. Many of the datasets were collected by other researchers. Please cite their papers if you use the data.

Download the test set only:

bash ./datasets/download_testset.sh dataset_name

Download the training and test sets:

bash ./datasets/download_dataset.sh dataset_name

facades: 400 images from CMP Facades dataset. [Citation]
maps: 1096 training images scraped from Google Maps
edges2shoes: 50K training images from UT Zappos50K dataset. Edges are computed by HED edge detector + post-processing. [Citation]
edges2handbags: 137K Amazon Handbag images from iGAN project. Edges are computed by HED edge detector + post-processing. [Citation]
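Since training requires paired data, it is worth checking how a pair is stored before writing a custom loader. The sketch below assumes the pix2pix convention of storing the input and target side by side in a single image file; verify this against the downloaded files. The function name split_boxes is hypothetical. It returns boxes in the (left, upper, right, lower) order that Pillow's Image.crop expects.

```python
def split_boxes(width, height):
    """Return crop boxes for the left and right halves of a paired image.

    Assumption: the two images of a pair are concatenated horizontally,
    so each half is width // 2 pixels wide.
    """
    if width % 2 != 0:
        raise ValueError("expected an even width for a side-by-side pair")
    half = width // 2
    left_box = (0, 0, half, height)
    right_box = (half, 0, width, height)
    return left_box, right_box
```

For example, with Pillow installed, img.crop(left_box) and img.crop(right_box) would give the two halves of an opened image img.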

Models

Download the pre-trained models with the following script.

bash ./pretrained_models/download_model.sh model_name

edges2shoes (edge -> photo) trained on UT Zappos50K dataset.
edges2handbags (edge -> photo) trained on Amazon handbag images.

bash ./pretrained_models/download_model.sh edges2handbags
bash ./datasets/download_testset.sh edges2handbags
bash ./scripts/test_edges2handbags.sh

night2day (nighttime scene -> daytime scene) trained on around 100 webcams.

bash ./pretrained_models/download_model.sh night2day
bash ./datasets/download_testset.sh night2day
bash ./scripts/test_night2day.sh

facades (facade label -> facade photo) trained on the CMP Facades dataset.

bash ./pretrained_models/download_model.sh facades
bash ./datasets/download_testset.sh facades
bash ./scripts/test_facades.sh

maps (map photo -> aerial photo) trained on 1096 training images scraped from Google Maps.

bash ./pretrained_models/download_model.sh maps
bash ./datasets/download_testset.sh maps
bash ./scripts/test_maps.sh
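The three commands follow the same pattern for every model, so they can be generated rather than typed. The helper below is a convenience sketch, not part of the repository: the names MODELS, commands_for, and run_all are hypothetical, and the script paths simply follow the naming used in this README. It must be run from the repository root.

```python
import subprocess

# Model names as listed in this README.
MODELS = ["edges2shoes", "edges2handbags", "night2day", "facades", "maps"]


def commands_for(model):
    """Return the download-model, download-testset, and test commands."""
    if model not in MODELS:
        raise ValueError("unknown model: {}".format(model))
    return [
        ["bash", "./pretrained_models/download_model.sh", model],
        ["bash", "./datasets/download_testset.sh", model],
        ["bash", "./scripts/test_{}.sh".format(model)],
    ]


def run_all(models=MODELS, dry_run=True):
    """Print the commands (dry_run=True) or execute them in order."""
    for model in models:
        for cmd in commands_for(model):
            if dry_run:
                print(" ".join(cmd))
            else:
                # check=True stops at the first failing script.
                subprocess.run(cmd, check=True)
```

Calling run_all() prints the commands without executing anything; pass dry_run=False to actually download and test each model.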

Citation

If you find this useful for your research, please cite the following.

@incollection{zhu2017multimodal,
	title = {Toward Multimodal Image-to-Image Translation},
	author = {Zhu, Jun-Yan and Zhang, Richard and Pathak, Deepak and Darrell, Trevor and Efros, Alexei A and Wang, Oliver and Shechtman, Eli},
	booktitle = {Advances in Neural Information Processing Systems 30},
	year = {2017},
}

Acknowledgements

This code borrows heavily from the pytorch-CycleGAN-and-pix2pix repository.
