Text to Image generation

March 10, 2018

* created inverse GANs

March 19, 2018

* added is_cuda check for train.py
* using --split=2 for inference (testing set)
* additional generator loss terms used by the original code author (adversarial + L2 on discriminator activations + L1 on images):
	g_loss = criterion(outputs, real_labels) \
	         + self.l2_coef * l2_loss(activation_fake, activation_real.detach()) \
	         + self.l1_coef * l1_loss(fake_images, right_images)
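The combined generator loss above can be sketched in PyTorch as follows. This is a minimal illustration, not the original author's code: the function name `generator_loss`, the choice of `nn.BCELoss`, `nn.MSELoss` and `nn.L1Loss` for `criterion`, `l2_loss` and `l1_loss`, the tensor shapes, and the coefficient values in the usage lines are all assumptions.

```python
import torch
import torch.nn as nn

# Assumed loss choices; the notes only name criterion, l2_loss and l1_loss.
criterion = nn.BCELoss()   # expects discriminator outputs in [0, 1]
l2_loss = nn.MSELoss()
l1_loss = nn.L1Loss()


def generator_loss(outputs, activation_fake, activation_real,
                   fake_images, right_images, l2_coef, l1_coef):
    """Adversarial term + L2 on discriminator activations + L1 on images."""
    real_labels = torch.ones_like(outputs)
    adversarial = criterion(outputs, real_labels)
    # detach() keeps gradients from flowing into the real-image activations.
    feature_term = l2_loss(activation_fake, activation_real.detach())
    pixel_term = l1_loss(fake_images, right_images)
    return adversarial + l2_coef * feature_term + l1_coef * pixel_term


# Usage with random stand-in tensors (shapes and coefficients are hypothetical).
torch.manual_seed(0)
batch = 4
fake_images = torch.rand(batch, 3, 64, 64, requires_grad=True)
right_images = torch.rand(batch, 3, 64, 64)
outputs = torch.sigmoid(fake_images.mean(dim=(1, 2, 3)))   # stand-in for D(fake)
activation_fake = fake_images.mean(dim=(2, 3))             # stand-in activations
activation_real = right_images.mean(dim=(2, 3))

g_loss = generator_loss(outputs, activation_fake, activation_real,
                        fake_images, right_images, l2_coef=100.0, l1_coef=50.0)
g_loss.backward()
print(g_loss.item())
```

In a real training loop `outputs` and the activations would come from the discriminator rather than from image means; the stand-ins here only keep the example self-contained.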

GAN losses

* forward GAN:
	* disc_loss = real_loss + fake_loss + wrong_loss, where wrong_loss is the cls term (wrong image + right embedding)
	* gen_loss = g_loss
* inverse GAN:
	* disc_loss = real_loss + fake_loss + wrong_loss, where wrong_loss is the cls term (wrong embedding + right image)
	* gen_loss = criterion(outputs, real_labels) # that's it?
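The two discriminator losses listed above share one shape, which the following PyTorch sketch tries to make explicit. It is a sketch under assumptions, not the repository's implementation: `ToyDiscriminator`, `disc_loss`, the use of `nn.BCELoss`, and all tensor shapes are hypothetical. Treating the cls term symmetrically in both directions is one possible reading of the notes, not a confirmed design.

```python
import torch
import torch.nn as nn

criterion = nn.BCELoss()  # assumed; expects discriminator outputs in [0, 1]


class ToyDiscriminator(nn.Module):
    """Hypothetical stand-in: scores a (sample, condition) pair in [0, 1]."""

    def __init__(self, sample_dim, cond_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(sample_dim + cond_dim, 1), nn.Sigmoid())

    def forward(self, sample, condition):
        pair = torch.cat([sample.flatten(1), condition.flatten(1)], dim=1)
        return self.net(pair).squeeze(1)


def disc_loss(disc, right_sample, fake_sample, wrong_sample, condition):
    """real_loss + fake_loss + wrong_loss for one discriminator update."""
    real_out = disc(right_sample, condition)
    # detach() so this update does not push gradients into the generator.
    fake_out = disc(fake_sample.detach(), condition)
    wrong_out = disc(wrong_sample, condition)
    real_loss = criterion(real_out, torch.ones_like(real_out))
    fake_loss = criterion(fake_out, torch.zeros_like(fake_out))
    wrong_loss = criterion(wrong_out, torch.zeros_like(wrong_out))  # cls term
    return real_loss + fake_loss + wrong_loss


torch.manual_seed(0)
batch, image_dim, embed_dim = 4, 3 * 8 * 8, 16
right_images = torch.rand(batch, 3, 8, 8)
wrong_images = torch.rand(batch, 3, 8, 8)
fake_images = torch.rand(batch, 3, 8, 8)
right_embed = torch.randn(batch, embed_dim)
wrong_embed = torch.randn(batch, embed_dim)
fake_embed = torch.randn(batch, embed_dim)

# Forward GAN: samples are images, the condition is the right embedding.
forward_disc = ToyDiscriminator(image_dim, embed_dim)
forward_loss = disc_loss(forward_disc, right_images, fake_images,
                         wrong_images, right_embed)

# Inverse GAN: samples are embeddings, the condition is the right image.
inverse_disc = ToyDiscriminator(embed_dim, image_dim)
inverse_loss = disc_loss(inverse_disc, right_embed, fake_embed,
                         wrong_embed, right_images)
print(forward_loss.item(), inverse_loss.item())
```

Writing the loss once and swapping which tensor plays the sample and which plays the condition keeps the two directions comparable while the inverse GAN design is still open.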

Questions:

* cls: what would the cls loss be for disc_loss in the inverse GAN?

TODO:

  1. prepare the COCO dataset (dataloader function)
  2. generate word embeddings (using skip-thought vectors or gensim?)
  3. GAN for image generation from word embeddings
  4. GAN for caption generation
  5. CycleGAN structure
