Coder Social home page Coder Social logo

outganfit's Introduction

outGANfit - a cDCGANs-based architecture

Conditional Deep Convolutional Generative Adversarial Networks (cDCGANs‐)based architecture to generate outfit items compatible with the input garment, using the Polyvore Outfits dataset.

Directory tree

  • architecture: in this directory the are the python scripts that define the architecture of the Autoencoder and the Generative Adversarial Network;
  • checkpoints: in this directory there are the saved pre-trained model that can be used to test the model without train it;
  • preprocessing: in this directory there are all the script used to preprocess out dataset;
  • test_results_images: in this directory there are some images extracted testing the model;
  • training: in this directoy there are all the methods written in order to train the autoencoder and the GANs;
  • utility: in this directory there are the utility methods used during the training phase, such as FID metric, Custom Image Dataset and so on.

The dataset

Our dataset, Polyvore Outfits (downloadable at this link) consists of 261,057 RGB images of garments measuring 300x300 pixels, divided into the categories “tops”, “bottoms”, “shoes”, “accessories”, “all-body”, “jewellery”, “bags”, “hats” and “outerwear”. From these, only the first four were chosen. The goal is to input a t-shirt to the model and have it generate a matching pant, pair of shoes and accessory (only sunglasses in our case).

outGANfit architecture

Our architecture includes three different GANs, one for each garment to be predicted (pants, shoes, accessories). For each gan we have a generator and two discriminators, one of which is used to evaluate the compatibility between the conditioning image (t-shirt) and the generated garment. The other discriminator is useful, on the other hand, to classify what is generated as real or fake. Architecture

Results

After the evaluation, we saved our best model in a .pth file, named “trained_generator_bottom.pth”. This was made in order to save the updated weights of our model. We loaded the model and the “testloader”, i.e. iterator of test images, and we tested it. We obtained a FID value of 175.4166, in accordance with our expectations. Below, some outputs: Results

Documentation

You can read the complete documentation at this link

Authors

outganfit's People

Contributors

davide-abbattista avatar vannisil avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.