Building a Deep Autoencoder - Lab

Introduction

Deep auto encoders are characterized by having more than one layers in their encoder and decoder components. In this lab, we will mainly repeat the last experiment, but using a deep architecture instead of a simple feed forward styles networks for encoding and decoding the last lab.

Objectives

You will be able to:

Build a deep autoencoder in Keras
Create the encoder and decoder functions as multiple fully connected layers
Train an autoencoder with selected loss function and optimizer

Deep Autoencoders

The extension of the simple Autoencoder is the Deep Autoencoder. The first layer of the Deep Autoencoder is used for first-order features in the raw input. The second layer is used for second-order features corresponding to patterns in the appearance of first-order features. This is also knows as FoF in deep learning i.e. Features of Features. Deeper layers of the Deep Autoencoder tend to learn even higher-order features.

A deep autoencoder is composed of two, symmetrical deep-belief networks

DBN (Deep belief Network) is a class of deep neural network which comprises of multiple layer of graphical model having both directed and undirected edges. It is composed of multiple layers of hidden units, where each layers are connected with each others but units are not. Visist here for details.

First four or five shallow layers representing the encoding half of the net. The second set of four or five layers that make up the decoding half.

In our previous lab, we used single fully-connected layers for both the encoding and decoding models while building a simple AE. With deep AE, we can stack multiple fully-connected layers to make each of the encoder and decoder functions deep, turning our simple model into a deep architecture.

In this lab, we'll do just that: we'll repeat the initial problem setup, importing the same dataset and performing the same preprocessing. From there, we'll once again build an autoencoder, but this time, we will stack multiple layers in order to improve our performance.

Import the code for reading + preprocessing fashion-MNIST dataset

# Load the fashion dataset

#Normalize the train and test sets to a range of 0 to 1

#Reshape the training and test data to create 1D vectors

Build the Deep Autoencoder

So this time, we are building a deep autoencoder. The code for this wo't be much different to what we saw earlier. Here we are adding a few extra encoding and decoding layers as listed below:

Use 3 fully-connected layers for the encoding model that inputs original 784 dimensions and decrease the dimensionality from 128 to 64 to 32.
Add 3 fully-connected decoder layers that reconstruct the image back to 784 dimensions.
Except for the last layer, use ReLU activation functions in all other layers
Show the model summary

# Build a deep AE

# Encoder Layers

# Decoder Layers

#Check a summary of the autoencoder

Extract the Encoder

As seen previously, we will now extract the encoder model from the above. Remember the encoder model now consists of the first 3 layers in the autoencoder.

# Extract the Encoder model and output a summary like this:

_________________________________________________________________
Layer (type)                 Output Shape              Param #   
=================================================================
input_2 (InputLayer)         (None, 784)               0         
_________________________________________________________________
dense_1 (Dense)              (None, 128)               100480    
_________________________________________________________________
dense_2 (Dense)              (None, 64)                8256      
_________________________________________________________________
dense_3 (Dense)              (None, 32)                2080      
=================================================================
Total params: 110,816
Trainable params: 110,816
Non-trainable params: 0
_________________________________________________________________

Train the Model

We can now compile the model with Adam optimizer and binary cross entropy loss. Use 20 epochs and a batch size of 256.

# Compile and train and the model

View the Code and Reconstruction

Bring in the code from previous experiment to view the reconstruction and encoding performed by our deep AE for 10 random images.

#Plot the original images, encoded representations and output

Do you notice any improvement over the previous model ? Let's admit it, there is not a huge change , due to the fact that we did not train the model to the point of convergence (for saving some time). Also, we did not use any cross validation techniques. We can improve these models by performing following tasks:

Level Up - Optional

Train both (simple and deep) AEs to 100 epochs and compare the results
Apply k-fold cross validation with deep AE (highly recommended for avoiding overfitting in deep networks) and check for any improvements.
Repeat the simple and deep AE labs with MNIST dataset (available in Keras).
Try this experiment with a high resolution (very high dimensionality) dataset. Caution: The training time may reach upto hours for a large dataset (or even days) - Thats where GPU/cloud computing comes into play.

Summary

In this lab, we created a deep Autoencoder following the similar approach and dataset from our previous lab. We developed 3 layer encoder and decoder functions in keras and trained the network for 20 epochs. Next, we shall look into an AE architecture which is highly suitable for Image data - The Convolutional Auto-Encoder.

learn-co-students / dsc-4-45-04-building-deep-autoencoder-lab-online-ds-sp-000 Goto Github PK

dsc-4-45-04-building-deep-autoencoder-lab-online-ds-sp-000's Introduction

Building a Deep Autoencoder - Lab

Introduction

Objectives

Deep Autoencoders

Import the code for reading + preprocessing fashion-MNIST dataset

Build the Deep Autoencoder

Extract the Encoder

Train the Model

View the Code and Reconstruction

Level Up - Optional

Summary

dsc-4-45-04-building-deep-autoencoder-lab-online-ds-sp-000's People

Contributors

Watchers

Forkers

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent