Coder Social home page Coder Social logo

voxmenthe / conv-activations Goto Github PK

View Code? Open in Web Editor NEW
5.0 2.0 1.0 57 KB

PyTorch Implementation of several convolutional network models with custom activation functions and automated testing of multiple models.

License: GNU General Public License v3.0

Python 100.00%

conv-activations's Introduction

Pytorch Implmementation of Dual Path Networks with Tweaks

This is a complete training example for Dual Path Networks (https://arxiv.org/abs/1707.01629) some other residual architctures. Currently, only CIFAR10 is supported, but I may at some point extend this to various other datasets.

Main contributions of this repo are:

  • Several interesting tweaks to Dual Path Networks, some of which could potentially lead to improved performance. I will be writing these up in a blog post sometime over the coming weeks.
  • Easy testing of different activation functions.
  • Automated training of several models in succession.
  • Complete logging of trained experiment in CSV files, as well as model checkpoints.

Requirements:

  • Python 3.5+
  • PyTorch 0.3
  • Torchvision

How To Run:

python3 main_multi.py --epochs 10

Detailed instructions and more example training scripts can be found in the TRAINING_INSTRUCTIONS.md file along with detailed descriptions of the arguments

Models and Model Configuration

  • All of the main model architectures are stored in the models directory, each in their own separate file.
  • Each model file must be registered in models/__init__.py
  • For the moment, the actual model names are at the bottom of each model file, and must be typed in to the model_list array in main_multi.py, but I will eventually rewrite so that they can simply be selected as an argument during training.

Activation Function Tweaks

  • Several (but not all) of the models support using custom activation functions. The ones that do are noted in TRAINING_INSTRUCTIONS.
  • The file custom_activations.py includes my implementations of a few new activation functions from recent research that are not yet implemented in PyTorch. There are also a few experiments of my own, some of them promising, to be written up in a blog post soon.

Logging and Checkpoints

  • This project includes a reasonably informative progress bar in the terminal window.
  • There is also logging of training data for each model trained, with historical records saved to csv files.
  • The command line arguments that control this are detailed in TRAINING_INSTRUCTIONS.md

Credits

conv-activations's People

Contributors

voxmenthe avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Forkers

salehhariyanti

conv-activations's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.