Coder Social home page Coder Social logo

kaggle_toxicity's Introduction

Notice

I have forked this repo to reupload the docker contains as the old ones were not available anymore.

Kaggle - Toxicity Challenge

This repositoru aims to share some of the code I wrote for the Toxicity Challenge. I did not make it to the first positions, but getting access to the dataset and being able to try it out on AWS GPU instances was worth it.

The datasets, for both training and testing, are not available in this repository, but they can be easily download at the Toxicity data page: datasets

The model built here uses an activation function I have developed myself, which was part of the IBM Watson AI XPRIZE competition. The function has demonstrated to be better than the ReLU function. More about the ReLUs function (as I call it), will soon be available in a separate paper, as there is a hyper parameter that needs to be tuned depending on the network architecture.

Performance

MacBook Pro

On a MacBook Pro, with 16GB, 4 cores, Intel i7, one epoch takes about 30 minutes.

AWS

On a g2.2xlarge GPU Instance, with 15GB and 8 vCPUs, one epoch takes about 3 minutes. Quite impressive!

Running the Model

Running with Docker

The Docker image, which contains the source code, is executed from another poject which is able to spin up a whole AWS environment for the model to run. Since it depends on the NVIDIA Cuda Drivers, it won't be possible to run it locally.

More information on running it with Terraform can be found in the Automated ML repository.

kaggle_toxicity's People

Contributors

wilderrodrigues avatar jdekoning avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.