transfer-learning-results

All experiments reported here were decoded with a language model compiled into a trie with KenLM.

The language models are pruned trigram models: bigrams and trigrams that occur only once in the training text are removed. The training text comes from Wikipedia dumps collected by Fran Tyers.
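To make the pruning rule concrete, here is a minimal sketch of singleton pruning on raw n-gram counts. It is only an illustration: the function names are hypothetical, it works on raw counts with no smoothing, and it is not how KenLM implements pruning internally.

```python
from collections import Counter
from typing import Dict, Iterable, List


def count_ngrams(sentences: Iterable[List[str]], max_order: int = 3) -> Dict[int, Counter]:
    """Count all n-grams of order 1..max_order in tokenized sentences."""
    counts = {n: Counter() for n in range(1, max_order + 1)}
    for tokens in sentences:
        for n in range(1, max_order + 1):
            for i in range(len(tokens) - n + 1):
                counts[n][tuple(tokens[i:i + n])] += 1
    return counts


def prune_singletons(counts: Dict[int, Counter]) -> Dict[int, Counter]:
    """Drop bigrams and trigrams seen only once; unigrams are always kept."""
    pruned = {}
    for order, table in counts.items():
        if order == 1:
            pruned[order] = Counter(table)
        else:
            pruned[order] = Counter({g: c for g, c in table.items() if c > 1})
    return pruned


# Toy corpus (hypothetical data, for illustration only).
corpus = [["a", "b", "c"], ["a", "b", "d"]]
pruned = prune_singletons(count_ngrams(corpus))
print(pruned[2])  # only the bigram ("a", "b") survives
print(pruned[3])  # every trigram was a singleton, so none survive
```

Pruning singletons mainly shrinks the model: n-grams seen only once usually make up a large share of the higher-order entries.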

The models labeled scratch are baselines with no transfer learning; they are trained "from scratch".

All other results come from models that were bootstrapped from the English checkpoints of the v0.3.0 release of DeepSpeech.

There were 5 transfer learning experiments per language. Each one removes some number of layers from the English model, reinitializes them, and trains them on the target language.

In these experiments, only the newly reinitialized layers are updated; the copied English layers stay frozen and act as a fixed feature extractor.
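The freeze-and-reinitialize scheme can be sketched in a few lines of framework-free Python. This is a toy illustration under stated assumptions, not the DeepSpeech training code: the `Layer` class, `prepare_transfer`, and `sgd_step` are hypothetical names, the weights are short lists of floats, and layer shapes are kept unchanged for simplicity.

```python
import random
from dataclasses import dataclass
from typing import List


@dataclass
class Layer:
    name: str
    weights: List[float]
    trainable: bool = True


def prepare_transfer(source_layers: List[Layer], n_removed: int, seed: int = 0) -> List[Layer]:
    """Copy the source model, freezing all but the last `n_removed` layers.

    The last `n_removed` layers get fresh random weights and stay trainable.
    """
    if not 0 <= n_removed <= len(source_layers):
        raise ValueError("n_removed must be between 0 and the number of layers")
    rng = random.Random(seed)
    n_kept = len(source_layers) - n_removed
    target = []
    for i, layer in enumerate(source_layers):
        if i < n_kept:
            # Copied source layer: frozen feature extractor.
            target.append(Layer(layer.name, list(layer.weights), trainable=False))
        else:
            # Removed layer: reinitialized and trained on the target language.
            fresh = [rng.uniform(-0.1, 0.1) for _ in layer.weights]
            target.append(Layer(layer.name, fresh, trainable=True))
    return target


def sgd_step(layers: List[Layer], grads: List[List[float]], lr: float = 0.1) -> None:
    """Apply one gradient step, skipping frozen layers."""
    for layer, grad in zip(layers, grads):
        if not layer.trainable:
            continue
        layer.weights = [w - lr * g for w, g in zip(layer.weights, grad)]


# Toy four-layer "English" model (hypothetical sizes and values).
english = [Layer(f"layer_{i}", [1.0, 2.0]) for i in range(4)]
model = prepare_transfer(english, n_removed=2)
sgd_step(model, grads=[[1.0, 1.0]] * 4)
print([layer.trainable for layer in model])  # [False, False, True, True]
```

In this sketch the frozen layers still take part in the forward pass, but the update step skips them, which is what makes them behave like a fixed feature extractor.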

The integers in the filenames indicate how many layers of the English model were removed, counting back from the last layer. For example, ca/RESULTS.json.ca.2 contains the decoding output of a model trained on Catalan, in which the last two layers of the English model were reinitialized and trained on Catalan training data.
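When processing many result files, the naming convention can be unpacked with a small helper. The sketch below assumes every file follows the `RESULTS.json.<language>.<suffix>` pattern of the example above; `ResultFile` and `parse_result_path` are hypothetical names, and a non-integer suffix is returned as `None` because its exact form is not specified here.

```python
from pathlib import PurePosixPath
from typing import NamedTuple, Optional


class ResultFile(NamedTuple):
    language: str
    layers_removed: Optional[int]  # None when the suffix is not an integer


def parse_result_path(path: str) -> ResultFile:
    """Split a path such as 'ca/RESULTS.json.ca.2' into language and layer count."""
    name = PurePosixPath(path).name
    parts = name.split(".")
    # Assumed layout: RESULTS.json.<language>.<suffix>
    if len(parts) != 4 or parts[:2] != ["RESULTS", "json"]:
        raise ValueError(f"unexpected result file name: {name!r}")
    language, suffix = parts[2], parts[3]
    layers_removed = int(suffix) if suffix.isdigit() else None
    return ResultFile(language, layers_removed)


print(parse_result_path("ca/RESULTS.json.ca.2"))
# ResultFile(language='ca', layers_removed=2)
```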
