Coder Social home page Coder Social logo

Pretrained model details about dataset HOT 6 CLOSED

openimages avatar openimages commented on July 22, 2024
Pretrained model details

from dataset.

Comments (6)

ddofer avatar ddofer commented on July 22, 2024 1

Hi, any update on a pretrained model? (Either for V1 or the new V2?). Thanks!

from dataset.

rkrasin avatar rkrasin commented on July 22, 2024 1

The V2 model has been released: #46
Notes on how it was trained: https://storage.googleapis.com/openimages/2017_07/oidv2-resnet_v1_101.readme.txt

-------------------------------
Details on model training:
-------------------------------
The model was trained using the tf-slim image classification model library
available at https://github.com/tensorflow/models/tree/master/research/slim. Vgg
input preprocessing was used with image resolution 299x299. The classification
layer is defined as
  logits, end_points = resnet_v1.resnet_v1_101(images, num_classes=5000)
  logits = tf.squeeze(logits, name='SpatialSqueeze')
  end_points['multi_predictions'] = tf.nn.sigmoid(
      logits, name='multi_predictions')

The model was trained asynchronously with 50 GPU workers and batch size 32 for
61995903 steps. RMSProp optimizer was used with the following settings:
  learning_rate = tf.train.exponential_decay(
      0.045,   # learning_rate
      slim.get_or_create_global_step(),
      552345,  # decay_steps
      0.94,    # learning_rate_decay_factor
      staircase=True
  )

  opt = tf.train.RMSPropOptimizer(
      learning_rate,
      0.9,  # decay
      0.9,  # momentum
      1.0   #rmsprop_epsilon
  )

The training data was formed by merging the machine-generated and human verified
annotations (filtered to the 5000 trainable classes):
- https://storage.googleapis.com/openimages/2017_07/annotations_machine_2017_07.tar.gz
- https://storage.googleapis.com/openimages/2017_07/annotations_human_2017_07.tar.gz
Human verified annotations were used whenever both were present.

from dataset.

rkrasin avatar rkrasin commented on July 22, 2024

@mayankbhagya I don't have a hand in the second release, but I was involved in the first one. My guess would be that all of this info will become available once the pretrained models are released. They are "coming soon", but the last time it took 5 weeks for us to release the models. Let's hope it would be more smooth this time, but I would not hold my breath.

from dataset.

mayankbhagya avatar mayankbhagya commented on July 22, 2024

Thanks @rkrasin
The pretrained model which is currently available is based on v1.
And I'm unable to find any training code or performance details corresponding to the v1 model.
Is that info available?

from dataset.

rkrasin avatar rkrasin commented on July 22, 2024

@mayankbhagya the best description of the training procedure and performance can be found in Learning From Noisy Large-Scale Datasets With Minimimal Supervision by @andreasveit and others.

No training code was released; mostly, because TensorFlow was being actively massaged in the preparation to 1.0 and it was too hard to keep up with their renames. Sorry.

from dataset.

mayankbhagya avatar mayankbhagya commented on July 22, 2024

Many thanks @rkrasin!

from dataset.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.