Coder Social home page Coder Social logo

Comments (2)

juliendenize avatar juliendenize commented on July 23, 2024

Hello,

I did not provide the training from scratch in the README so I understand the struggle to retrieve these results. I will list here the things I can think off that causes your issue:

  • Fine-tune the whole backbone and do not initialize the classifier as the features are noisy.
  • Increase the number of epochs from 20 to 100 by adding to the command line: trainer.max_epochs=100 (check that it also changes the scheduler params max_epochs value).
  • Vary the learning rate as you did if it is not enough , I tried to look at my logs I found this config for the optimizer, so I hope this is the one and it seems to comply with the one in the paper:
optimizer:
    _target_: eztorch.optimizers.optimizer_factory
    _recursive_: false
    exclude_wd_norm: true
    exclude_wd_bias: true
    name: adamw
    params:
      weight_decay: 0.05
      betas:
      - 0.9
      - 0.999
      eps: 1.0e-08
      amsgrad: false
    batch_size: 128
    initial_lr: 0.0005
    layer_decay_lr: null
    scaler: linear
    scheduler:
      _target_: eztorch.schedulers.scheduler_factory
      _recursive_: false
      name: linear_warmup_cosine_annealing_lr
      params:
        max_epochs: 100
        warmup_epochs: 0
        warmup_start_lr: 5.0e-06
        eta_min: 5.0e-06
      interval: step
    keys_without_decay:
    - pos_embed
    - spatial_cls_token
    - temporal_cls_token
    - time_embed
    - temporal_mask_token

Yes I trained on 2-80GB A100 do you have an issue with that ?

If you can give me more context like your training script maybe I could detect an error but I hope this is already a first step towards resolving your issue.

from eztorch.

juliendenize avatar juliendenize commented on July 23, 2024

Hi @DaniNem , I close the issue as I did not hear back from you but feel free to reopen if I did not solve it with my previous comment 😄.

from eztorch.

Related Issues (3)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.