liuxingbin / dbot

[ICLR2024] Exploring Target Representations for Masked Autoencoders

Home Page: https://arxiv.org/abs/2209.03917

License: Apache License 2.0

Languages: Python 98.00%, Shell 2.00%

Topics: dbot, ssl, masked-image-modeling, unsupervised-learning, representation-learning

dbot's Issues

Code for averaged attention distance and SVD

Hi Authors,

Thanks for your work.

Could you also release the code for the property analysis described in the paper (i.e., averaged attention distance and SVD)?
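
For readers who want a starting point while the authors' analysis code is unavailable, here is a minimal sketch of averaged attention distance as it is commonly defined in ViT analyses: for each query patch, weight the spatial distance to every key patch by the attention probability, then average over queries. This is not the authors' implementation. The function names, the square patch grid, the default patch size of 16, and the exclusion of the class token are all assumptions made for illustration, and plain Python lists stand in for tensors.

```python
import math


def patch_distances(grid_size, patch_size=16):
    """Pairwise Euclidean distance (in pixels) between patch positions on a square grid."""
    coords = [(r * patch_size, c * patch_size)
              for r in range(grid_size) for c in range(grid_size)]
    return [[math.hypot(r1 - r2, c1 - c2) for (r2, c2) in coords]
            for (r1, c1) in coords]


def averaged_attention_distance(attn, grid_size, patch_size=16):
    """Attention-weighted distance for one head, averaged over all query patches.

    attn[q][k] is the attention probability from query patch q to key patch k.
    Assumption: class-token rows and columns have already been dropped and
    each row sums to 1.
    """
    dist = patch_distances(grid_size, patch_size)
    n = grid_size * grid_size
    per_query = [sum(attn[q][k] * dist[q][k] for k in range(n)) for q in range(n)]
    return sum(per_query) / n


# A head that attends only to itself has distance 0; a uniform head looks far.
n = 4  # 2 x 2 grid
identity_attn = [[1.0 if q == k else 0.0 for k in range(n)] for q in range(n)]
uniform_attn = [[1.0 / n for _ in range(n)] for _ in range(n)]
local = averaged_attention_distance(identity_attn, grid_size=2, patch_size=1)
spread = averaged_attention_distance(uniform_attn, grid_size=2, patch_size=1)
```

In practice the attention maps would be collected per layer and per head from a trained model, and the per-head values plotted against layer depth.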

the pretrain data for the experiment of data-richer teacher

Hi Authors,
Thank you for your good work!
When you distill from the data-richer teacher, which data do you use: IN1K or IN1K+400M ITp?
Table 8 lists IN1K+400M ITp, so I wonder whether the good performance is due to the richer data rather than the method. I assume the table reports the teacher's data, not the student's. Is that correct?
Thanks!

The settings in downstream tasks

Hi, I noticed that the object detection settings in your paper and in this repo are not consistent.
For the ViT-Base model, the paper uses {cascade-maskrcnn; epoch: 1x; layerdecay: 0.75}, while this repo uses {cascade-maskrcnn; epoch: 3x; layerdecay: 0.65}. Which setting should we use to reproduce your results?
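
To make the difference between the two layerdecay values concrete, here is a minimal sketch of layer-wise learning-rate decay as it is commonly implemented in BEiT/MAE-style fine-tuning code. It assumes that convention (layer 0 is the embedding, the last id is the head, and the scale is `layer_decay ** (num_layers + 1 - layer_id)`); whether this repo's detection configs follow exactly this scheme is not confirmed here, and the function name is hypothetical.

```python
def layerwise_lr_scales(num_layers, layer_decay):
    """Per-layer learning-rate multipliers.

    Index 0 is the patch embedding, indices 1..num_layers are the transformer
    blocks, and index num_layers + 1 is the task head (scale 1.0).
    Assumption: BEiT/MAE-style convention, not verified against this repo.
    """
    return [layer_decay ** (num_layers + 1 - i) for i in range(num_layers + 2)]


# ViT-Base has 12 transformer blocks.
paper_scales = layerwise_lr_scales(12, 0.75)  # value quoted from the paper
repo_scales = layerwise_lr_scales(12, 0.65)   # value quoted from the repo
```

Under this convention the head always trains at the full learning rate, while the embedding layer is scaled by roughly 0.024 with a decay of 0.75 and roughly 0.004 with a decay of 0.65, so the smaller value keeps the early layers much closer to their pretrained weights.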

confused by the paper

You say the teacher is not important, but the best results are obtained with the data-richer teacher, which exceeds the others by a large margin.
You say the best practice is multi-stage self-distillation, but the best results come from a single stage with more epochs and a good teacher.
So I am confused. Is there a point I am missing?

Minimum Hardware Requirements

I am new to self-supervised learning. What is the minimum hardware required to use dBOT? Is it possible to train on my own dataset with a single GPU?

dbot's Issues

Code for averaged attention distance and SVD

the pretrain data for the experiment of data-richer teacher

The settings in downstream tasks

k-NN and Linear Probing

the random teacher 0th performance

confused by the paper

Minimum Hardware Requirements
