richarizardd / self-supervised-vit-path Goto Github PK
View Code? Open in Web Editor NEWSelf-Supervised Vision Transformers Learn Visual Concepts in Histopathology (LMRL Workshop, NeurIPS 2021)
License: GNU General Public License v3.0
Self-Supervised Vision Transformers Learn Visual Concepts in Histopathology (LMRL Workshop, NeurIPS 2021)
License: GNU General Public License v3.0
Hi.
Since reading your paper, I'm quite confused about why did you use the loss function like above? or maybe the typos?
(I cannot find your train codes so only way to check is by making issues)
Loss function you used is some weird cause it measures the cross entropy of teacher distribution relative to student distribution and train student network.
Your reference paper DINO use different loss function as below, which measures the cross entropy of student distribution relative to teacher distribution and train student network.
Is there some missing points that I don't know or it's just typos?
Hi,
Thank you for your nice work.
I wander if you could tell me training time consumed and your GPU type.
from pl_bolts.models.self_supervised import resnets
from pl_bolts.utils.semi_supervised import Identity
The two libs can't be found in the project.
Thanks for your great work.
In the figure, there is a global pooling layer after transformer encoder in ViT backbone. In the original implementation of dino, only cls token is used for successive processing. I wonder if you use global pooling of all tokens instead of cls token in this step.
Hi @Richarizardd,
Super interesting work!
Would you mind providing me with the conditions/parameters you used for DINO? I am trying to replicate some parts of your paper, and would really appreciate if you could provide the code for DINO that you used. Furthermore, would you have any advice on the set of parameters I should employ when using a ResNet50 as a backbone encoder with DINO? Or have you only experimented with ViTs as the backbone?
Thanks in advance :)
Hi,
Could you share the code about training too?
Thanks
Thanks for your great work!
In the paper, you said you used 1038 TCGA-BRCA for training. The project TCGA-BRCA provides 1062 Diagnostic Slides and 1093 Tissue Slides, and I wonder which type you used.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.