Comments (2)
Hi @YoojLee ,
Thanks for your insightful discussion.
-
In my opinion, the core of MetaFormer is the repeated MetaFormer blocks. Thus, those models using hierarchical structure, like PVT, Swin and PoolFormer, are regarded as MetaFormer models.
-
For 4-stage hierarchical structure, the four patch embeddings shown in that paper actually can also be called downsampling layers similar to ResNet. Downsampling can also mix tokens, but its main function is to reduce resolution and increase channel numbers. ResNet and PoolFormer have similar hierarchical structures, the better performance of PoolFormer demonstrates the superior of MetaForemer. You may also refer #43.
from poolformer.
Thanks for your reply!
I just want to confirm that what I understand is right. If I get your comment correct, the suggested MetaFormer concept is the mere stack of MetaFormer Block (which consists of normalization, token&channel mixer, and residual connection). Thus, regardless of the extent of inductive bias or whether overall architecture follows a hierarchical structure, the models with repetition of MetaFormer blocks become one of the MetaFormers.
from poolformer.
Related Issues (20)
- How to achieve the grad-CAM visualization? HOT 3
- How to measure MACs? HOT 5
- Aboutu the results graph HOT 3
- Design on positional embedding? HOT 4
- About MLN(Modified Layer Normalization) HOT 3
- s12 model Reproduction experiment HOT 1
- what makes pooling competitive performance or even more than attention? HOT 1
- No module named 'mmcv_ custom.runner.optimizer' HOT 1
- segmentation不使用分布式训练 HOT 1
- Pretrained weights for other versions HOT 8
- About clip_norm HOT 2
- How to check the number of parameters of both object detection and instance segmentation HOT 2
- Object detection training HOT 3
- Welcome update to OpenMMLab 2.0
- Some confusion about random mixing HOT 5
- PoolFormer pretrained using MAE HOT 1
- How can your poolformer model be applied to semantic segmentation tasks? HOT 1
- 你好,我有一些结构的问题 HOT 3
- PoolFormer for Segmentation task
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from poolformer.